Recruitment.bg is a boutique IT recruitment company, based in Bulgaria. We aim to work with the top employers in the industry, companies that we thoroughly vet and trust. Our mission is to guide IT professionals toward improved career paths by understanding their skills, crafting employment strategies, and supporting them every step of the way. Placing emphasis on honesty, respect and reliability while delivering exceptional service by ‘going the extra mile’ we build long term relationships with the people and organizations we work with.
About the Company
Our client – a global tech-driven organization with an advanced R&D center in Sofia – is looking for a highly experienced Senior AI/ML Engineer with a strong focus on Generative AI and Large Language Models (LLMs). This is a strategic role that combines cutting-edge machine learning development with scalable, production-ready architecture.
Key Responsibilities:
Design and build high-performance infrastructure for training and serving LLMs.
Develop optimized inference pipelines for low-latency and high-efficiency model serving.
Create robust APIs to integrate AI solutions into core software systems.
Apply advanced NLP methods such as prompt engineering, few-shot learning, and Retrieval-Augmented Generation (RAG).
Fine-tune and adapt pre-trained models to industry-specific use cases.
Build evaluation frameworks to measure model accuracy, bias, and performance.
Collaborate with product, engineering, and research teams to identify and implement AI-driven innovations.
Required Skills & Experience:
Degree in Computer Science, Machine Learning, or a related field.
5+ years of hands-on experience in machine learning, with a focus on NLP and deep learning.
Strong experience with PyTorch or TensorFlow, and tools like Hugging Face Transformers.
Proficiency in Python and familiarity with ML Ops tools (MLflow, Kubeflow, etc.).
Experience in optimizing inference using quantization, distillation, or ONNX.
Background in distributed training and deploying large-scale models.
Knowledge of vector databases and embedding-based retrieval systems.
Familiarity with version control, CI/CD, and containerization (Docker, Kubernetes).
Nice to Have:
Understanding of transformer-based architectures and attention mechanisms.
Experience with reinforcement learning in language model contexts (e.g., RLHF).
Familiarity with prompt engineering and in-context learning methods.
Experience with major cloud platforms: AWS, GCP, Azure.
Awareness of ethical AI practices, fairness, and bias mitigation..
Tech Stack:
Languages: Python, C++ (optimization)
Frameworks: PyTorch, TensorFlow, Hugging Face Transformers
Infrastructure: Docker, Kubernetes, MLflow, SageMaker, Azure ML, Google Cloud AI
Competitive salary and annual performance-based bonuses
Bi-annual performance and salary reviews
25 days paid annual leave
Flexible working hours and hybrid model (2 days remote/week)
Premium health insurance package
Fully covered transportation and sports cards
Team sports events and wellness initiatives
Opportunities for growth through company-sponsored trainings, conferences, and seminars
Participation in cutting-edge AI projects with real-world applications
Referral bonuses and corporate discount programs
If you’re passionate about generative AI and want to work on large-scale, impactful projects, we’d love to hear from you. Apply today and take your AI/ML career to the next level!
All applications will be treated with strict confidentiality.
By enabling them, you help us to develop and deliver better services in the way that's most convenient for you. For information and settings, see our Cookie Policy.