Recruitment.bg is a boutique IT recruitment company based in Bulgaria. We aim to work with the top employers in the industry: companies that we thoroughly vet and trust.
Our mission is to guide IT professionals toward improved career paths by understanding their skills, crafting employment strategies, and supporting them every step of the way.
We place emphasis on honesty, respect, and reliability, and by ‘going the extra mile’ to deliver exceptional service, we build long-term relationships with the people and organizations we work with.
For one of our trusted clients, we are looking for a Senior AI/ML Engineer.
About the Company
Our client is a next-generation tech company specializing in online gaming products, offering a portfolio that includes Casino Games, Sportsbook, and an all-in-one Gambling Platform.
The company is part of a globally recognized gaming group headquartered in Sofia, Bulgaria, with operations across 25 countries and installations in over 85 jurisdictions spanning Europe, Asia, Africa, and the Americas.
To support its rapid growth and commitment to innovation, our client is expanding its Platform & Payments Department. They are now seeking a talented Senior AI/ML Engineer to enhance their generative AI capabilities.
Your Key Responsibilities
Design and deploy scalable infrastructure for training and serving large language models (LLMs).
Optimize model serving pipelines to ensure low-latency inference and efficient resource utilization.
Develop APIs and integrate advanced AI functionalities into the existing software ecosystem.
Apply advanced NLP methodologies such as few-shot learning, prompt engineering, and Retrieval-Augmented Generation (RAG).
Fine-tune pre-trained models to align with specific business domains and use cases.
Establish evaluation frameworks to monitor model performance and output quality.
Collaborate with cross-functional teams to identify opportunities for AI-driven innovation.
Who We Are Looking For
Bachelor’s or Master’s degree in Computer Science, Machine Learning, or a related field.
Over 5 years of professional experience in machine learning, with a specialization in NLP and deep learning.
Expertise in frameworks like PyTorch or TensorFlow, and familiarity with Hugging Face Transformers.
Advanced proficiency in Python and experience with MLOps tools such as MLflow or Kubeflow.
Proven track record in optimizing model inference (e.g., quantization, distillation, ONNX Runtime).
Hands-on experience with distributed training and large-scale model serving architectures.
Knowledge of vector databases and embedding techniques.
Strong grasp of software engineering practices including version control, CI/CD, and containerization.
Preferred Qualifications
Experience working with transformer architectures and attention mechanisms.
Familiarity with reinforcement learning techniques (e.g., Reinforcement Learning from Human Feedback – RLHF).
Proficiency in cloud-based ML tools (AWS, GCP, or Azure).
Knowledge of AI ethics, bias mitigation, and responsible AI practices.
Contributions to open-source ML projects or research publications.
Technical Stack
Languages: Python, C++
Frameworks: PyTorch, TensorFlow, Hugging Face Transformers
Infrastructure: Docker, Kubernetes
Cloud Platforms: AWS SageMaker, Azure ML, Google Cloud AI