About the Role
We are looking for a highly skilled and motivated AI/ML Engineer to design, develop, and train state-of-the-art Large Language Models (LLMs). The ideal candidate will have a strong background in machine learning, natural language processing (NLP), and deep learning frameworks. This role will involve working closely with our data science team to build and optimize LLMs for various applications, ensuring high performance and scalability
Requirements
Bachelor's or Master's degree in Computer Science, Engineering, Mathematics, or a related field.
4+ years of experience in machine learning, natural language processing, or a related field.
Strong programming skills in Python, with experience in libraries such as TensorFlow, PyTorch, or Keras.
Proven experience in training and fine-tuning Large Language Models (e.g., GPT, BERT, LLAMA, MISTRAL).
Deep understanding of NLP concepts, including tokenization, language modeling, and text classification.
Experience with cloud platforms (AWS, GCP, Azure) for training and deploying models.
Familiarity with data preprocessing techniques and tools for handling large datasets.
About the Company
Deep Nucleus is a dynamic and innovative technology company, specialised in delivering comprehensive IT and marketing solutions.