We are seeking a highly skilled Senior Data Scientist with expertise in Natural Language Processing (NLP) to join our team. The ideal candidate will have at least 5 years of experience in data science, with a minimum of 2 years specializing in NLP. You will work on various aspects of data science, including machine learning and deep learning, and lead independent research to build and optimize NLP-driven solutions.
We are looking for a self-starter who is passionate about solving complex problems, can drive research initiatives, and thrives in a collaborative team environment.
Key Responsibilities:
- Develop and optimize NLP models for tasks such as text classification, named entity recognition (NER), sentiment analysis, and document summarization.
- Work with large-scale datasets to extract insights and enhance language models.
- Design and implement machine learning pipelines, ensuring scalability and efficiency.
- Fine-tune pretrained models (BERT, GPT, T5, etc.) for domain-specific tasks.
- Apply text preprocessing techniques such as noun chunk extraction, sentence parsing, segmentation, and tokenization in NLP pipelines.
- Conduct deep research on state-of-the-art NLP techniques and integrate innovative methodologies into solutions.
- Deploy and optimize models on cloud platforms such as AWS.
- Collaborate with researchers, engineers, and business stakeholders to drive data-driven decision-making.
Required Qualifications:
- 5+ years of experience in Data Science, with at least 2 years in NLP.
- Proficiency in Python and its ML and data libraries, such as TensorFlow, PyTorch, NumPy, pandas, and spaCy.
- Experience working with AWS or other cloud providers.
- Hands-on experience with machine learning algorithms, including supervised and unsupervised learning.
- Deep understanding of research methodologies and the ability to implement novel approaches.
- Strong analytical and problem-solving skills.
- Ability to work both independently and in a collaborative, team-oriented environment.
Preferred Qualifications:
- Experience with large language models (LLMs) and prompt engineering.
- Understanding of semantic search, knowledge graphs, or information retrieval.
- Familiarity with vector search for efficient similarity retrieval.
- Experience with MongoDB for handling unstructured and semi-structured data.
- Experience with MLOps tools for CI/CD and model monitoring.
- Strong publication record or contributions to NLP research communities.
What We Offer:
- Flexible work environment.
- Opportunities for deep research, innovation, and professional development.
- Access to cutting-edge technology and resources.