Looking for Expert NLP/ML Engineer for Language Translation Model Training (Indic Languages)

Remote Full-time
Project Description:
I am looking to hire an experienced NLP/ML engineer to train high-quality machine translation models for Indic languages. The goal is to develop single language-pair models, such as:
● English → Telugu
● English → Hindi
(and additional language pairs, if needed)

You may choose the most suitable model architecture based on your expertise (e.g., mBART, mT5, NLLB fine-tuning, or Transformer variants), as long as the final models deliver strong translation quality.

Dataset:
You can use the AI4Bharat datasets, including:
● Samanantar
● BPCC
● Other open Indic parallel corpora

Scope of Work:
The freelancer will be responsible for:

1. Data Handling
● Cleaning, filtering, and preprocessing datasets
● Sentence alignment (if needed)
● Tokenization and vocabulary preparation (SentencePiece, BPE, etc.)

2. Model Training
● Selecting an appropriate model architecture
● Training single language-pair translation models
● Implementing best practices for training efficiency (FP16, gradient accumulation, etc.)
● Hyperparameter tuning
● Checkpoint management and monitoring

3. Evaluation
● Computing BLEU, SacreBLEU, and other relevant metrics
● Providing side-by-side qualitative translation samples
● Benchmarking against baseline models

4. Delivery
● Final trained model weights
● Inference scripts (Python) for quick testing
● Instructions for running and continuing training
● Documentation of the preprocessing and training pipeline
● Optional: Dockerfile or virtual environment setup

Requirements:
The ideal candidate should have:
● Strong experience in NLP, Transformers, and neural MT models
● Prior work with Indic languages (big plus)
● Experience with training libraries such as PyTorch, Hugging Face Transformers, Fairseq, OpenNMT, or similar
● Ability to handle large-scale training and dataset preprocessing
● Familiarity with SentencePiece, tokenization strategies, and MT evaluation metrics
● Ability to deliver clean, well-documented code

Additional Notes:
● Compute resources can be discussed (I can provide compute, or you can use yours).
● More language pairs may be added later as separate follow-up projects.
● Quality of translation is the highest priority.
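
As a rough illustration of the "cleaning, filtering, and preprocessing" item under Data Handling, a minimal Python sketch could look like the following. It is not a required approach: the function name `clean_parallel_pairs`, the character limits, and the length-ratio threshold are all assumptions chosen for the example, and a real pipeline for Samanantar or BPCC would likely add language identification and script checks.

```python
import unicodedata


def clean_parallel_pairs(pairs, min_chars=1, max_chars=500, max_len_ratio=3.0):
    """Yield (source, target) pairs that pass basic sanity filters.

    Hypothetical helper for illustration; all thresholds are assumptions.
    """
    seen = set()
    for src, tgt in pairs:
        # Normalize Unicode to NFC and collapse runs of whitespace.
        src = " ".join(unicodedata.normalize("NFC", src).split())
        tgt = " ".join(unicodedata.normalize("NFC", tgt).split())

        # Drop empty or overly long sides.
        if not (min_chars <= len(src) <= max_chars):
            continue
        if not (min_chars <= len(tgt) <= max_chars):
            continue

        # Drop pairs whose character lengths differ too much (likely misaligned).
        shorter = max(1, min(len(src), len(tgt)))
        if max(len(src), len(tgt)) / shorter > max_len_ratio:
            continue

        # Drop untranslated copies and exact duplicate pairs.
        if src == tgt or (src, tgt) in seen:
            continue

        seen.add((src, tgt))
        yield src, tgt


if __name__ == "__main__":
    sample = [
        ("Hello  world", "नमस्ते दुनिया"),
        ("Hello world", "नमस्ते दुनिया"),  # duplicate once whitespace is collapsed
        ("OK", "OK"),                      # untranslated copy
    ]
    for src, tgt in clean_parallel_pairs(sample):
        print(src, "\t", tgt)
```

Character counts are used here instead of token counts so the sketch runs without a tokenizer; once a SentencePiece model is trained, the same ratio check could be applied to token lengths instead.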
