[Remote] Student Researcher [LLM Post Training – Agent & Reinforcement Learning] - 2026 Start (PhD)

Remote Full-time
Note: This is a remote position open to candidates in the USA.

ByteDance is dedicated to pioneering advanced AI foundation models and is seeking a Student Researcher for its Seed LLM Post Training team. The role involves researching and developing advanced technologies in reinforcement learning and agent capabilities.

Responsibilities

Develop generalized agents capable of solving complex real-world tasks through long-horizon reasoning, memory, and multi-turn interaction.
Tackle the challenges of large-scale reinforcement learning, building systems that can scale across compute, data, and environments to improve model intelligence and alignment with human preferences.
Advance agent capabilities in long-horizon, multi-step reasoning across diverse domains, aiming to match or surpass expert-level performance.
Explore planning, tool use, and feedback mechanisms to enhance agent robustness and adaptability across domains.

Skills

Currently pursuing a PhD in Computer Science, AI, or a related field.
Research experience in reinforcement learning, sequential decision-making, or agent behavior.
First-author publications in top-tier ML/AI conferences (e.g., NeurIPS, ICLR, ICML).
Solid programming and experimentation skills, including with RL or LLM frameworks.
Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment.
Experience with LLM agents, tool use, or prompt-based control.
Familiarity with environments such as WebArena, ALFWorld, or programmatic reasoning tasks.
Understanding of RL techniques such as reward shaping, memory augmentation, or curriculum learning.

Benefits

Interns have day-one access to health insurance, life insurance, wellbeing benefits, and more.
Interns also receive 10 paid holidays per year and paid sick time (56 hours if hired in the first half of the year, 40 hours if hired in the second half).
Interns who are not working 100% remote may also be eligible for a housing allowance.

Company Overview

ByteDance is a technology company that develops content creation platforms and services. It was founded in 2012 and is headquartered in Beijing, China, with a workforce of more than 10,000 employees.

Company H1B Sponsorship

ByteDance has a track record of offering H1B sponsorships: 1350 in 2025, 1123 in 2024, 775 in 2023, 487 in 2022, 417 in 2021, and 245 in 2020. Please note that this does not guarantee sponsorship for this specific role.
