Discover Technata Job board

Find your next tech job in Kanata North, Canada’s largest technology park. Then explore endless international opportunities and dream about where your career will take you. With the Country’s largest density of technology companies ranging from promising startups to leading global giants, Kanata North is the place to be if you are serious about a career in tech.

Search

My job alerts

西门子中国研究院大模型强化学习研究员（上海、北京、苏州）

Siemens

Beijing, China

Posted on Jul 31, 2025

Apply now

Job Description

Job ID

472846

Company

Siemens Ltd., China

Organization

Foundational Technologies

Job Family

Research & Development

Experience Level

Early Professional

Full Time / Part Time

Full-time

Contract Type

Fixed Term

We empower our people to stay resilient and relevant in a constantly changing world. We're looking for people who are always searching for creative ways to grow and learn. People who want to make a real impact, now and in the future. Does that sound like you? Then it seems like you'd make a great addition to our vibrant international team. DAI AIX – AI Acceleration and Exploration, is working on the cutting-edge research of Data Analytics and AI with Siemens global technology network, and consulting, co-creation, data driven applications for the end customers. Research Scientist is to do applied research for Industrial AI applications in the team. We are seeking a Reinforcement Learning (RL) Specialist to lead the design, implementation, and optimization of RL-driven systems for post-training of foundation models. The primary focus of this role is advancing our RL capabilities for real-world applications such as industrial control systems and LLM agents. You will develop cutting-edge algorithms, improve post-training efficiency, and deploy scalable RL solutions in industry.

You'll make an impact by

1. Reinforcement learning development for post-training:
- Design and implement state-of-the-art RL algorithms (e.g., PPO, SAC, DQN) for post-training of foundation models like LLMs and time series foundation models.
- Implement distributed RL training pipelines using frameworks like Ray RLlib, Deepspeed, or custom solutions.
- Design and implement benchmark pipelines for model evaluation.
2. Align foundation models like LLMs and time series foundation models with specific areas/tasks through techniques like SFT, RL.
3. Coding & Infrastructure:
- Write production-grade Python code using PyTorch, numpy, and pandas.
- Manage Linux-based clusters for distributed training and deployment.
4. All other support required by the line manager if necessary.

Your defining qualities

Master's or Doctor degree or above in Computer Science, Automation, Mathematics or related.
Self-motivated, good communication skills and good team player.
Ability to handle multiple competing priorities in a fast-paced environment.

The skills you are expected to have:

1~3 years of hands-on RL experience (academic or industry).
Expertise in deep RL algorithms (model-based/model-free) and frameworks (e.g., RLlib, Gymnasium).
Strong Python skills with PyTorch/TensorFlow and proficiency in Linux.
Experience with distributed training (Horovod, DeepSpeed) and cloud platforms (AWS/Azure/Alicloud).
Familiarity with LLM agents or LLM post-training.
Prefer: Background in robotics, control systems, or game AI.
Prefer: Contributions to RL open-source projects or publications at top conferences (NeurIPS, ICML, ICLR, KDD, IROS, etc).

You'll benefit from

Diverse and inclusive culture, doing the work you like with people who appreciate it
Systematic career development platform, various training courses, and online learning resources for you to help you tailor your growth path based on your strengths
15 days+ annual leaves, with additional benefits such as Christmas leave
Generous benefits package, long-term care corporate annuity plan, flexible allocation of commercial insurance, employee stock sharing matching plan for mutual growth, etc

Transform the everyday with us!

At Siemens, we are human enthusiasts with a diverse set of backgrounds, skills, interests, and needs, united in a unique mission to create a better society. We believe in a culture of diversity and inclusion, reflecting a society with various backgrounds, nationalities, expertise, and mindsets. Here, you'll find trust and freedom to excel. Here, you'll find peers, mentors, and savvy people, for co-creating and growing. If you have curiosity, breakthroughs, and creativity, looking for an equal opportunity to grow and unleash your full potential, join us, bring your authentic self, and transform the everyday with us. Explore more here.