Synthetic Data Engineer (AI Data/Training)
Hyphen Connect Limited
Singapore, posted 1mo ago
About the role
<p>We are seeking a talented and innovative Synthetic Data Engineer. In this role, you will design and implement domain-specific synthetic data generation pipelines and maintain the quality of the data that feeds our training loops. Your expertise will directly shape data processing and model training within the organization.</p>
<p><strong>Responsibilities:</strong></p>
<ul>
<li>Design domain-specific synthetic data generation (SDG) pipelines via self-instruct and constitutional prompting.</li>
<li>Implement automated quality scoring and de-duplication systems.</li>
<li>Manage data pipelines that feed directly into supervised fine-tuning (SFT) and direct preference optimization (DPO) training loops.</li>
</ul>
<p><strong>Qualifications:</strong></p>
<ul>
<li>Proven experience building large-scale data pipelines (Airflow, Spark, Ray).</li>
<li>Deep knowledge of prompt engineering for data generation.</li>
<li>Familiarity with dataset distillation and bias mitigation.</li>
</ul>