Back to all jobs
H

Synthetic Data Engineer (AI Data/Training)

Hyphen Connect Limited

Hong Kong1mo ago

About the role

<p>We are seeking a talented and innovative Synthetic Data Engineer. In this role, you will design and implement domain-specific synthetic data generation pipelines, ensuring high-quality data management for training loops. Your expertise will drive the success of data processing and model training within the organization.</p> <p>&nbsp;</p> <p><strong>Responsibilities:</strong></p> <ul> <li>Design domain-specific synthetic data generation (SDG) pipelines via self-instruct and constitutional prompting.</li> <li>Implement automated quality scoring and de-duplication systems.</li> <li>Manage data pipelines that feed directly into SFT and DPO training loops.</li> </ul> <p><strong>Qualifications:</strong></p> <ul> <li>Proven experience building large-scale data pipelines (Airflow, Spark, Ray).</li> <li>Deep knowledge of prompt engineering for data generation.</li> <li>Familiarity with dataset distillation and bias mitigation.</li> </ul>

747,000+ hidden jobs like this

Hyphen Connect Limited and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.

Everything Pro unlocks:

  • Unlimited applications — free stops at 5
  • Track every application in one place
  • Apply straight to the source, one click
  • Save & organize roles you love
  • Roles pulled from company boards before the big sites

Weekly

$9.99
$4.99/week

For an active search. Cancel anytime.

Most popular

Monthly

$24.99
$12.99/month

The smart pick. Save 35% vs weekly.

Lifetime

$99
$49.99once

Pay once. Every future feature, forever.