- Seniority
- Senior
About the role
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]"><strong>About Vetto</strong></p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">Vetto is a tech company focused on building and scaling high-quality datasets for artificial intelligence systems. We work at the intersection of human expertise and AI, ensuring that models are trained on technically accurate, well-defined, and realistic data.</p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">Our projects support the training and evaluation of Large Language Models (LLMs), where technical rigor and correctness are non-negotiable.</p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]"><strong>About the Project</strong></p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">This project focuses on the technical review and validation of coding tasks used to train AI models.<br>Code is generated automatically in response to software engineering prompts, and your role is to evaluate whether that code is correct and truly solves what was asked.</p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">The core questions you will be answering on every task:</p>
<ul class="[li_&]:mb-0 [li_&]:mt-1 [li_&]:gap-1 [&:not(:last-child)_ul]:pb-1 [&:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3">
<li class="whitespace-normal break-words pl-2">Is the coding task technically well-defined?</li>
<li class="whitespace-normal break-words pl-2">Does the generated code actually solve the problem?</li>
<li class="whitespace-normal break-words pl-2">Are the associated tests robust, correct, and aligned with real-world software behavior?</li>
</ul>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">Tests are treated as the mechanism of truth in this context. Because mistakes here propagate at scale into AI systems, error criticality is high.</p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]"><strong>Languages</strong></p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">Tasks in this project involve code written across multiple languages. You will be expected to review and evaluate tasks in any of the following: <strong>Python, JavaScript / TypeScript, Go, Rust, and Java.</strong></p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">Strong command of at least two of these languages is required. Breadth across languages is a plus.</p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]"><strong>Responsibilities</strong></p>
<ul class="[li_&]:mb-0 [li_&]:mt-1 [li_&]:gap-1 [&:not(:last-child)_ul]:pb-1 [&:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3">
<li class="whitespace-normal break-words pl-2">Review and analyze generated code against the original software engineering prompt</li>
<li class="whitespace-normal break-words pl-2">Evaluate whether the coding task itself is clearly and correctly defined</li>
<li class="whitespace-normal break-words pl-2">Validate that tests accurately reflect whether the problem has been solved</li>
<li class="whitespace-normal break-words pl-2">Identify gaps, ambiguities, false positives, and false negatives in test suites</li>
<li class="whitespace-normal break-words pl-2">Determine whether a solution that passes the tests genuinely solves the underlying problem</li>
<li class="whitespace-normal break-words pl-2">Apply strict technical criteria and quality standards consistently across tasks</li>
</ul>
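<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">As an illustration of a test suite with a false positive, here is a minimal Python sketch. The median task, the function names, and the test helpers are all hypothetical and are not taken from the project; they only show how a solution can pass a weak test without solving the underlying problem.</p>

```python
# Hypothetical task: "return the median of a non-empty list of numbers".

def median_buggy(values):
    """Hypothetical generated solution: wrong for even-length input."""
    ordered = sorted(values)
    return ordered[len(ordered) // 2]


def median_correct(values):
    """Reference solution: averages the two middle values when needed."""
    ordered = sorted(values)
    mid = len(ordered) // 2
    if len(ordered) % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


def weak_test(median):
    """Covers only odd-length input, so the buggy solution also passes."""
    return median([3, 1, 2]) == 2


def robust_test(median):
    """Adds an even-length case, which exposes the buggy solution."""
    return median([3, 1, 2]) == 2 and median([4, 1, 3, 2]) == 2.5


if __name__ == "__main__":
    for name, fn in [("buggy", median_buggy), ("correct", median_correct)]:
        print(name, "weak:", weak_test(fn), "robust:", robust_test(fn))
```

<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">In this sketch the weak test accepts both implementations, while the robust test rejects the buggy one. Spotting that kind of gap is the judgment the responsibilities above describe.</p>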
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]"><strong>Required Profile</strong></p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">This role is designed for mid/senior-level software engineers with real professional experience.</p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]"><em>Technical requirements</em></p>
<ul class="[li_&]:mb-0 [li_&]:mt-1 [li_&]:gap-1 [&:not(:last-child)_ul]:pb-1 [&:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3">
<li class="whitespace-normal break-words pl-2">Proven professional experience in software development (production environments)</li>
<li class="whitespace-normal break-words pl-2">Strong command of at least two of the listed languages</li>
<li class="whitespace-normal break-words pl-2">Experience reviewing and evaluating code written by other engineers</li>
<li class="whitespace-normal break-words pl-2">Solid understanding of automated testing — how tests validate (or fail to validate) behavior</li>
<li class="whitespace-normal break-words pl-2">Experience contributing to or working with open source projects</li>
<li class="whitespace-normal break-words pl-2">High attention to detail and strong technical judgment</li>
<li class="whitespace-normal break-words pl-2">Comfortable working fully in English (reading and writing)</li>
</ul>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]"><em>Nice to have</em></p>
<ul class="[li_&]:mb-0 [li_&]:mt-1 [li_&]:gap-1 [&:not(:last-child)_ul]:pb-1 [&:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3">
<li class="whitespace-normal break-words pl-2">Experience with test-driven development (TDD) or test design</li>
<li class="whitespace-normal break-words pl-2">Familiarity with large or complex codebases</li>
<li class="whitespace-normal break-words pl-2">Background in AI, ML, or data-centric projects</li>
</ul>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]"><strong>Project Details</strong></p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">This is expert, task-based technical work focused on analysis, validation, and judgment — not code production. Each task takes approximately 30 minutes. Tasks are reviewed under continuous QA and calibration.</p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">Compensation is in the range of $100 per hour (task-equivalent reference), varying based on task complexity and approved volume.</p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]"><strong>Selection Process</strong></p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">The selection process is fully asynchronous and based on your application. There are no traditional interviews: we evaluate candidates through their background, screening responses, and a short technical exercise focused on code review and test validation.</p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]"><strong>Final Note</strong></p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">This role is not about writing more code. It is about technical judgment, rigor, and responsibility.</p>
<p class="font-claude-response-body break-words whitespace-normal leading-[1.7]">If you are comfortable challenging problem definitions, questioning tests that "pass but are wrong", and acting as a technical quality gate, this project is for you!</p>