Senior Software Code Reviewer

WorldwideRemote1w ago

Seniority: Senior

About the role

About Vetto Vetto is a tech company focused on building and scaling high-quality datasets for artificial intelligence systems. We work at the intersection of human expertise and AI, ensuring that models are trained on technically accurate, well-defined, and realistic data. Our projects support the training and evaluation of Large Language Models (LLMs), where technical rigor and correctness are non-negotiable. About the Project This project focuses on the technical review and validation of coding tasks used to train AI models. Automated code is generated in response to software engineering prompts, and your role is to evaluate whether that code is correct and truly solves what was asked. The core questions you will be answering on every task: <ul class="[li_&]:mb-0 [li_&]:mt-1 [li_&]:gap-1 [&:not(:last-child)_ul]:pb-1 [&:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3"> <li class="whitespace-normal break-words pl-2">Is the coding task technically well-defined?</li> <li class="whitespace-normal break-words pl-2">Does the generated code actually solve the problem?</li> <li class="whitespace-normal break-words pl-2">Are the associated tests robust, correct, and aligned with real-world software behavior?</li> </ul> Tests are treated as the mechanism of truth in this context. Mistakes here propagate at scale into AI systems, error criticality is high. Languages Tasks in this project involve code written across multiple languages. You will be expected to review and evaluate tasks in any of the following: Python, JavaScript / TypeScript, Go, Rust, and Java. Strong command of at least two of these languages is required. Breadth across languages is a plus. Responsibilities <ul class="[li_&]:mb-0 [li_&]:mt-1 [li_&]:gap-1 [&:not(:last-child)_ul]:pb-1 [&:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3"> <li class="whitespace-normal break-words pl-2">Review and analyze generated code against the original software engineering prompt</li> <li class="whitespace-normal break-words pl-2">Evaluate whether the coding task itself is clearly and correctly defined</li> <li class="whitespace-normal break-words pl-2">Validate whether tests accurately reflect whether the problem has been solved</li> <li class="whitespace-normal break-words pl-2">Identify gaps, ambiguities, false positives, and false negatives in test suites</li> <li class="whitespace-normal break-words pl-2">Determine whether a solution that passes the tests genuinely solves the underlying problem</li> <li class="whitespace-normal break-words pl-2">Apply strict technical criteria and quality standards consistently across tasks</li> </ul> Required Profile This role is designed for mid/senior-level software engineers with real professional experience. Technical requirements <ul class="[li_&]:mb-0 [li_&]:mt-1 [li_&]:gap-1 [&:not(:last-child)_ul]:pb-1 [&:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3"> <li class="whitespace-normal break-words pl-2">Proven professional experience in software development (production environments)</li> <li class="whitespace-normal break-words pl-2">Strong command of at least two of the listed languages</li> <li class="whitespace-normal break-words pl-2">Experience reviewing and evaluating code written by other engineers</li> <li class="whitespace-normal break-words pl-2">Solid understanding of automated testing — how tests validate (or fail to validate) behavior</li> <li class="whitespace-normal break-words pl-2">Experience contributing to or working with open source projects</li> <li class="whitespace-normal break-words pl-2">High attention to detail and strong technical judgment</li> <li class="whitespace-normal break-words pl-2">Comfortable working fully in English (reading and writing)</li> </ul> Nice to have <ul class="[li_&]:mb-0 [li_&]:mt-1 [li_&]:gap-1 [&:not(:last-child)_ul]:pb-1 [&:not(:last-child)_ol]:pb-1 list-disc flex flex-col gap-1 pl-8 mb-3"> <li class="whitespace-normal break-words pl-2">Experience with test-driven development (TDD) or test design</li> <li class="whitespace-normal break-words pl-2">Familiarity with large or complex codebases</li> <li class="whitespace-normal break-words pl-2">Background in AI, ML, or data-centric projects</li> </ul> Project Details This is expert, task-based technical work focused on analysis, validation, and judgment — not code production. Each task takes approximately 30 minutes. Tasks are reviewed under continuous QA and calibration. Compensation is in the range of $100 per hour (task-equivalent reference), varying based on task complexity and approved volume. Selection Process The selection process is fully asynchronous and based on your application. There are no interviews: we evaluate candidates through their background, screening responses, and a short technical exercise focused on code review and test validation. No traditional interviews required. Final Note This role is not about writing more code. It is about technical judgment, rigor, and responsibility. If you are comfortable challenging problem definitions, questioning tests that "pass but are wrong", and acting as a technical quality gate, this project is for you!

731,000+ hidden jobs like this

vetto and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.

Everything Pro unlocks:

Unlimited applications — free stops at 5
Track every application in one place
Apply straight to the source, one click
Save & organize roles you love
Roles pulled from company boards before the big sites

Weekly

$9.99

$4.99/week

For an active search. Cancel anytime.

Get Weekly

Monthly

$24.99

$12.99/month

The smart pick. Save 35% vs weekly.

Get Monthly

Lifetime

$99

$49.99once

Pay once. Every future feature, forever.

Get Lifetime