Back to all jobs

- Seniority
- Junior
About the role
<div class="content-intro"><p><strong><span data-contrast="auto">About ProCogia:</span></strong><span data-ccp-props="{"134233117":false,"134233118":false,"335551550":0,"335551620":0,"335557856":16777215,"335559738":0,"335559739":0}"> </span></p>
<div class="elementToProof">We help businesses transform data into real growth!</div>
<div class="elementToProof"> </div>
<div class="elementToProof">Our clients operate in high-stakes, highly regulated industries (such as telecom, financial services, life sciences, and more), where precision, compliance, and measurable outcomes are non-negotiable. We partner with them by embedding expert data science, engineering, and AI talent directly into projects that matter.</div>
<div class="elementToProof">We’re a diverse, close-knit team with a shared goal: delivering top-class, end-to-end data solutions. We don’t just analyse data, we push the boundaries of what’s possible, helping clients unlock new value and insights.</div>
<div class="elementToProof"> </div>
<div class="elementToProof">When you join ProCogia, you’ll find a supportive, growth-driven environment where your ideas are welcomed, and your development is prioritized. We offer competitive salaries, generous benefits and perks for personal and professional development.<span class="Apple-converted-space"> </span></div>
<div class="elementToProof"> </div>
<div class="elementToProof">If you’re ready to unleash your potential and work at the cutting edge of data consulting, we’d love to meet you!</div>
<p><span data-contrast="auto">The core of our culture is maintaining a high level of cultural equality throughout the company. Our diversity and differences allow us to create innovative and effective data solutions for our clients.</span><span data-ccp-props="{"134233117":false,"134233118":false,"335551550":0,"335551620":0,"335557856":16777215,"335559738":0,"335559739":0}"> </span></p>
<p>Our Core Values: Trust, Growth, Innovation, Excellence, and Ownership</p></div><p><strong><span data-contrast="none">Job Title:</span><span data-contrast="none"> LLM Research Intern</span><span data-ccp-props="{}"> </span></strong></p>
<p><strong><span data-contrast="none">Job Summary</span></strong><span data-ccp-props="{}"> </span></p>
<p><span data-contrast="auto">Reporting directly to the AI Solutions Specialist, ProCogia is looking for a curious and driven </span><strong><span data-contrast="auto">LLM Research Intern</span></strong><span data-contrast="auto"> to join our team in Vancouver, BC. In this role, you will work alongside our data science and AI engineering teams to research, experiment with, and evaluate large language models for real-world client applications. You'll have the opportunity to dig into the latest developments in generative AI, from prompt engineering and fine-tuning to retrieval-augmented generation (RAG) and model evaluation, while contributing to meaningful projects that push the boundaries of what's possible with LLMs.</span><span data-ccp-props="{"335551550":0,"335551620":0}"> </span></p>
<p><span data-contrast="auto">This is a hands-on internship designed for someone who is passionate about NLP and AI research, eager to bridge the gap between cutting-edge academic work and practical implementation, and excited to grow in a collaborative, fast-paced consulting environment.</span><span data-ccp-props="{"335551550":0,"335551620":0}"> </span></p>
<p><span data-contrast="auto">This is an in-person role, 40 hours/week, based in our beautiful downtown Vancouver office. The internship is from June-August, with the possibility to extend until December based on performance and business needs. </span><span data-ccp-props="{"335551550":0,"335551620":0}"> </span></p>
<p><strong><span data-contrast="none">Responsibilities</span></strong><span data-ccp-props="{}"> </span></p>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="1" data-aria-level="1"><span data-contrast="none">Assess client-specific data assets and determine the appropriate adaptation strategy — continued pretraining, supervised fine-tuning, or a combination — based on the domain, data volume, and use case requirements</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="2" data-aria-level="1"><span data-contrast="none">Curate, clean, structure, and prepare domain-specific datasets from raw client data for use in model training pipelines</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="3" data-aria-level="1"><span data-contrast="none">Fine-tune large language models in the 70B–100B+ parameter range using techniques such as LoRA, QLoRA, and multi-adapter patterns</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="4" data-aria-level="1"><span data-contrast="none">Perform continued pretraining on open-weight models (Qwen, Llama, and related ecosystems) to embed domain knowledge directly into model weights</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="5" data-aria-level="1"><span data-contrast="none">Manage distributed training workflows across multi-node GPU clusters</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="6" data-aria-level="1"><span data-contrast="none">Design and execute evaluation frameworks to validate domain adaptation quality, factual grounding, and model behavior</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="7" data-aria-level="1"><span data-contrast="none">Support RAG system development where applicable, including vector database integration, chunking strategies, and reranking pipelines</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="1" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="8" data-aria-level="1"><span data-contrast="none">Contribute to inference optimization and deployment pipeline integration</span><span data-ccp-props="{}"> </span></li>
</ul>
<p><strong><span data-contrast="none">Required Qualifications</span></strong><span data-ccp-props="{}"> </span></p>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="2" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="1" data-aria-level="1"><span data-contrast="none">Currently enrolled in or recently completed a Bachelor's, Master's, or PhD program in Computer Science, Machine Learning, or a related field</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="2" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="2" data-aria-level="1"><span data-contrast="none">Demonstrated hands-on experience fine-tuning large language models, supported by concrete project work, research, or open-source contributions</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="2" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="3" data-aria-level="1"><span data-contrast="none">Experience with frontier-scale models (100B+ parameters) or distributed training across multi-node GPU clusters</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="2" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="4" data-aria-level="1"><span data-contrast="none">Familiarity with parameter-efficient fine-tuning methods (LoRA, QLoRA) and open-weight model architectures</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="2" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="5" data-aria-level="1"><span data-contrast="none">Experience with data curation and preparation workflows for LLM training, including cleaning, formatting, deduplication, and quality filtering</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="2" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="6" data-aria-level="1"><span data-contrast="none">Proficiency in Python-based ML frameworks such as PyTorch, HuggingFace Transformers, DeepSpeed, or FSDP</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="2" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="7" data-aria-level="1"><span data-contrast="none">Understanding of training compute, memory constraints, and inference trade-offs at scale</span><span data-ccp-props="{}"> </span></li>
</ul>
<p><strong><span data-contrast="none">Nice to Have</span></strong><span data-ccp-props="{}"> </span></p>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="3" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="1" data-aria-level="1"><span data-contrast="none">Familiarity with RAG architectures or production inference serving frameworks (vLLM, TGI, TensorRT-LLM)</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="3" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="2" data-aria-level="1"><span data-contrast="none">Experience in low-resource or multilingual NLP settings</span><span data-ccp-props="{}"> </span></li>
</ul>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="3" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-aria-posinset="3" data-aria-level="1"><span data-contrast="none">Relevant publications, open-source contributions, or documented projects involving LLM training.</span><span data-ccp-props="{}"> </span></li>
</ul>
<p><strong><span data-contrast="none">Compensation</span></strong><span data-ccp-props="{"335559685":0}"> </span></p>
<ul>
<li data-leveltext="" data-font="Symbol" data-listid="4" data-list-defn-props="{"335552541":1,"335559683":0,"335559684":-2,"335559685":720,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"hybridMultilevel"}" data-aria-posinset="1" data-aria-level="1"><span data-contrast="none">$23/hour</span><span data-ccp-props="{}"> </span></li>
</ul>
<p><span data-contrast="auto">ProCogia is proud to be an equal-opportunity employer. We are committed to creating a </span><span data-contrast="auto">diverse and inclusive workspace. All qualified applicants will receive consideration for </span> <span data-contrast="auto">employment without regard to race, national origin, gender, gender identity, sexual </span> <span data-contrast="auto">orientation, protected veteran status, disability, age, or other legally protected status.</span><span data-ccp-props="{}"> </span></p><div class="content-conclusion"><p><span class="TextRun SCXW66097629 BCX0" lang="EN-CA" data-contrast="auto"><span class="NormalTextRun SCXW66097629 BCX0">ProCogia</span><span class="NormalTextRun SCXW66097629 BCX0"> is proud to be an equal-opportunity employer. We are committed to creating a diverse and inclusive workspace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.</span></span><span class="EOP SCXW66097629 BCX0" data-ccp-props="{}"> </span></p></div>
731,000+ hidden jobs like this
ProCogia and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites