Back to all jobs
L
Software Engineer Java + Data (PySpark)
lineate
Georgian office; New York1mo ago
About the role
<p></p>
<p></p>
<p><strong>About Lineate</strong></p>
<p>Lineate is a US-based international software development company with over two decades of experience.</p>
<p>From Intelligent Document Processing(IDP) and Agentic RAG systems to scalable cloud architectures, we turn complex ideas into real, measurable results.</p>
<p>We deliver AI-driven custom solutions for FinTech, HealthTech, AdTech, and beyond, empowering businesses to grow smarter, faster, and more efficiently.</p>
<p>Our expertise falls into three main categories:</p>
<ul>
<li>Building Custom AI Solutions: Deploying high-impact, AI-enabled technology utilizing IDP, Agentic RAG.</li>
<li>Cloud and Data Infrastructure: Optimizing business operations with our data management and cloud computing solutions.</li>
<li>Team Augmentation: Providing specialized experts in FinTech, AdTech, and HealthTech to integrate seamlessly and accelerate project timelines.</li>
<li>Our goal is not just to build technology, but to build the future operating model for our clients.</li>
</ul>
<p> </p>
<p><strong>Responsibilities</strong></p>
<ul>
<li>Design, develop, and maintain scalable backend services using <strong>Java</strong> and <strong>Python</strong></li>
<li>Build and optimize data processing pipelines and APIs for high-performance applications</li>
<li>Collaborate with cross-functional teams to deliver reliable and efficient solutions</li>
<li>Improve system performance, scalability, and reliability</li>
<li>Work with large datasets to support search, recommendation, or ML-driven features</li>
<li>Contribute to architecture decisions and technical design</li>
<li>Write clean, maintainable, and well-documented code</li>
</ul>
<p> </p>
<p><strong>Requirements (Must-have)</strong></p>
<ul>
<li>6+ years of commercial software development experience</li>
<li>Strong hands-on experience with <strong>both Java and Python</strong> (primarily <strong>PySpark</strong> code)</li>
<li>Experience in designing, developing, and optimizing scalable data processing pipelines and backend APIs for <strong>high-performance applications</strong></li>
<li>Solid understanding of backend development principles and system design</li>
<li>Experience working with APIs, <strong>microservices</strong>, and distributed systems</li>
</ul>
<p> </p>
<p><strong>Nice-to-have</strong></p>
<ul>
<li>Databricks OR AWS EMR OR Hadoop</li>
<li><strong>Search technologies experience</strong>, such as:</li>
</ul>
<p>Lexical search (e.g., Solr, Elasticsearch)</p>
<p>Semantic search, vector search, or RAG-based systems</p>
<p>Search relevance tuning and optimization</p>
<ul>
<li><strong>Machine Learning experience</strong>, especially in:</li>
</ul>
<p>Recommendation systems</p>
<p>User behavior prediction (e.g., click-through rate prediction, relevance estimation)</p>
<p>Practical ML application in production systems</p>
<p> </p>
<p><strong>We offer:</strong></p>
<ul>
<li>B2B contract with our US office</li>
<li>NY working hours (at least 6 hours overlap)</li>
</ul>
<p></p>
731,000+ hidden jobs like this
lineate and thousands of companies post here first — often days before LinkedIn or Indeed. Your first 5 applications are free; go Pro to apply without limits.
Everything Pro unlocks:
- Unlimited applications — free stops at 5
- Track every application in one place
- Apply straight to the source, one click
- Save & organize roles you love
- Roles pulled from company boards before the big sites