Senior Python Systems Developer - Functional Testing Project

Mindrift

Job Overview

Location

Remote

Salary

USD 30 hourly

Employment Type

Contract

Work Arrangement

Remote

Sector

Information Technology & Software

Experience Level

Senior (5-8 years)

About the Company

Mindrift is at the forefront of connecting specialized talent with cutting-edge, project-based AI opportunities. We partner with leading technology companies to enhance, evaluate, and improve their AI systems. Our focus is on providing flexible, project-based collaborations that empower AI specialists to contribute their expertise to innovative projects. Mindrift is powered by Toloka AI, ensuring a robust platform for freelance engagements. We offer a supportive global community and opportunities to work on groundbreaking AI initiatives, allowing professionals to choose their working hours and contribute meaningfully to the advancement of artificial intelligence.

Job Description

Mindrift is seeking a highly skilled Senior Python Systems Developer to join a critical functional testing project focused on AI systems. This role offers a unique opportunity to contribute to the advancement of artificial intelligence by ensuring the quality and reliability of AI models.

As a Senior Python Systems Developer, you will be instrumental in creating and executing comprehensive functional tests for large codebases. You will leverage your expertise in Python, particularly with pytest, to design robust testing strategies. Your responsibilities will include managing Docker environments to guarantee reproducible builds and test executions across various platforms, ensuring the integrity of the testing process.

This project requires a strong command of Linux and Bash scripting, along with the ability to debug effectively within containerized environments. You will also utilize modern Python tooling and LLMs, such as Claude Code and Roo Code, to accelerate development cycles and enhance code quality. The role is project-based and fully remote, offering flexibility in working hours (20-30 hours per week) and a competitive compensation equivalent of up to $30 per hour.

To apply for this role, click the Apply button on this page and follow the instructions.

Required Skills

PythonpytestDockerLinuxBash ScriptingFunctional TestingCode ReviewLLMGit

Key Responsibilities

  • Create functional black box tests for large codebases in various source languages
  • Create and manage Docker environments to ensure 100% reproducible builds and test execution across different platforms
  • Monitor code coverage and configure automated scoring criteria to meet industry benchmark-level standards
  • Leverage LLMs (Roo Code, Claude) to accelerate development cycles, automate repetitive tasks, and improve overall code quality

Qualifications

  • 5+ years of experience as a Software Engineer (primarily Python)
  • Deep experience with pytest (fixtures, session-scoped, timeouts) and designing black-box functional tests for CLI tools
  • Expert-level Docker skills (reproducible Dockerfiles, user contexts, secure workspaces)
  • Strong Linux & Bash scripting skills and comfort debugging inside containers
  • Proficiency with modern Python tooling (uv, pyproject.toml, packaging)
  • Ability to read and understand with LLM many coding languages (for example C, C++, Rust, or Go)
  • Experience using LLMs (Claude Code, Roo Code, Cursor) to accelerate iterative development and test-case generation
  • English language - B2 or higher
  • Prior experience with agent evaluation platforms and MCP CLI

Benefits & Perks

  • Freelance project-based collaboration via the Mindrift platform (powered by Toloka AI)
  • Fully remote and flexible participation — choose when and how much to contribute (20-30 hours per week)
  • Each project has its own compensation level based on scope and expertise required. On this project, AI trainers earn up to $30 per hour equivalent.
  • Opportunity to contribute to innovative AI projects for leading tech companies
  • Supportive global community

How to Apply

This job has expired

The AI industry is experiencing rapid expansion, creating a high demand for specialized talent. This role focuses on enhancing AI system reliability through rigorous functional testing. You will leverage advanced Python scripting, Docker containerization, and Linux environments to build robust testing frameworks. Proficiency in pytest, understanding of CI/CD pipelines, and experience with LLM-assisted code analysis are crucial for success. Your work will directly impact the quality and performance of AI models, contributing to significant improvements in business ROI and product scalability.

Posted Date

April 13, 2026