Download
(compact version)

I'm a Middle ML engineer with experience in data science (analytics) and software engineering. I specialize in designing/implementing/refining/deploying ML systems, data-related problems (collection, processing, transforming, modeling, visualization, analysis, interpreting), and developing various automation software.

This page provides a more detailed description of my professional skills, as opposed to the minimal version of the CV provided in the link above. For my freelance services and projects, check out the Freelance page.


👨🏻‍💻 WORK EXPERIENCE

Machine Learning Engineer (LLMs & multimodal learning) @ NPmatic GmbH 🤖
calendar_logoJul 2025 — Nov 2025      location_logoBerlin, Germany (remote)

Designed and delivered production-ready components that combined modern LLM engineering with multimodal learning for a Russian-speaking fintech & enterprise analytics startup team based in the EU; moving from Data Scientist responsibilities toward ML Engineering (model adaptation, inference optimization, MLOps). NPmatic focuses on B2B, building LLM systems for reasoning, retrieval, large-scale data analysis and business process automation.

  • Designed and implemented hybrid LLM + multimodal architectures (text + vision/audio encoders as perception frontends) to enable structured reasoning and grounded generation in product features, mostly for analyzing complicated reports & live presentations in order to provide automated question answering. Integrated CLIP into production pipelines.
  • Built E2E RAG pipelines for corporate analytics: data ingestion workflows, chunking, embeddings generation, tuning & indexing into a vector store, vector search (OpenSearch), and reranking with cross-encoders & cache layers for low-latency contextual vector retrieval with fast access. Implemented hybrid retrieval strategies (vector + keyword) using OpenSearch and FAISS to balance recall and precision. Passed the extracted relevant passages to an LLM module in order to integrate the pipeline with LLM inference for factual grounding and tool-calling, enabling faster document summarization.
  • Fine-tuned some LLMs & multimodal backbones to project-specific needs using PEFT approaches (LoRA/QLoRA), synthetic instruction and data augmentation, adapting them to financial domain while keeping training cost practical, which basically reduced hallucinations. Applied knowledge distillation to obtain smaller models for cost-efficient inference and reduced dependency on closed APIs.
  • Implemented latency- and cost-reduction techniques for model inference optimization in production: request batching & shaping, operator fusion, dynamic truncation of long inputs and weight quantization improving throughput and Time-to-First-Token, which reduced memory footprint, per-request cost and accelerated response generation with minimal quality degradation, so we met strict enterprise latency SLAs. Applied INT8/INT4 quantization and optimized GPU utilization for cloud-based inference.
  • Implemented model serving and MLOps practices for rapid response to data shifts and safe model updates without downtime: deployed containerized services and established CI/CD pipelines for automated delivery of model and data updates; implemented model versioning and quality monitoring (drift detectors, regression tests, metrics using LLM-as-judge, offline benchmarks) for stable production behavior and fast iteration. Monitored token usage, latency, failure modes and quality drift in production.
  • Deployed models and services as scalable microservices on cloud infrastructure (Docker + Kubernetes, AWS), defined API contracts for frontend/backend integration and participated in PoC > MVP cycles: building APIs, demoing prototypes to clients and integrating feedback iteratively (so we accelerated feature delivery).
  • Developed reusable components that combined LLM reasoning with vision/audio modules. These components were reused across multiple product features and halved the development time for new scenarios. In particular, applying this solution shortened prototyping cycles, improved contextual accuracy in target features and sped up generation of analytic reports from corporate data (tasks that previously took hours or days now completed in minutes). Integrated agentic workflows (tool-calling, function execution, planners) using LlamaIndex–style abstractions with custom orchestration logic for multi-step reasoning tasks.

Data Scientist & Python Developer @ Freelance 🥥
calendar_logoJun 2024 — Nov 2025      location_logoVarious places

Long-term full-time freelancing & seeking for a Middle ML Engineer position.

  • Designed/implemented/refined/deployed ML systems.
  • Handled data-related problems: collection, processing, transforming, modeling, visualization, analysis, etc.
  • Developed various automation software.

Middle Data Analyst @ Remokate ✈️
calendar_logoJul 2023 — May 2024      location_logoTbilisi, Georgia (remote)

Worked on solutions that impacted product and business outcomes for clients in fintech, retail, SaaS and large-scale product companies. Remokate is a recruitment agency that connects top IT talent with leading companies in Russia and abroad, including startups, large corporations and international relocation opportunities.

  • Designed, monitored and interpreted key business and product metrics to identify growth opportunities and bottlenecks across customer funnels.
  • Wrote and optimized complex SQL queries for large datasets, improving data processing speed and accuracy.
  • Built and maintained interactive dashboards in BI tools (DataLens, Tableau), enabling stakeholders to track KPIs, detect anomalies and make data-informed decisions.
  • Conducted hypothesis testing and A/B experiments to validate product changes and measure their impact on core metrics.
  • Performed cohort, funnel and factor analysis to understand user behavior, retention and conversion trends.

Python Developer & Data Analyst 🌱
calendar_logoDec 2021 — Jul 2023      location_logoRussia (remote)

Various occasional short-term remote jobs and freelance gigs to get some experience (mostly freelance.habr.com).

  • Built Python software: automation scripts, web scrapers and Telegram bots.
  • Gathered data and performed data processing.
  • Managed PostgreSQL databases for collected data.
  • Implemented simple data pipelines.
  • Designed and developed KPI dashboards and other analytical tools.

Web Developer 🐣
calendar_logoNov 2019 — Dec 2021      location_logoRussia (remote)

Freelance gigs on freelance.habr.com outside of university classes.

  • Designed landing pages, blogs and promotional web pages.
  • Implemented frontend using React.


🛠️ TECHNICAL SKILLS

My key skills:

🏗️ Software engineering: building, debugging, testing, deploying
🤹‍♂️ Data wrangling: collecting, processing, transforming
🔍︎ Data analysis: statistical inference & visualization
🤖 Traditional ML algorithms for supervised & unsupervised learning
🧠 Deep leaning: designing, implementing, fine-tuning models
☁️ Cloud infrastructure & big data processing
🛢 Databases

Technologies and tools I use (clickable):

python-logosql-logocpp-logojavascript-logotypescript-logographql-logotensorflow-logopytorch-logokeras-logoopencv-logosklearn-logoapache-spark-logopandas-logonumpy-logomatplotlib-logoplotly-logoseaborn-logoscipy-logoreact-logodjango-logoxgboost-logopostgresql-logosqlite-logobeautifulsoup4-logoscrapy-logostreamlit-logoflask-logogatsby-logojekyll-logonodejs-logoexpressjs-logohtml-logocss-logodocker-logoamazon-web-services-logonpm-logowebpack-logoyarn-logopoetry-logogoogle-colab-logogit-logoopenssl-logoarch-linux-logobash-logovim-logoalacritty-logovisualstudio-logopycharm-logogooglesheets-logolatex-logo

🎓 EDUCATION

Academic

🎓 B.S. in Applied Mathematics & Computer Science @ Novosibirsk State Technical University, Novosibirsk; major: "Computer Modeling and Information Technologies" (2019-2023), minor: "Statistics and Data Analysis" (2022-2023)
📜 Thesis topic: "Research of state-of-the-art neural network architectures for scene text recognition problems" (paper); research supervisor: Popov A.A.

Courses




💡 PROJECTS

My projects are organized into category pages:

I've also contributed to many other open-source and closed-source projects. In addition, I've tackled several competitive Kaggle projects and have the Kaggle Expert rank.


📝 PUBLICATIONS

Blog articles

The most compelling blog posts will be featured here soon.

Courses

I'm currently actively building & maintaining a free DS/ML course along with several paid mini-courses.

Academic research

MobileEAST: a lightweight & fast scene text detection based on EAST architecture and MobileNet layers (repository, 2023)


🤝 SOFT SKILLS

My main strengths lie in analytical thinking (i.e. I love to do research and analyze everything) and strategizing. As a traveler, I'm adaptable and flexible. I used to work with a flexible schedule, so I can self-organize myself easily. Moreover, I'm curious, goal-oriented and thoughtful about project details: I learn new things easily and know how to search for information. I'm also good at explaining complex things in simple words.


😎 PERSONAL

💪 I'm actively contributing to open-source community
🔒 I'm a cryptography & privacy enthusiast
🐧 I'm a long-time Linux user
🗄️ I'm slowly moving towards complete self-hosting of all my data and apps
⚖️ I'm trying hard to build a career in both full-time and freelance work in order to achieve a balance between flexibility and professional growth
💬 I'm fluent in English (B2+) and Russian (native)


🔗 LINKS

Related profiles

▶️ YouTube
📚 Telegram (tech blog)
🏆 Kaggle
🧩 LeetCode
⚙️ GitHub
🎓 Coursera
🌟 Upwork
Fiverr
Kwork
Habr Freelance

Contact me

📲 Telegram (fastest way): @averett
📩 Email: avrtt@tuta.io
💼 LinkedIn: @avrtt

UPDATED ON FEB 7, 2025