button_icon
button_icon
button_icon
button_icon
button_icon
button_icon
button_icon
button_icon
button_icon
button_icon



With data analysis skills, I can help you to understand and use data in a way that makes sense. This includes data mining, data wrangling, big data processing, exploratory data analysis (EDA), business analytics, decision-making consulting and other ad hoc data-related tasks.

The following list represents some of my highlighted data analysis projects (excluding open-source contributions and other proprietary or small projects).


♟️superset-supremacy2024   2 days    source_logosource_logosource_logo

Dashboards

Apache Superset dashboard prototype based on strategic marketing project (channel prioritization, budget allocation, ROI forecasting)


demo-image-1

demo-image-2

demo-image-3

♟️ superset-supremacy  source_logosource_logosource_logo  Apache Superset dashboard prototype based on strategic marketing project (channel prioritization, budget allocation, ROI forecasting)

🔁dashbordina2024   3 days    source_logosource_logosource_logo

Dashboards

Marketing analytics dashboard for audience segmentation and multi-channel ROI analysis with ETL, implemented with Dash and Airflow


demo-image-1

🔁 dashbordina  source_logosource_logosource_logo  Marketing analytics dashboard for audience segmentation and multi-channel ROI analysis with ETL, implemented with Dash and Airflow

User activity analytics2024   4 days    source_logosource_logosource_logo

Streaming analytics

End‑to‑end real-time analytics pipeline using PySpark Structured Streaming. The pipeline ingests user activity logs from Kafka, applies multiple levels of transformations and aggregations, and computes both basic and session-level metrics. It includes dynamic anomaly detection using moving averages and standard deviations, and flags unusual spikes in activity.


User activity analytics  source_logosource_logosource_logo  End‑to‑end real-time analytics pipeline using PySpark Structured Streaming. The pipeline ingests user activity logs from Kafka, applies multiple levels of transformations and aggregations, and computes both basic and session-level metrics. It includes dynamic anomaly detection using moving averages and standard deviations, and flags unusual spikes in activity.

✉️mailmanMar 11, 2025   4 days    source_logosource_logosource_logo

Content generation, digital marketing

Personalized, LLM-based email marketing automation with user segmentation and A/B testing analytics


✉️ mailman  source_logosource_logosource_logo  Personalized, LLM-based email marketing automation with user segmentation and A/B testing analytics

🧙🏻‍♂️Gnomych2023   ~2 weeks    source_logosource_logosource_logo

Data processing

A tool that automates data cleaning tasks, validates raw data using rule-based constraints, provides data profiling and reporting, and offers automated correction suggestions for common data issues. Designed to be used both as a standalone tool and as an integrated component in ETL pipelines.


🧙🏻‍♂️ Gnomych  source_logosource_logosource_logo  A tool that automates data cleaning tasks, validates raw data using rule-based constraints, provides data profiling and reporting, and offers automated correction suggestions for common data issues. Designed to be used both as a standalone tool and as an integrated component in ETL pipelines.

🎯SEO-CTR-optimizer2024   1 week    source_logosource_logosource_logo

Digital marketing, web scraping

Digital marketing analytics solution that scrapes websites for SEO factors and predicts advertisement CTR


🎯 SEO-CTR-optimizer  source_logosource_logosource_logo  Digital marketing analytics solution that scrapes websites for SEO factors and predicts advertisement CTR

🌐Social metrics2025   ~2 weeks    source_logosource_logosource_logo

Dashboards

Full-stack data analytics platform using REST API integrations, sentiment/trend analysis and real-time interactive dashboards built with Python, Flask and Dash, aggregating social media data from Twitter, Facebook, Instagram, Discord, Medium, Reddit, YouTube and TikTok for marketing insights


🌐 Social metrics  source_logosource_logosource_logo  Full-stack data analytics platform using REST API integrations, sentiment/trend analysis and real-time interactive dashboards built with Python, Flask and Dash, aggregating social media data from Twitter, Facebook, Instagram, Discord, Medium, Reddit, YouTube and TikTok for marketing insights

📶Demo for DatavisionMar 9, 2025   1 week    source_logosource_logosource_logo

Dashboards

Real‑time/historical analytics dashboard suite unifying multi‑source enterprise data (social media, server, HR, financial, marketing, transactions) using data integration, streaming, visualization and ML techniques (Apache Superset, Grafana, Kafka, Prometheus, Python, React)


📶 Demo for Datavision  source_logosource_logosource_logo  Real‑time/historical analytics dashboard suite unifying multi‑source enterprise data (social media, server, HR, financial, marketing, transactions) using data integration, streaming, visualization and ML techniques (Apache Superset, Grafana, Kafka, Prometheus, Python, React)

👥CuanaMar 27, 2025   3 days    source_logosource_logosource_logo

Customer analytics

End-to-end analytics suite that integrates churn/CLV prediction, customer segmentation and lead scoring into a solution with data pipelines and an interactive dashboard


👥 Cuana  source_logosource_logosource_logo  End-to-end analytics suite that integrates churn/CLV prediction, customer segmentation and lead scoring into a solution with data pipelines and an interactive dashboard

🛒retail-upselling   Work in progress    source_logosource_logosource_logo

Association rule mining

A pipeline for market basket analysis aimed at identifying product associations to optimize retail promotions and bundle deals using SQLite, mlxtend & D3.js


🛒 retail-upselling  source_logosource_logosource_logo  A pipeline for market basket analysis aimed at identifying product associations to optimize retail promotions and bundle deals using SQLite, mlxtend & D3.js

👁️Kallisto2025 – Ongoing       source_logosource_logosource_logo

OSINT, NLP, web agents, web scraping

LLM-based OSINT tool designed to perform deep web searches by orchestrating multiple web agents and a knowledge agent that uses state-of-the-art machine learning methods. The tool crawls the entire web to gather massive amounts of publicly available data, then leverages advanced LLM techniques to perform natural language tasks on that information.


👁️ Kallisto  source_logosource_logosource_logo  LLM-based OSINT tool designed to perform deep web searches by orchestrating multiple web agents and a knowledge agent that uses state-of-the-art machine learning methods. The tool crawls the entire web to gather massive amounts of publicly available data, then leverages advanced LLM techniques to perform natural language tasks on that information.

🐋QASATIKMar 10, 2025   2 weeks    source_logosource_logosource_logo

Question answering, big data analysis

LLM-based Q&A on preloaded docs, raw data, Wikipedia articles and scraped web pages with knowledge graphs, analytics, charts and Streamlit interface


QASATIK is an app dedicated to helping interrogate large volumes of documents, data files and web pages. Built with Streamlit, it supports file uploads, online article scraping and querying using configurable language models (OpenAI, LangChain and LlamaIndex). In addition, it provides interactive knowledge graph visualizations, analytics and charting utilities to help explore and understand the data.

🐋 QASATIK  source_logosource_logosource_logo  LLM-based Q&A on preloaded docs, raw data, Wikipedia articles and scraped web pages with knowledge graphs, analytics, charts and Streamlit interface

📊Telegram chat analyzer2023   ~2 weeks    source_logosource_logosource_logo

Text analysis

Streamlit-based interface for uploading Telegram chat exports and obtaining detailed statistics/visualizations using various data analysis techniques and natural language processing. Also supports chat exports from WhatsApp.


📊 Telegram chat analyzer  source_logosource_logosource_logo  Streamlit-based interface for uploading Telegram chat exports and obtaining detailed statistics/visualizations using various data analysis techniques and natural language processing. Also supports chat exports from WhatsApp.

🔎Paysage2025 – Ongoing   1 month    source_logosource_logosource_logo

Data processing

Pandas add-on library: find data quality issues, clean/improve dataframes and generate reports — in one line, using scikit-learn transformer


🔎 Paysage  source_logosource_logosource_logo  Pandas add-on library: find data quality issues, clean/improve dataframes and generate reports — in one line, using scikit-learn transformer