With data analysis skills, I can help you to understand and use data in a way that makes sense. This includes data mining, data wrangling, big data processing, exploratory data analysis (EDA), business analytics, decision-making consulting and other ad hoc data-related tasks.
The following list represents some of my highlighted data analysis projects (excluding open-source contributions and other proprietary or small projects).
Dashboards
Apache Superset dashboard prototype based on strategic marketing project (channel prioritization, budget allocation, ROI forecasting)



Dashboards
Marketing analytics dashboard for audience segmentation and multi-channel ROI analysis with ETL, implemented with Dash and Airflow

Streaming analytics
End‑to‑end real-time analytics pipeline using PySpark Structured Streaming. The pipeline ingests user activity logs from Kafka, applies multiple levels of transformations and aggregations, and computes both basic and session-level metrics. It includes dynamic anomaly detection using moving averages and standard deviations, and flags unusual spikes in activity.
Content generation, digital marketing
Personalized, LLM-based email marketing automation with user segmentation and A/B testing analytics
Data processing
A tool that automates data cleaning tasks, validates raw data using rule-based constraints, provides data profiling and reporting, and offers automated correction suggestions for common data issues. Designed to be used both as a standalone tool and as an integrated component in ETL pipelines.
Digital marketing, web scraping
Digital marketing analytics solution that scrapes websites for SEO factors and predicts advertisement CTR
Dashboards
Full-stack data analytics platform using REST API integrations, sentiment/trend analysis and real-time interactive dashboards built with Python, Flask and Dash, aggregating social media data from Twitter, Facebook, Instagram, Discord, Medium, Reddit, YouTube and TikTok for marketing insights
Dashboards
Real‑time/historical analytics dashboard suite unifying multi‑source enterprise data (social media, server, HR, financial, marketing, transactions) using data integration, streaming, visualization and ML techniques (Apache Superset, Grafana, Kafka, Prometheus, Python, React)
Customer analytics
End-to-end analytics suite that integrates churn/CLV prediction, customer segmentation and lead scoring into a solution with data pipelines and an interactive dashboard
Association rule mining
A pipeline for market basket analysis aimed at identifying product associations to optimize retail promotions and bundle deals using SQLite, mlxtend & D3.js
OSINT, NLP, web agents, web scraping
LLM-based OSINT tool designed to perform deep web searches by orchestrating multiple web agents and a knowledge agent that uses state-of-the-art machine learning methods. The tool crawls the entire web to gather massive amounts of publicly available data, then leverages advanced LLM techniques to perform natural language tasks on that information.
Question answering, big data analysis
LLM-based Q&A on preloaded docs, raw data, Wikipedia articles and scraped web pages with knowledge graphs, analytics, charts and Streamlit interface
QASATIK is an app dedicated to helping interrogate large volumes of documents, data files and web pages. Built with Streamlit, it supports file uploads, online article scraping and querying using configurable language models (OpenAI, LangChain and LlamaIndex). In addition, it provides interactive knowledge graph visualizations, analytics and charting utilities to help explore and understand the data.
Text analysis
Streamlit-based interface for uploading Telegram chat exports and obtaining detailed statistics/visualizations using various data analysis techniques and natural language processing. Also supports chat exports from WhatsApp.