ALGO LOG
MS Data Science · UMD · GPA 3.78

Ajaykumar
Balakannan

>

Building scalable ML systems on 15M+ records — from anomaly detection pipelines to real-time forecasting models. Proven track record of 35% false positive reduction, 25% efficiency gains, and data-driven decision making.

Live K-Means · click canvas to resetinitialising…
Gradient descent · loss surfacewaiting…
Logistic classifier · live trainingwaiting…
3.78
GPA @ UMD
2nd
National Hackathon
15M+
Records Processed
10+
Yrs Survey Data
Experience

Work History



May – Aug 2025
Canaria Inc.
New York, NY
Data Science Intern
  • Built v0 statistical models and v1 PyCaret pipelines (Isolation Forest, KNN, HBOS, COPOD, OCSVM) on 15M+ records, containerized with Docker. Reduced false positives by 35% and improved system reliability.
  • Developed a salary modeling system (Random Forest, Gradient Boosting), segmented by SOC × location × experience. Deployed on AWS, boosting client trust and adding 10+ new clients.
  • Designed LLM-powered GenAI workflows for unstructured text, automating summarization and reporting. Eliminated repetitive manual tasks, saving 15+ analyst hours weekly and accelerating delivery cycles.
  • Delivered granular pay insights across 40,000+ zip codes and showcased real-world GenAI applications, strengthening business adoption and enhancing client satisfaction.
PyCaretIsolation ForestDockerAWSGenAI/LLMRandom Forest
Sep 2024 – Present
Counseling Center, UMD
College Park, MD
Graduate Research Assistant
  • Led a team to build SQL ETL workflows and Tableau dashboards with Informatica integration, delivering weekly insights that improved counselor time management and scheduling efficiency by 25%.
  • Applied hypothesis testing and sentiment analysis on 10+ years of student survey data, uncovering behavioral patterns that helped administrators secure budget increases for two consecutive quarters.
  • Supported resource allocation discussions with administrators by translating insights into clear actions, cutting redundant resources by 20% and enabling more productive counselor utilization.
SQL · InformaticaTableauSentiment AnalysisHypothesis TestingPython
Skills

Technical Stack



Core Proficiency
Python · NumPy · Pandas · Sklearn92%
ML / Anomaly Detection88%
SQL · Data Engineering85%
Visualization · Tableau · Power BI80%
Cloud · Docker · CI/CD75%
Deep Learning · NLP78%
Machine Learning
Isolation ForestHBOSRandom ForestXGBoostARIMASVMYOLOv5KNNGradient Boosting
Programming & Data
PythonRSQLMATLABPandasNumPyPyCaretScikit-learn
Engineering & Cloud
DockerApache SparkAWSAzureGit/CI-CDPostgresClickHouseHadoop
Visualization
TableauPower BIQlik SenseExcel (Pivot/Macros)MatplotlibSeaborn
Projects

Research Projects



01 — COMPUTER VISION
Safeguarding Agricultural Lands from Animal Intrusion

YOLOv5 + CNN ensemble with automated data reconciliation system and hardware-integrated prototype for real-world deterrent responses in precision agriculture.

Accuracy: 90%
False Pos. ↓: 30%
Reliability ↑: 25%
YOLOv5CNNComputer VisionIoTHardware Integration
🏆 2nd Place · National Hackathon (500+ teams) · BIOGECKO Published
View on GitHub ↗
02 — TIME SERIES FORECASTING
Bitcoin Price Forecasting Pipeline

End-to-end forecasting pipeline with real-time Bitcoin data collection, ARIMA and XGBoost models, automated ETL workflows, and interactive Qlik Sense dashboards.

Accuracy: 85%
Manual Effort ↓: 40%
Real-time: Yes
ARIMAXGBoostTime SeriesQlik SensePythonSQL
View on GitHub ↗
03 — DATA ANALYTICS
Multifamily Real Estate Analytics Pipeline

End-to-end analytics pipeline for real estate portfolio management featuring automated Python ETL, SQLite database, Power BI dashboards, and comprehensive reporting system for multifamily properties.

Phases: 5-Stage
Reports: Automated
Stack: Python/SQL/BI
PythonSQLPower BIETLReal EstateAnalytics
View on GitHub ↗
Education

Academic Background



Master's in Data Science
University of Maryland
Expected May 2026 · Maryland, MD
GPA: 3.78 / 4.0
Natural Language Processing · Machine Learning · Big Data Systems · Deep Learning
B.Tech — Electrical & Electronics Engineering
Sri Krishna College of Engineering & Technology
Aug 2020 – May 2024 · Coimbatore, India
CGPA: 8.4 / 10.0
Microcontrollers · Power System Analysis · Python · Machine Learning in Energy Systems
Let's Build
Together.

Open to full-time roles, internships, and research collaborations in data science, ML engineering, and analytics.