B.Tech Data Science & Engineering
Indian Institute of Technology Palakkad

Hi, I'm Pavan Kalaganda

|

Building intelligent systems at the intersection of Machine Learning, Natural Language Processing, and Large Language Models. Passionate about turning complex data into actionable insights.

4+ ML Projects
15+ Technologies
R² 0.98 Best Model Score

Who I Am

I'm a Data Science and Engineering student at the Indian Institute of Technology, Palakkad, with a deep fascination for how machines understand language and patterns. My academic journey has equipped me with a strong foundation in statistical modeling, algorithm design, and system architecture.

My expertise spans the entire ML pipeline — from ETL and feature engineering to deploying Transformer-based models and Retrieval-Augmented Generation (RAG) systems. I've built production-ready search engines, predictive analytics dashboards, and classification models that solve real-world problems.

When I'm not training models, you'll find me competing in powerlifting at the national level — a discipline that has taught me the value of consistency, progressive overload, and strategic planning, principles I apply equally to my technical work.

ML & Deep Learning

PyTorch, Scikit-learn, Transformers, Time-Series Forecasting

NLP & LLMs

BERT, RAG, BM25, Learning-to-Rank, Search Evaluation

Data Engineering

ETL Pipelines, Elasticsearch, PostgreSQL, MongoDB

pavan.py
class PavanKalaganda:
    def __init__(self):
        self.name = "Pavan Kalaganda"
        self.role = "ML Engineer & Data Scientist"
        self.education = "IIT Palakkad"
        self.languages = ["Python", "SQL"]
        self.interests = [
            "NLP",
            "LLMs",
            "Information Retrieval",
            "Powerlifting"
        ]

    def solve(self, problem):
        return self.analyze(problem)                + self.model(problem)                + self.deploy(problem)

My Technical Stack

Tools and technologies I use to build intelligent systems

Languages

Python
SQL

ML & Deep Learning

PyTorch
Scikit-learn
Transformers
Ridge Regression
PCA
K-Means
Time-Series
Anomaly Detection

NLP & LLMs

Text Processing
BIO Tagging
Embedding Search
BM25
Learning-to-Rank
RAG
LLM Integration
NDCG / MRR

Data Engineering

ETL Pipelines
Feature Engineering
Data Cleaning
Elasticsearch
API Integration

Databases

PostgreSQL
MongoDB
Query Optimization
Indexing
Window Functions

Visualization & Tools

Plotly
Matplotlib
Seaborn
Streamlit
Power BI
Git/GitHub
Linux
Jupyter

Career Progression

My journey through academics, projects, and leadership

Aug 2023 – Present

B.Tech in Data Science and Engineering

Indian Institute of Technology, Palakkad

Pursuing undergraduate studies with focus on Machine Learning, Deep Learning, NLP, and Information Retrieval. Relevant coursework includes AI, Probability & Statistics, Big Data, Database Management Systems, and Data Structures & Algorithms.

CGPA: 6.0/10 Expected May 2027
2024 – Present

Machine Learning & NLP Projects

Independent & Academic Research

Developed end-to-end ML systems including stock market prediction pipelines (R² = 0.98), F1 analytics dashboards with Streamlit, numerical search engines over Wikipedia using Elasticsearch and BERT, and genre classification models.

PyTorch Elasticsearch RAG Streamlit
2026

Captain, Powerlifting Team

IIT Palakkad

Led the IIT Palakkad Powerlifting Team to a Gold Medal at the College General Championship 2026. Managed team training schedules, competition strategy, and athlete development.

Leadership Team Management Gold Medal
2025

Inter-IIT Weightlifting Representative

IIT Palakkad

Selected to represent IIT Palakkad at the Inter-IIT Weightlifting Competition, competing at a national-level platform against other IITs. Demonstrated exceptional physical discipline and competitive spirit.

National Level Athletics Discipline

Project Portfolio

A selection of my most impactful technical work

Nifty 50 Stock Market Prediction

Engineered time-series features and built an end-to-end ML pipeline to forecast next-day Nifty 50 stock prices. Applied PCA and K-Means clustering to segment market regimes and surface trend patterns.

R² = 0.98
Python Scikit-learn Ridge Regression PCA K-Means

F1 Data Analytics & ML Dashboard

Processed 50,000+ race records and engineered ML features for driver and team performance analysis. Developed an interactive Streamlit dashboard that reduced manual analysis effort by 60%.

R² = 0.90 50K+ Records
Python Scikit-learn PCA Streamlit

Numerical Search Engine over Wikipedia

Built a numerical information retrieval system leveraging BIO tagging and Transformer-based models. Implemented hybrid BM25 + BERT reranking pipeline via Elasticsearch for structured numerical queries.

Hybrid Search Wikidata Linked
Python Elasticsearch BERT HuggingFace Wikidata

Song Genre Prediction

Developed a genre classification model using audio features (tempo, energy, loudness) from song metadata. Applied feature engineering and preprocessing; evaluated with accuracy and macro F1-score metrics.

Audio Features F1-Score Optimized
Python Scikit-learn Logistic Regression Random Forest

Academic Background

Bachelor of Technology

Data Science and Engineering

Indian Institute of Technology, Palakkad

Aug 2023 – May 2027 CGPA: 6.0/10

Relevant Coursework

Machine Learning
Deep Learning
Natural Language Processing
Artificial Intelligence
Probability & Statistics
Information Retrieval
Data Structures & Algorithms
Big Data
Database Management Systems

Achievements & Awards

Gold Medal — College General Championship 2026

Captained IIT Palakkad Powerlifting Team to victory

Inter-IIT Weightlifting Representative

Selected to compete at the national-level Inter-IIT platform

GitHub Activity

My coding journey and contributions

pavan-kalaganda

Data Science & Engineering Student at IIT Palakkad

View Profile
Repositories
Total Stars
Python Primary Language
2023 Active Since

Contribution Activity

GitHub contribution graph for pavan-kalaganda

Featured Repositories

Loading repositories...

Download My Resume

Get a comprehensive overview of my skills, projects, and experience in PDF format.

Download CV

Contact Me

Have a project in mind or want to collaborate? Let's talk.

Contact Information

I'm currently open to internships, research collaborations, and freelance projects in Machine Learning, NLP, and Data Engineering.

Connect With Me

Send a Message