Last updated: January 1, 2025
I'm a Data Scientist at JMAN Group, a Forbes Top Management Consulting firm (2024). My work spans AI, data science, and mathematical modeling.
- π Passionate about exploring the intersection of mathematics, statistics, and programming to uncover truths and create impactful solutions.
- π A lifelong learner, delving into topics ranging from backend development to deep learning and advanced AI concepts.
- π― Dedicated to using technology to address real-world challenges and contribute to societal betterment.
- US and EMEA-based Cybersecurity Company: Designed and implemented a LightGBM model for seat expansion propensity, achieving 40% precision and 50% recall, leading to a 15% revenue boost ($7M). Model insights enabled sales reps to prioritize leads effectively.
- Europe-based Parking Company: Conducted topic modeling using MiniLM, UMAP, and HDBSCAN on GDPR-compliant transcript data, increasing actionable insights by 20%.
- AutoML Tool: Developed an internal AutoML tool integrating feature selection, Bayesian optimization, and SHAP explainability, reducing project kickoff time by 2 weeks for predictive modeling tasks.
- Automated web scraping for a telecom firm using AI-based reCAPTCHA solving, reducing data extraction time by 5x.
- Developed a budget optimization tool leveraging SLSQP, Django, and React, streamlining category allocations by 30%.
- Built an AI-powered invoice processing solution using YOLOv7, achieving 80% accuracy for item detection.
- Designed and implemented a Django backend, integrated with a user-friendly web interface hosted on DigitalOcean.
- Hadoop and Hive: Using Hadoop and Hive for my data storage and pre processing requirements for all my future projects
- Stock Market Kaggle Competitions: Exploring kaggle competitions on stock market data to solidify my understanding on application of ML systems in financial markets
- HandcraftedML: A collection of ML topics and code, sometimes from scratch or through implementation.
- DL with TensorFlow: Extensive guide on TensorFlow basics, neural networks, and MNIST classification.
- CSV2Notion-Neo: CLI tool for advanced CSV upload to Notion with enhanced speed and automation.
- Sandbox: A collection of non-ML common software engineering topics that excite me.
- View more of my projects here.
I write to deepen my understanding and share insights on data technology and programming. Some of my popular posts:
- Understanding Cosine Similarity β 5.5k+ Reads 10k Views
- Threading vs Multiprocessing β 5.5k+ Reads 9k Views
- Markov Chains β 545 Reads
- Look at my medium profile to view my other blogs