bhavikshangari BhavikShangari

💻 Bhavik Shangari - ML & AI Enthusiast

🌟 Welcome to my GitHub profile!
I'm Bhavik Shangari, a passionate Machine Learning and Artificial Intelligence enthusiast with a deep interest in robotics and solving real-world problems using advanced computational techniques. My expertise lies in developing and fine-tuning AI models, with a recent focus on Large Language Models (LLMs) and Generative AI.

📖 About Me

🎓 BTech in Data Science & Artificial Intelligence
*Indian Institute of Technology (IIT) Bhilai
💡 Passion: Building solutions that bridge the gap between cutting-edge AI and practical applications.
🔬 Current Focus: Training and deploying multimodal models, with special attention to low-powered devices and robotics.

🏆 Key Projects

🚀 Jetson-VLM 2

Trained a multimodal model combining SigLip and DINO V2 vision encoders with LLama 3.2 (1B).
Focused on deploying Vision Language Models on low-powered devices like Jetson Nano.
Implemented image alignment using Llava 595K v1.5 Data Mixture.

🤖 Gesture-Controlled Robotic Arm

Designed and built a robotic arm using computer vision, microcontrollers, and linear algebra.
Mimicked human hand movements for imitation learning and assistance for physically-aided individuals.

📈 Stock Market Forecasting

Built a novel Date&Time2Vec embedding method for capturing temporal relations in stock market data.
Implemented a Transformer architecture from scratch in PyTorch.
Processed 15 years of stock data with 5M+ data points, combining sentiment analysis and transformer architectures.

🏥 CloudPhysician

Developed a pipeline for extracting vital signs (e.g., SPO2, ECG) from patient monitor images using YOLO, OCR, and image processing.

🔧 Skills & Expertise

Languages

Python, C, C++

Frameworks & Libraries

PyTorch, TensorFlow, Hugging Face, Scikit-learn, Keras, OpenCV, NLTK

Domains

Deep Learning: CNNs, RNNs, Transformers, GANs, YOLO
Generative AI: Fine-tuning LLMs, multimodal integration
Computer Vision: Image segmentation, restoration, enhancement
Natural Language Processing: Multimodal alignment, retrieval

🎯 What I’m Working On

Training Jetson-VLM for improved performance on vision-language tasks.
Implementing multimodal robotic action control with OpenVLA approaches.

🌐 Connect with Me

🐙 GitHub: Bhavik Shangari
💼 LinkedIn: Bhavik Shangari
📫 Email: bhaviks@iitbhilai.ac.in | bhavikhangari@gmail.com

💡 Let's collaborate on exciting AI and robotics projects! Feel free to explore my repositories.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly