Skip to content
View Rasesh2005's full-sized avatar

Highlights

  • Pro

Block or report Rasesh2005

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Rasesh2005/README.md

πŸ‘‹ Hi, I'm Rasesh Udayakumar Shetty!

Welcome to my GitHub profile! I'm a passionate student pursuing an Integrated Dual Degree in Mathematics and Computing at IIT (BHU), Varanasi, with a strong interest in AI/ML research, computer systems, and algorithm design.

πŸ› οΈ Projects

πŸ” AI-Generated Image Detection with Explanation (October - December 2024)

  • Built a dataset of 3.4M+ images, with reasoning generated for 1M+ fake images using Gemini-1.5-Pro.
  • Designed a novel architecture leveraging Variational AutoEncoders with 99% accuracy on similar latent-space images and 75%+ accuracy on unseen datasets (Adobe Firefly).
  • Fine-tuned Qwen2-VL with Class Activation Mapping for image reasoning.
  • Tools: PyTorch, PyTorch Lightning, Gemini API.

πŸŽ₯ GIF Question Answering MultiModal ML Training (September - October 2024)

  • Developed an architecture for GIF Visual Question Answering using qformer from BLIP-2 and Llama3.2-1b.
  • Explored Vision-Language Models (VLMs) and their fine-tuning potential on GIF data.
  • Tools: PyTorch.

πŸ”’ Differential Equation Solver (August - November 2024)

  • Trained a neural network to solve Ordinary Differential Equations under the guidance of Dr. Santwana Mukhopadhyaya.
  • Explored Graph-based ML and how neural networks learn functions.
  • Tools: PyTorch.

πŸ’» Technologies

  • Languages: C++, C, Python, JavaScript
  • Frameworks & Tools: PyTorch, TensorFlow, LangChain, Cirq, Qiskit, MERN Web Development

πŸ“° Publications

  • Understanding the World’s Museums through Vision-Language Reasoning
    Curated a large-scale dataset of 65M images and 200M question-answer pairs for benchmarking vision-language models across visual question answering tasks.

πŸ”— Connect with Me

Feel free to explore my repositories and connect!


My github stats

Github Stats Here

Pinned Loading

  1. Reddit-Meme-API Reddit-Meme-API Public

    An API for Reddit Memes made using python

    Python 18 6

  2. Checkers-Pygame Checkers-Pygame Public

    Checkers game implemented using pygame

    Python

  3. Chess-GUI-Java Chess-GUI-Java Public

    Java 1

  4. Online-Stone-Paper-Scissor Online-Stone-Paper-Scissor Public

    Python

  5. Traffic-Control-Quantum-Annealing Traffic-Control-Quantum-Annealing Public

    Jupyter Notebook

  6. Shubham-Khetan-2005/Haxplore24 Shubham-Khetan-2005/Haxplore24 Public

    HTML 3