Image Captioning Using Transformer
After numerous attempts with RNNs, GRUs, and LSTMs to generate captions for images...
Read on Medium →
A Computer Engineer Specializing in AI & ML Solutions.
Tribhuvan University, Institute of Engineering (TU, IOE)
2021 - 2025
Focused on software development, artificial intelligence, and machine learning. Relevant Coursework: Data Mining, Microprocessor, Object Oriented Programming, Artificial Intelligence, and Theory of Computation.
Inspiring Lab
Aug 2025 - Present
Skillrank
Feb 2025 - July 2025
Scala Technologies
Oct 2024 - Feb 2025
Designed a real-time anomaly detection system covering 13 anomaly classes (fighting, accidents, robbery, vandalism, explosions, and others) using fine-tuned Vision Transformers. Integrated WebRTC for live streaming, WebSockets for real-time communication, and the Telegram API for alerts.
Developed an image captioning system using Python, Django, and deep learning, integrating Transformer architectures like BERT, GPT, and T5 alongside VGG16 and LSTM. Enhanced feature extraction, caption generation, and NLP refinement for seamless user interaction and accurate, coherent captions.
Developed an Online Course Recommendation System with my minor project team using Python, Django, and NLP techniques such as TF-IDF and cosine similarity, with curated data for accurate, tailored recommendations.
Built a Streamlit-based PDF question-answering app that uses Weaviate for vectorized Q&A over uploaded documents. Users can upload PDFs and ask questions about their contents.
Developed an automated cold email generator using LangChain, Llama 3.1, and ChromaDB for precise job matching and tailored applications.
Developed a cotton plant disease classification system using CNN and Django, enabling precise image analysis for early detection and improved crop management.
Built Studybuddy, a website using HTML, CSS, JavaScript, and Django, enabling users to create/join rooms, connect, chat, and share knowledge in their fields of interest.
Medium
Machine Learning is described as a subset of artificial intelligence that is mainly concerned with the developmen...
Read on Medium →
Medium
SQL: a language used for relational databases. Query data...
Read on Medium →
You can find me on these platforms: