YouSee - Your eyes, reimagined
Select the problem statement number and title from the official list given in Participant Manual.
Problem Statement 1 - Weave AI magic with Groq
What problem does your project solve, and who does it serve?
Briefly describe the real-world use case and the value it provides.
A voice-based accessibility tool using Groq which helps visually-imparied people to get the audio description of there surroundings. This app is useful for visually challenged people, elderly people, and also to people who like to explore new things. It can become a way for visually-challenged people to explore their surrounding with the help of technologies.
ChillHackers
- Ayush Kumar (https://github.com/ayushcodes1729 / www.linkedin.com/in/ayushcodes1729 / Backend and AI Engineer)
- Payal Verma (https://github.com/PayalVerma1 / https://www.linkedin.com/in/payal-verma-509244296/ / Frontend Engineer)
- Ayush Kumar Pandey (https://github.com/Akp2636 / https://www.linkedin.com/in/ayush-kr-pandey-37870127a/ / Backend Engineer)
- Ayush Kumar Pandey (https://github.com/riddhiUI / https://www.linkedin.com/in/riddhi-yadav-901b28293/ / UI/UX Designer)
- We chose this problem because we wanted to use AI to solve some real-world problems and by making this project we could serve a lot visually-impaired individuals, and also create curiousity in people.
- Some key challenges which we faced are: Sending Image to groq API in base64 encoding and then getting image description through that, and adding voice based command system to trigger capture function which sends the image data through live webcam to groq API
- Frontend: Reactjs, TailwindCSS, React-icons
- Backend: Nodejs, Expressjs
- Database: MongoDB
- APIs: Groq
- Hosting: Vercel
- [✅] Groq: How you used Groq
- Monad: Your blockchain implementation
- Fluvio: Real-time data handling
- Base: AgentKit / OnchainKit / Smart Wallet usage
- Screenpipe: Screen-based analytics or workflows
- Stellar: Payments, identity, or token usage (Mark with ✅ if completed)
Highlight the most important features of your project:
- ✅ Live Audio description of your surrounding through your webcam.
- ✅ Audio based input without any clicks or navigation for generating description
- ✅ Easy navigation through minimalistic UI
- ✅ Fast responses with secure APIs
Add images, GIFs, or screenshots if helpful!
- Demo Video Link: https://www.loom.com/share/04bf0b6b149f413ca88e646685b2efb4?sid=d819ff0b-dded-4fc7-b217-8a3175f39d32
- Pitch Deck / PPT Link: https://drive.google.com/file/d/12tNo6wmQt37Uk3W0hMinHAZOydLSDvI2/view?usp=drive_link
- [✅] All members of the team completed the mandatory task - Followed at least 2 of our social channels and filled the form (Details in Participant Manual)
- [✅] All members of the team completed Bonus Task 1 - Sharing of Badges and filled the form (2 points) (Details in Participant Manual)
- [✅] All members of the team completed Bonus Task 2 - Signing up for Sprint.dev and filled the form (3 points) (Details in Participant Manual)
(Mark with ✅ if completed)
- Node.js
- API Keys (if any): In backend: Groq api key
- .env file setup (if needed): In backend: MONGODB_URI, PORT, JWT_SECRET & In frontend: VITE_BASE_URL
# Clone the repo
git clone https://github.com/your-team/project-name
# Install dependencies
cd project-name
npm install
# Start development server
npm run devProvide any backend/frontend split or environment setup notes here.
List improvements, extensions, or follow-up features:
- 📈 No touch navigation on the app so that a visually-impaired people can navigate with just there voices.
- APIs or datasets used: Groq API used for image to text generation
- Open source libraries or tools referenced: React-icons, TailwindCSS, Mongoose,etc
- Acknowledgements
Share your hackathon journey — challenges, learnings, fun moments, or shout-outs!
- It was a great experience working in this hackathon. A lot of challenges happend with us while sending data to Groq APIs, creating live webcam with voice controlling features which lead us to increadible learning opportunities
