Skip to content

PaneruVipin/Email-Scraper

Repository files navigation

Email Scraper (Google CSE Based)

An email extraction tool built with Node.js that leverages Google Custom Search Engine (CSE) to discover webpages and extract publicly available email addresses.

This project demonstrates integration with external APIs, search-based data discovery, and pattern-based email extraction.

🔧 Key Features

  • Uses Google Custom Search Engine (CSE) API to discover relevant webpages
  • Extracts email addresses using pattern matching (regex)
  • Handles asynchronous API calls
  • Processes search results programmatically
  • Structured for easy extension and modification

💡 Why This Project

This project showcases:

  • Third-party API integration (Google CSE)
  • Asynchronous request handling in Node.js
  • Data parsing and extraction logic
  • Building utility scripts for automated data discovery

It demonstrates practical backend skills such as:

  • API consumption
  • Response parsing
  • Data filtering
  • Workflow automation

🛠 Tech Stack

  • Node.js
  • Google Custom Search Engine (CSE) API
  • JavaScript (ES6+)
  • Axios / Fetch (for API requests)

🚀 How to Run

  1. Clone the repository

    git clone https://github.com/PaneruVipin/Email-Scraper.git
    cd Email-Scraper

  2. Install dependencies

    npm install

  3. Configure your Google CSE API Key and Search Engine ID in environment variables

  4. Run the script

    npm run dev

🧠 Learning Outcomes

  • Integrating external search APIs
  • Handling rate limits and API responses
  • Extracting structured data from search results
  • Writing modular Node.js scripts

About

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors