SCConnect

This is a simple Python web scraper to extract contact information (emails, phone numbers, and addresses) from local business websites.

Installation

Clone this repository:

git clone https://github.com/your-username/web-scraper.git

Navigate to the project directory:
```
cd web-scraper
```
Create a virtual environment (if you don't have one):
```
python -m venv venv
```
Activate the virtual environment:

On Windows (Command Prompt):
```
.\venv\Scripts\activate
```
On macOS/Linux:
```
source venv/bin/activate
```

Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

Run the app script to start the scraper's web interface:
```
python app.py
```
Enter the URL of the local business page when prompted.
Filter for Santa Cruz locations: The scraper only returns contact information when the page mentions "Santa Cruz". Pages for businesses located elsewhere are filtered out.
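
The filtering and extraction described above can be sketched roughly as follows. This is a minimal illustration, not the code in scraper.py: the function names `mentions_santa_cruz` and `extract_contacts` and both regular expressions are assumptions made for this example, and address extraction is left out because it is much harder to capture with a simple pattern.

```python
import re

# Hypothetical patterns for illustration; scraper.py may use different ones.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
# Matches common US formats such as (831) 555-0100, 831-555-0100, 831.555.0100
PHONE_RE = re.compile(r"\(?\b\d{3}\)?[\s.-]?\d{3}[\s.-]?\d{4}\b")


def mentions_santa_cruz(page_text: str) -> bool:
    """Return True if the text mentions "Santa Cruz" (case-insensitive)."""
    return re.search(r"santa\s+cruz", page_text, re.IGNORECASE) is not None


def extract_contacts(page_text: str) -> dict:
    """Return emails and phone numbers, or empty lists if the page is filtered out."""
    if not mentions_santa_cruz(page_text):
        return {"emails": [], "phones": []}
    # dict.fromkeys removes duplicates while keeping first-seen order
    emails = list(dict.fromkeys(EMAIL_RE.findall(page_text)))
    phones = list(dict.fromkeys(PHONE_RE.findall(page_text)))
    return {"emails": emails, "phones": phones}
```

A text-level check like this cannot tell a business located in Santa Cruz from a page that merely mentions the city, so it should be read as a coarse filter.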

Troubleshooting

Line Endings Warning

If you see a warning like:

warning: in the working copy of 'venv/Scripts/activate', LF will be replaced by CRLF the next time Git touches it

it is because Git automatically converts line endings between operating systems. You can prevent this by configuring Git:

For Windows-only development:

git config --global core.autocrlf false

For cross-platform development:

git config --global core.autocrlf true

Dependencies

The script uses the following dependencies:

requests: To handle HTTP requests and fetch webpage content.
beautifulsoup4: For parsing HTML and extracting the contact information.
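
As a rough sketch of how these two libraries typically fit together (not necessarily how scraper.py is written), the example below downloads a page with requests and reduces it to visible text with Beautiful Soup. The function names `html_to_text` and `fetch_page_text` and the 10-second timeout are assumptions for illustration.

```python
import requests
from bs4 import BeautifulSoup


def html_to_text(html: str) -> str:
    """Parse HTML and return its visible text, pieces joined by single spaces."""
    soup = BeautifulSoup(html, "html.parser")
    # Drop script and style elements so their contents are not scanned as text
    for tag in soup(["script", "style"]):
        tag.decompose()
    return soup.get_text(separator=" ", strip=True)


def fetch_page_text(url: str, timeout: float = 10.0) -> str:
    """Download a page and return its visible text."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # raise an exception on 4xx/5xx responses
    return html_to_text(response.text)
```

Keeping the parsing step separate from the network call makes it possible to exercise the extraction logic on saved HTML without hitting a live site.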
