-
It can extract main information about the website including title, description, media link.
-
It can find all the bait products by surfing all the pages recursively.
-
It can extract product information including title, description, price, sku, all media link
- BASE_URL :
Base API URL - PORT :
Port number - OPENAI_API_KEY :
Custom OpenAI API key - CHROMIUM :
Path where chromium is installed. - TIME_OUT :
API response timeout
-
npm run start -
dev mode:
npm run dev
-
Docker build:
docker-compose build -
Docker run process:
docker-compose up -d
-
Use puppeteer to fetch all content from the webpage and visit all subpages.
-
Use ChatGPT APIs, extract information from the content.
-
After getting all pages, filter product page.
-
Get detailed information from the product page