ads.txt is an IAB-approved text file that aims to prevent unauthorized inventory sales. Publishers drop a text file on their web servers that lists all of the companies that are authorized to sell the publisher's inventory. Similarly, programmatic platforms also integrate ads.txt files to confirm which publishers’ inventory they are authorized to sell. This allows buyers to check the validity of the inventory they purchase. Tell me more about this
sudo apt-get install gitgit clone https://github.com/kaustubhd93/adstxt-crawler.gitsudo apt-get install python-pip python-dev build-essentialsudo pip install virtualenvvirtualenv /path/to/your/virtualenvsource /path/to/your/virtualenv/bin/activatecd adstxt-crawlerpip install -r requirements.txt
sudo yum update(if os is newly installed.)sudo yum install epel-releasesudo yum install gitgit clone https://github.com/kaustubhd93/adstxt-crawler.gitsudo yum install gcc* python-devel python-pipsudo pip install -U pipsudo pip install virtualenvvirtualenv /path/to/your/virtualenv- Run this if virtualenv threw an import error
sudo pip install -U zipp configparser. Ignore this if virtualenv succeeded.
- Run this if virtualenv threw an import error
source /path/to/your/virtualenv/bin/activatecd adstxt-crawlerpip install -r requirements.txt
NOTE: List of domains should be written separately each on a new line.
domain1.xyz
domain2.xyz
www.domain3.xyz
./crawl.sh /path/to/listofdomainfile
If you are not running docker and want to use it. Check this installation guide for docker : https://docs.docker.com/get-docker/
docker pull kaustubhdesai/adstxtcrawler:0.1docker volume create adstxtcrawlerdocker run -id --name adstxtcrawler --mount source=adstxtcrawler,target=/app/adstxt/csv adstxtcrawler:0.1docker cp /path/to/file/with/domains/filename adstxtcrawler:/app/filenamedocker exec adstxtcrawler bash -c "./crawl.sh /app/filename"- Once above command has exited properly, run this command to check downloaded csv files.
cd /var/lib/docker/volumes/adstxtcrawler/_data