Skip to content

dunctk/thin-content-checker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Thin Content Checker

A command-line tool for SEO analysis that crawls websites to detect thin content pages and broken links.

Features

  • Crawls websites and checks word count on pages
  • Identifies pages with content below a minimum word threshold (default: 300 words)
  • Detects broken links (404 errors)
  • Option to exclude specific paths from checking

Installation

Prerequisites

  • Rust (latest stable version recommended)

Install from Source

  1. Clone the repository:

    git clone https://github.com/dunctk/thin-content-checker.git
    cd thin-content-checker
  2. Install globally:

    cargo install --path .

Install from GitHub

cargo install --git https://github.com/dunctk/thin-content-checker.git

Usage

Basic Usage

Check a website for thin content:

thin-content-checker https://example.com

Options

  • -m, --min-words <MIN_WORDS>: Set minimum word count (default: 300)
  • --exclude <EXCLUDE>: Exclude paths starting with the given string (can be used multiple times)

Examples

Check with custom word threshold:

thin-content-checker https://example.com --min-words 500

Exclude specific paths:

thin-content-checker https://example.com --exclude /accounts/ --exclude /admin/

Output

The tool outputs:

  • List of thin content pages with word counts
  • List of broken links

License

MIT

About

SEO CLI tool to check a website for thin content and broken links

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages