An automation tool for fetching, converting, and organizing Word-based CVs. Designed to eliminate manual document management.
- Python 3.x and the requests library.
- LibreOffice: Installed and added to your system PATH.
  - Verification: Open a terminal and type soffice --version.
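
The same PATH check can be done from Python before a build starts. This is a minimal sketch, not part of the project: the helper names `find_soffice` and `soffice_version` are hypothetical, and it only assumes that the LibreOffice binary is called soffice, as stated above.

```python
import shutil
import subprocess


def find_soffice(name="soffice"):
    """Return the full path of the LibreOffice binary if it is on PATH, else None."""
    # shutil.which honours PATHEXT on Windows, so "soffice" also matches soffice.exe.
    return shutil.which(name)


def soffice_version(path):
    """Run `soffice --version` and return whatever it prints to stdout."""
    result = subprocess.run(
        [path, "--version"], capture_output=True, text=True, timeout=30
    )
    return result.stdout.strip()
```

Calling `find_soffice()` and getting `None` back means LibreOffice is not reachable through PATH and the conversion step would fail.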
You do not need to create a .env file manually. Run the PowerShell script as administrator so that it can create the .env file:
./init.ps1
The script will:
- Check for an existing .env file.
- Prompt you for any missing fields (required fields can be viewed in the .env.example file).
- Save your configuration for future runs.
- Automatically trigger the Python build process with the environment variables already set (the Python script does NOT load the .env file itself).
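
Because the launcher passes the configuration through the process environment, the Python side only needs to read environment variables. The sketch below shows one way that could look. It assumes CV_TEMP_DIR and CV_FINAL_DIR are among the variables (they are the only two named in this README); the real list lives in .env.example, and `load_config` is a hypothetical helper name.

```python
import os

# Assumption: only the two directory settings named in this README are checked here.
REQUIRED_KEYS = ("CV_TEMP_DIR", "CV_FINAL_DIR")


def load_config(environ=None):
    """Read the required settings from the environment, failing early if any is missing."""
    env = os.environ if environ is None else environ
    missing = [key for key in REQUIRED_KEYS if not env.get(key)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {key: env[key] for key in REQUIRED_KEYS}
```

Running the Python script directly, without init.ps1, would hit the `RuntimeError` branch unless the variables were exported by hand.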
- Atomic Updates: Uses a temporary staging area to prevent corrupted or partial updates.
- URL Sanitization: Automatically converts SharePoint links to direct download streams.
- Headless Conversion: Uses the native LibreOffice CLI (soffice) for rapid PDF generation.
- Auto-Bootstrapper: Includes a PowerShell launcher (init.ps1) that manages your .env configuration and starts the script.
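
The URL sanitization feature can be illustrated with the standard library alone. This is a sketch under the assumption that "sanitizing" means dropping the existing query string and fragment and appending download=1; the function name `to_direct_download` and the example host are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def to_direct_download(url):
    """Drop the original query string and fragment, then force ?download=1."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "download=1", ""))


if __name__ == "__main__":
    shared = "https://example.sharepoint.com/:w:/g/personal/user/abc?e=XyZ&web=1"
    print(to_direct_download(shared))
```

Only the scheme, host, and path of the shared link survive, so tracking or viewer parameters cannot leak into the download request.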
- Prepare: The CV_TEMP_DIR is wiped to ensure a fresh build.
- Sanitize URLs: The URLs are stripped of clutter and forced to ?download=1.
- Download: .docx files are pulled into the staging area.
- Convert: soffice converts Word files to PDF.
- Update: If all steps succeed, the existing CV_FINAL_DIR is replaced by the staging folder.
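
The prepare, convert, and update steps above can be sketched as follows. The helper names are hypothetical and the project's own code may differ. The soffice flags used (--headless, --convert-to, --outdir) are standard LibreOffice options. Note that deleting the final folder and then moving the staging folder is not strictly atomic, since there is a brief window in which neither exists.

```python
import shutil
from pathlib import Path


def prepare_staging(temp_dir):
    """Wipe and recreate the staging directory so each build starts clean."""
    temp = Path(temp_dir)
    if temp.exists():
        shutil.rmtree(temp)
    temp.mkdir(parents=True)
    return temp


def conversion_command(docx_path, out_dir):
    """Build the headless LibreOffice command that converts one .docx to PDF."""
    return [
        "soffice", "--headless",
        "--convert-to", "pdf",
        "--outdir", str(out_dir),
        str(docx_path),
    ]


def promote_staging(temp_dir, final_dir):
    """Replace the final directory with the staging directory.

    Call this only after every download and conversion has succeeded.
    """
    temp, final = Path(temp_dir), Path(final_dir)
    if final.exists():
        shutil.rmtree(final)
    shutil.move(str(temp), str(final))
```

The command list returned by `conversion_command` is meant to be passed to `subprocess.run`. Keeping the swap in a separate function makes it easy to skip when any earlier step fails, which leaves the previous CV_FINAL_DIR untouched.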