Hi Unstructured team,
We’re running into HTTP 429 “Too Many Requests” errors when using the unstructured.partition.pdf function in a containerized service deployed on Google Cloud Run.
It happens when the model tries to download the yolo_x_layout model during text extraction. Is there a way we can pass a token and download the model? Setting the HUGGINGFACE_HUB_TOKEN environment variable doesn’t seem to be working, and I’m not sure if we can download the model in docker image and make it work.
Unstructured extraction failed: 429 Client Error: Too Many Requests for url: https://huggingface.co/api/models/unstructuredio/yolo_x_layout/xet-read-token/xxxxxxxxxxxxxx (Request ID: Root=1-6900e75a-58dd51313482b30b223cace0;d35575bd-d4a3-42f1-acce-5139123442be)