Document Upload: Streamline to provider upload #734

@nishika26

Description

Describe the current behavior
Currently, document upload is a two-step process, which takes a long time:

  • POST /documents - Uploads file to S3 and saves to database only
  • POST /collections - Internally downloads files from S3, uploads to provider (OpenAI/Bedrock), creates vector store

Describe the enhancement you'd like
Add an optional upload_to_provider parameter to the document upload endpoint.

New flow would be:

  • Upload to S3 (current)
  • Save to DB (current)
  • NEW: Upload to provider (OpenAI/Bedrock/Gemini) if specified
  • Store provider_file_id in document
  • Collection creation uses existing provider files → much faster
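
The proposed flow could be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's implementation: `Document`, `FakeProvider`, `upload_document`, and `create_collection` are hypothetical names, S3 and the database are replaced by plain dicts, and the provider is an in-memory stand-in rather than a real OpenAI/Bedrock/Gemini client. Only `upload_to_provider` and `provider_file_id` come from the issue itself.

```python
import uuid
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Document:
    """Hypothetical document record; only provider_file_id is named in the issue."""
    id: str
    filename: str
    s3_key: str
    provider_file_id: Optional[str] = None


class FakeProvider:
    """In-memory stand-in for a provider's file API (assumption, not a real SDK)."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.files: Dict[str, bytes] = {}

    def upload_file(self, filename: str, data: bytes) -> str:
        file_id = f"{self.name}-file-{len(self.files) + 1}"
        self.files[file_id] = data
        return file_id


def upload_document(
    filename: str,
    data: bytes,
    s3: Dict[str, bytes],
    db: Dict[str, Document],
    providers: Dict[str, FakeProvider],
    upload_to_provider: Optional[str] = None,
) -> Document:
    # Step 1 (current): upload the file to S3 (a dict here).
    s3_key = f"documents/{uuid.uuid4()}/{filename}"
    s3[s3_key] = data
    # Step 2 (current): save the document record to the database.
    doc = Document(id=str(uuid.uuid4()), filename=filename, s3_key=s3_key)
    db[doc.id] = doc
    # Step 3 (new): upload to the provider only if one was specified,
    # and store the returned file id on the document.
    if upload_to_provider is not None:
        provider = providers[upload_to_provider]
        doc.provider_file_id = provider.upload_file(filename, data)
    return doc


def create_collection(
    doc_ids: List[str],
    s3: Dict[str, bytes],
    db: Dict[str, Document],
    provider: FakeProvider,
) -> List[str]:
    """Return provider file ids for a collection, reusing existing ones.

    Assumes any stored provider_file_id belongs to the same provider.
    """
    file_ids = []
    for doc_id in doc_ids:
        doc = db[doc_id]
        if doc.provider_file_id is None:
            # Fallback to today's slow path: read from S3, then upload.
            doc.provider_file_id = provider.upload_file(doc.filename, s3[doc.s3_key])
        file_ids.append(doc.provider_file_id)
    return file_ids


# Usage: one document uploaded with the new parameter, one without.
s3: Dict[str, bytes] = {}
db: Dict[str, Document] = {}
openai_stub = FakeProvider("openai")
providers = {"openai": openai_stub}

fast_doc = upload_document("a.pdf", b"aaa", s3, db, providers, upload_to_provider="openai")
slow_doc = upload_document("b.pdf", b"bbb", s3, db, providers)
file_ids = create_collection([fast_doc.id, slow_doc.id], s3, db, openai_stub)
```

In this sketch, `create_collection` skips the S3 download and provider upload for any document that already has a `provider_file_id`, which is where the expected speedup would come from. Keeping the fallback path means documents uploaded without the parameter still work as they do today.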

Metadata

Labels: enhancement (New feature or request)
Status: In Progress
Milestone: No milestone