-
Couldn't load subscription status.
- Fork 2.1k
chore(deps): bump pypdf from 6.0.0 to 6.1.3 in /backend/requirements #5866
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Bumps [pypdf](https://github.com/py-pdf/pypdf) from 6.0.0 to 6.1.3. - [Release notes](https://github.com/py-pdf/pypdf/releases) - [Changelog](https://github.com/py-pdf/pypdf/blob/main/CHANGELOG.md) - [Commits](py-pdf/pypdf@6.0.0...6.1.3) --- updated-dependencies: - dependency-name: pypdf dependency-version: 6.1.3 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>
|
The latest updates on your projects. Learn more about Vercel for GitHub.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Greptile Overview
Summary
Updated pypdf dependency from 6.0.0 to 6.1.3, bringing important security fixes and bug improvements.
Key changes in this update:
- Security fixes: Addresses two critical security issues - limiting size of LZWDecode streams and preventing infinite loops when reading broken DCT-based inline images
- Bug fixes: Corrects PageObject.scale() media box scaling issue and improves handling of zero-length StreamObject
- Robustness improvements: Better handling of malformed PDFs, incremental PDF files, and cross-reference tables
The pypdf library is used indirectly through markitdown[pdf] (line 46 in default.txt) for PDF file processing in /backend/onyx/file_processing/extract_file_text.py. This update is a routine maintenance bump with backward-compatible improvements.
Confidence Score: 5/5
- This PR is safe to merge with minimal risk - it's a standard dependency update with security improvements
- The update is from a reputable library (pypdf) maintained by the py-pdf organization, brings important security fixes for potential DoS vulnerabilities, and maintains backward compatibility. The version jump from 6.0.0 to 6.1.3 is a minor/patch update that only includes bug fixes, security improvements, and robustness enhancements without breaking API changes. Since pypdf is used indirectly through markitdown for PDF processing, and the changes are focused on edge case handling and security, there's minimal risk of regression
- No files require special attention - this is a straightforward dependency version bump
Important Files Changed
File Analysis
| Filename | Score | Overview |
|---|---|---|
| backend/requirements/default.txt | 5/5 | Updated pypdf from 6.0.0 to 6.1.3, includes security fixes for LZWDecode streams and DCT-based inline images |
Sequence Diagram
sequenceDiagram
participant Dependabot
participant CI/CD
participant Requirements
participant MarkItDown
participant PyPDF
participant FileProcessing
Dependabot->>Requirements: Update pypdf 6.0.0 → 6.1.3
Note over Requirements: backend/requirements/default.txt
CI/CD->>Requirements: Install dependencies
Requirements->>MarkItDown: Install markitdown[pdf]
MarkItDown->>PyPDF: Dependency: pypdf==6.1.3
Note over PyPDF: Security fixes:<br/>- LZWDecode stream limits<br/>- DCT inline image loop fix<br/>- Robustness improvements
FileProcessing->>MarkItDown: Use for PDF conversion
MarkItDown->>PyPDF: Process PDF files
PyPDF-->>MarkItDown: Return parsed content
MarkItDown-->>FileProcessing: Return markdown text
1 file reviewed, no comments
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No issues found across 1 file
Bumps pypdf from 6.0.0 to 6.1.3.
Release notes
Sourced from pypdf's releases.
... (truncated)
Changelog
Sourced from pypdf's changelog.
... (truncated)
Commits
4a613f3REL: 6.1.3e51d078SEC: Allow limiting size of LZWDecode streams (#3502)f2864d6SEC: Avoid infinite loop when reading broken DCT-based inline images (#3501)b751ca2ROB: Fail with explicit exception when image mode is an empty array (#3500)e13a1e0BUG: PageObject.scale() scales media box incorrectly (#3489)0859b35REL: 6.1.2623a700BUG: Fix handling of zero-length StreamObject (#3485)16c4c44ROB: Deal with wrong size for incremental PDF files (#3495)54c0dd7DEV: Use released Python 3.149fad9ffDEV: Use Mapping instead of dict in type hint of update_page_form_field_value...Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting
@dependabot rebase.Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
@dependabot rebasewill rebase this PR@dependabot recreatewill recreate this PR, overwriting any edits that have been made to it@dependabot mergewill merge this PR after your CI passes on it@dependabot squash and mergewill squash and merge this PR after your CI passes on it@dependabot cancel mergewill cancel a previously requested merge and block automerging@dependabot reopenwill reopen this PR if it is closed@dependabot closewill close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually@dependabot show <dependency name> ignore conditionswill show all of the ignore conditions of the specified dependency@dependabot ignore this major versionwill close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)@dependabot ignore this minor versionwill close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)@dependabot ignore this dependencywill close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)You can disable automated security fix PRs for this repo from the Security Alerts page.
Summary by cubic
Upgrade pypdf to 6.1.3 to pick up security hardening and bug fixes. This improves safety when parsing malformed PDFs and fixes page scaling and stream handling issues.