DocLLM

This is an implementation of the DocLLM paper for Llama models. Based on the paper "DocLLM: A layout-aware generative language model for multimodal document understanding".

License

Most of the code in this repository is published under MIT license. However, the script "src/external_scripts/document_tokenization/document_tokenization_pymupdf.py" is published GNU Affero General Public License due to it using PyMuPDF. If another license for PyMuPDF is acquired, the script may also be used under that license.

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
src		src
test		test
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DocLLM

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

BlueCrescent/DocLLM

Folders and files

Latest commit

History

Repository files navigation

DocLLM

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages