OCR publication thread for early 2025
Do not close this issue until all checkboxes below are complete or have been rescheduled:
List of corpora:
In Processed OCR folder (need chapter divisions + sentence splitting+full automatic NLP processing like the bible corpora)
all documents above need to be moved to https://github.com/CopticScriptorium/auto-corpora when done
In GitDox
OCR publication thread for early 2025
Do not close this issue until all checkboxes below are complete or have been rescheduled:
List of corpora:
In Processed OCR folder (need chapter divisions + sentence splitting+full automatic NLP processing like the bible corpora)
all documents above need to be moved to https://github.com/CopticScriptorium/auto-corpora when done
In GitDox
apocalypse.paul (2)
- [ ] corpus name needed
- [ ] other metadata updated
- possibly error in data -- translation on p. 1043 begins with folio 24a but OCR coptic begins in the middle of folio 6a p. 533
pscyril.alexandria
pscyril.jerusalem
psepiphanius on Mary
pschrysostom
pscelestinus
pstimothy.alex
psote.psoi
timothy.discourse