The Deep Learning and Vision Computing Lab is dedicated to advanced theoretical research and innovative applications in artificial intelligence, computer vision, machine learning, and pattern recognition. Our current research focuses on deep learning, text detection and recognition, document analysis and understanding, and artificial intelligence. In recent years, our team has led more than 30 national and provincial research projects, with significant achievements in optical character recognition (OCR), handwriting recognition, gesture recognition and interaction technology, and innovative applications of deep learning. We have published over 300 SCI/EI papers, obtained more than 50 authorized invention patents, won 5 provincial and ministerial science and technology awards, and taken first place in international academic competitions 4 times.
Repositories
- DocHighlight Public
[PRCV 25] Towards Real-World Document Specular Highlight Removal: The DocHighlight Dataset and DocSHRNet Method
- OCR-Reasoning Public
[arXiv: 2505.17163] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
- DOLPHIN Public
[IEEE TIFS 2024] Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach
- PAVENet Public
[IEEE TPAMI 2025] Privacy-Preserving Biometric Verification With Handwritten Random Digit String
- MegaHan97K Public
[PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories"
- SigBench Public