Tackling Document Shadow Removal with Deep Learning

Document digitization is everywhere in our modern world, but shadows cast on documents during scanning or photography can significantly degrade the quality and readability of the digitized content. This seemingly simple problem turned out to be quite challenging to solve effectively.

The Problem with Existing Approaches

Most existing methods for shadow removal were designed for natural images, not documents. Documents have unique characteristics:

High contrast between text and background
Geometric structures and layouts
Sensitivity to artifacts that might not matter in natural images

Our Solution: SD7K Dataset and Frequency-Aware Network

We created SD7K, a large-scale real-world dataset specifically for document shadow removal. What makes this dataset special:

Real-world diversity: Captured under various lighting conditions and document types
High resolution: Maintaining the quality needed for practical applications
Comprehensive annotations: Carefully labeled shadow regions and clean references

Along with the dataset, we developed a frequency-aware shadow erasing network that:

Analyzes shadows in the frequency domain
Preserves document structure while removing shadows
Achieves state-of-the-art performance on benchmark datasets

Impact and Applications

This work has practical implications for:

Document digitization workflows
Optical Character Recognition (OCR) systems
Archive preservation projects
Mobile document scanning applications

The combination of a high-quality dataset and an effective algorithm opens up new possibilities for automated document processing pipelines.

The Problem with Existing Approaches

Our Solution: SD7K Dataset and Frequency-Aware Network

Impact and Applications

Enjoy Reading This Article?