Official repository for the AAAI2025 paper Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Coding ...
A YOLOv8-based detector for manga speech bubbles and text boxes. This project uses computer vision and deep learning to automatically detect and classify different types of text elements in manga ...