GDText-VM: An Arbitrary-Shaped Scene Text Detector Based on Globally Deformable VMamba
- (2023/11/20) GDText-VM
In this paper, we proposed a novel method for efficient and accurate arbitrary-shaped scene text detection, termed GDText-VM, to improve the efficiency and accuracy of text detection.
The overall network architecture diagram will be uploaded after publication
First, clone the repository locally:
git clone https://github.com/radish512/GDText-VM.git
required:
PyTorch 1.1.0+
torchvision 0.3.0+
pip install -r requirement.txt
Please refer to dataset/README.md for dataset preparation.
CUDA_VISIBLE_DEVICES=0,1,2,3 python tools/train.py ${CONFIG_FILE}
For example:
CUDA_VISIBLE_DEVICES=0,1,2,3 python tools/train.py configs/GDText-VM/vssm_bfp_gasm_1200e_ctw1500.py
python tools/test.py ${CONFIG_FILE} ${CHECKPOINT_FILE}
For example:
python tools/test.py configs/GDText-VM/vssm_bfp_gasm_1200e_ctw1500.py checkpoints/checkpoint.pth
Please cite the related works in your publications if it helps your research:
This project is released under the Apache 2.0 license.