We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
我通过magic-pdf转换不是很清晰的扫描版英文杂志时,发现里面的字母、单词识别的都挺对,但就是英文单词间的空格不见了。在Issue里也搜到了不少相关问题,也看到有的能解决而有的是英文空格因为大小问题不好识别。
能不能在识别后,再进行一遍后处理,用语言模型/词典来补入空格?甚至通过判断拼写错误,来提升英文文章的识别结果?
The text was updated successfully, but these errors were encountered:
你先试试在huggingface或者modelscope的demo↑空格正常不正常
Sorry, something went wrong.
在线的没有这个问题,为什么呢?我是按照说明文档在win11上自己部署的
我又试了下载安装的windows app,空格问题仍旧存在,不过效果比我自己部署的好一些,也就是app版本的结果介于自己部署和在线demo之间。
demo上是用于预览的版本,会拥有一些较新的功能和特性,一般在下个版本发布时会同步到正式版。
No branches or pull requests
我通过magic-pdf转换不是很清晰的扫描版英文杂志时,发现里面的字母、单词识别的都挺对,但就是英文单词间的空格不见了。在Issue里也搜到了不少相关问题,也看到有的能解决而有的是英文空格因为大小问题不好识别。
能不能在识别后,再进行一遍后处理,用语言模型/词典来补入空格?甚至通过判断拼写错误,来提升英文文章的识别结果?
The text was updated successfully, but these errors were encountered: