Implementing image text recognition and automatic conversion based on YOLOv5 combined with AI technology #13381

Open
2 tasks done
1259xcdh opened this issue Oct 24, 2024 · 2 comments · May be fixed by #13383
Labels
enhancement New feature or request

Comments

@1259xcdh

Search before asking

  • I have searched the YOLOv5 issues and found no similar feature requests.

Description

Automatically recognizing text in images and converting it into machine-readable text, or even re-rendering it as new images, could greatly improve the efficiency of intelligent systems in document processing, visual search, and information acquisition. With an image generation model (such as a GAN), the recognized text can then be regenerated as images that meet specific formatting and style requirements.

Use case

No response

Additional

No response

Are you willing to submit a PR?

  • Yes I'd like to help by submitting a PR!
@1259xcdh added the enhancement (New feature or request) label on Oct 24, 2024
@UltralyticsAssistant
Member

👋 Hello @1259xcdh, thank you for your interest in YOLOv5 🚀! It sounds like you're exploring a fascinating application combining image text recognition with advanced AI techniques. This capability could indeed enhance many intelligent systems significantly.

If this is a 🐛 Bug Report, please provide a minimum reproducible example to help us debug it. If you have any initial work or proof-of-concept, sharing that would be very helpful.

For any custom training ❓ Questions, please provide as much detail as possible, including dataset examples and training logs. Also, ensure you're following the best practices for optimal training results.

This is an automated response to assist you quickly, and an Ultralytics engineer will follow up with you soon.

We're also thrilled to introduce YOLOv8 🚀, our latest state-of-the-art model designed to deliver outstanding performance in object detection, image segmentation, and image classification. If you’re interested, this could be a great tool for exploring your project further. 😊

@pderrenger
Member

Thank you for your interest in enhancing YOLOv5 with image text recognition and conversion capabilities. While YOLOv5 is primarily focused on object detection, integrating it with OCR tools like Tesseract or leveraging models like GANs for image generation could be a promising approach. We welcome your contribution and look forward to your PR submission!
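For anyone picking this up, here is a minimal sketch of how the detection and OCR halves might fit together. It is not part of YOLOv5 and not a proposed implementation. It assumes a YOLOv5 model custom-trained to detect text regions (the stock COCO weights have no text class), and the names `TextRegion`, `clamp_box`, `read_text_regions`, and `yolov5_tesseract_backends` are hypothetical. The detector, crop, and OCR steps are passed in as callables so the glue logic can be tried without torch or Tesseract installed.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple


@dataclass
class TextRegion:
    box: Tuple[int, int, int, int]  # left, top, right, bottom in pixels
    confidence: float
    text: str


def clamp_box(xyxy: Sequence[float], width: int, height: int,
              pad: int = 2) -> Tuple[int, int, int, int]:
    """Pad a detection box slightly and clip it to the image bounds."""
    x1, y1, x2, y2 = xyxy
    return (max(0, int(x1) - pad), max(0, int(y1) - pad),
            min(width, int(x2) + pad), min(height, int(y2) + pad))


def read_text_regions(image, size: Tuple[int, int], detect: Callable,
                      crop: Callable, ocr: Callable,
                      min_conf: float = 0.25) -> List[TextRegion]:
    """Detect text boxes, OCR each crop, and return regions in reading order.

    detect(image) must yield rows shaped like [x1, y1, x2, y2, conf, ...].
    """
    width, height = size
    regions = []
    for x1, y1, x2, y2, conf, *_ in detect(image):
        if conf < min_conf:
            continue  # drop low-confidence detections
        box = clamp_box((x1, y1, x2, y2), width, height)
        if box[2] <= box[0] or box[3] <= box[1]:
            continue  # skip degenerate boxes
        regions.append(TextRegion(box, float(conf), ocr(crop(image, box)).strip()))
    # Approximate reading order: top-to-bottom, then left-to-right.
    regions.sort(key=lambda r: (r.box[1], r.box[0]))
    return regions


def yolov5_tesseract_backends(weights: str):
    """Build detect/crop/ocr callables for PIL images.

    Assumes torch, pytesseract, and the Tesseract binary are installed, and
    that `weights` points to a custom text-detection checkpoint.
    """
    import torch
    import pytesseract

    model = torch.hub.load("ultralytics/yolov5", "custom", path=weights)

    def detect(image):
        # results.xyxy[0] holds one row per box: x1, y1, x2, y2, conf, class
        return model(image).xyxy[0].tolist()

    def crop(image, box):
        return image.crop(box)  # PIL crop takes (left, upper, right, lower)

    return detect, crop, pytesseract.image_to_string
```

With a PIL image loaded from a hypothetical `page.png` and hypothetical weights `text_detector.pt`, usage would look like `detect, crop, ocr = yolov5_tesseract_backends("text_detector.pt")` followed by `read_text_regions(image, image.size, detect, crop, ocr)`. The GAN-based re-rendering step from the description is out of scope for this sketch.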


3 participants