Implementing image text recognition and automatic conversion based on YOLOv5 combined with AI technology #13381

1259xcdh · 2024-10-24T17:03:33Z

Search before asking

I have searched the YOLOv5 issues and found no similar feature requests.

Description

The ability to automatically recognize and convert text from images into text or even images will greatly enhance the efficiency of intelligent systems in document processing, visual search, and information acquisition. By using image generation models (such as GAN), the recognized text content is regenerated into images that meet specific formatting and style requirements.

Use case

No response

Additional

No response

Are you willing to submit a PR?

Yes I'd like to help by submitting a PR!

UltralyticsAssistant · 2024-10-24T17:05:06Z

👋 Hello @1259xcdh, thank you for your interest in YOLOv5 🚀! It sounds like you're exploring a fascinating application combining image text recognition with advanced AI techniques. This capability could indeed enhance many intelligent systems significantly.

If this is a 🐛 Bug Report, please provide a minimum reproducible example to help us debug it. If you have any initial work or proof-of-concept, sharing that would be very helpful.

For any custom training ❓ Questions, please provide as much detail as possible, including dataset examples and training logs. Also, ensure you're following the best practices for optimal training results.

This is an automated response to assist you quickly, and an Ultralytics engineer will follow up with you soon.

We're also thrilled to introduce YOLOv8 🚀, our latest state-of-the-art model designed to deliver outstanding performance in object detection, image segmentation, and image classification. If you’re interested, this could be a great tool for exploring your project further. 😊

pderrenger · 2024-11-09T13:14:26Z

Thank you for your interest in enhancing YOLOv5 with image text recognition and conversion capabilities. While YOLOv5 is primarily focused on object detection, integrating it with OCR tools like Tesseract or leveraging models like GANs for image generation could be a promising approach. We welcome your contribution and look forward to your PR submission!

1259xcdh added the enhancement New feature or request label Oct 24, 2024

This was referenced Oct 24, 2024

Add files via upload #13382

Closed

Project Proposal and Pseudocode #13383

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementing image text recognition and automatic conversion based on YOLOv5 combined with AI technology #13381

Implementing image text recognition and automatic conversion based on YOLOv5 combined with AI technology #13381

1259xcdh commented Oct 24, 2024

UltralyticsAssistant commented Oct 24, 2024

pderrenger commented Nov 9, 2024

Implementing image text recognition and automatic conversion based on YOLOv5 combined with AI technology #13381

Implementing image text recognition and automatic conversion based on YOLOv5 combined with AI technology #13381

Comments

1259xcdh commented Oct 24, 2024

Search before asking

Description

Use case

Additional

Are you willing to submit a PR?

UltralyticsAssistant commented Oct 24, 2024

pderrenger commented Nov 9, 2024