ComfyUI > Nodes > ComfyUI-EasyOCR > Apply EasyOCR

ComfyUI Node: Apply EasyOCR

Class Name

Apply EasyOCR

Category
ComfyUI-EasyOCR
Author
prodogape (Account age: 1060days)
Extension
ComfyUI-EasyOCR
Latest Updated
2024-07-11
Github Stars
0.01K

How to Install ComfyUI-EasyOCR

Install this extension via the ComfyUI Manager by searching for ComfyUI-EasyOCR
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI-EasyOCR in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • High-speed GPU machines
  • 200+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 50+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

Apply EasyOCR Description

Facilitate OCR on images using EasyOCR library for text extraction in multiple languages with CPU/GPU support.

Apply EasyOCR:

Apply EasyOCR is a powerful node designed to facilitate Optical Character Recognition (OCR) on images using the EasyOCR library. This node allows you to extract text from images, making it an invaluable tool for tasks that require text recognition and extraction from various image formats. By leveraging EasyOCR, the node supports multiple languages and can operate on both CPU and GPU, providing flexibility and efficiency. The primary goal of Apply EasyOCR is to simplify the process of text detection and recognition in images, enabling you to seamlessly integrate OCR capabilities into your AI art projects without needing extensive technical knowledge.

Apply EasyOCR Input Parameters:

image

This parameter expects an image input in which the text needs to be detected and recognized. The image should be in a format supported by the PIL library, such as JPEG or PNG. The quality and resolution of the image can impact the accuracy of the OCR results.

gpu

This boolean parameter determines whether the OCR process should utilize the GPU for faster processing. The default value is True, which means the GPU will be used if available. Setting this to False will force the node to use the CPU, which might be slower but can be useful if a GPU is not available.

detect

This parameter allows you to choose between two options: choose and input. If set to choose, the node will use the languages specified in the language_list parameter. If set to input, it will use the languages specified in the language_name parameter. The default value is choose.

language_list

This parameter provides a list of languages that the OCR process can recognize. You can select from a predefined list of languages such as English, Simplified Chinese, Traditional Chinese, Arabic, and many more. The default language is English. This parameter is used when detect is set to choose.

language_name

This string parameter allows you to specify the languages for OCR by their codes, separated by commas (e.g., ch_sim,en). This parameter is used when detect is set to input. The default value is ch_sim,en.

Apply EasyOCR Output Parameters:

IMAGE

This output parameter returns the original image with bounding boxes drawn around the detected text areas. The bounding boxes help visualize where the text has been detected in the image, making it easier to verify the OCR results.

MASK

This output parameter provides a mask image where the detected text areas are highlighted. The mask can be useful for further image processing tasks, such as extracting the text regions or applying additional filters.

JSON

This output parameter returns a JSON object containing detailed information about the detected text, including the text content, bounding box coordinates, and confidence scores. This structured data can be used for further analysis or integration into other applications.

Apply EasyOCR Usage Tips:

  • Ensure that the input image is clear and has good contrast between the text and the background to improve OCR accuracy.
  • Use the gpu parameter to leverage GPU acceleration for faster processing, especially when working with large images or multiple images.
  • Select the appropriate languages in the language_list or language_name parameters to match the text in your images for better recognition results.
  • Utilize the MASK output to isolate text regions for additional processing or to create visual effects in your AI art projects.

Apply EasyOCR Common Errors and Solutions:

"Model files not found in the specified directory"

  • Explanation: This error occurs when the EasyOCR model files are not available in the specified directory.
  • Solution: Ensure that the model files are correctly downloaded and placed in the directory specified by model_storage_directory. You can also check if the directory path is correct and accessible.

"Unsupported image format"

  • Explanation: This error occurs when the input image format is not supported by the PIL library.
  • Solution: Convert your image to a supported format such as JPEG or PNG before passing it to the node.

"Language not supported"

  • Explanation: This error occurs when the specified language code is not supported by EasyOCR.
  • Solution: Verify that the language code provided in language_name or selected in language_list is correct and supported by EasyOCR. Refer to the list of supported languages in the documentation.

"GPU not available"

  • Explanation: This error occurs when the gpu parameter is set to True, but a compatible GPU is not available.
  • Solution: Set the gpu parameter to False to use the CPU for OCR processing, or ensure that a compatible GPU is properly installed and configured on your system.

Apply EasyOCR Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI-EasyOCR
RunComfy

© Copyright 2024 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals.