Facilitate OCR on images using EasyOCR library for text extraction in multiple languages with CPU/GPU support.
Apply EasyOCR is a powerful node designed to facilitate Optical Character Recognition (OCR) on images using the EasyOCR library. This node allows you to extract text from images, making it an invaluable tool for tasks that require text recognition and extraction from various image formats. By leveraging EasyOCR, the node supports multiple languages and can operate on both CPU and GPU, providing flexibility and efficiency. The primary goal of Apply EasyOCR is to simplify the process of text detection and recognition in images, enabling you to seamlessly integrate OCR capabilities into your AI art projects without needing extensive technical knowledge.
This parameter expects an image input in which the text needs to be detected and recognized. The image should be in a format supported by the PIL library, such as JPEG or PNG. The quality and resolution of the image can impact the accuracy of the OCR results.
The gpu parameter is a boolean that determines whether the OCR process runs on the GPU for faster processing. The default value is True, which means the GPU will be used if available. Setting it to False forces the node to use the CPU, which may be slower but is useful when no GPU is available.
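The fallback described here can be sketched in a few lines. This is not the node's actual implementation: the helper names cuda_available, resolve_gpu, and make_reader are hypothetical, and the sketch assumes that GPU support is detected through PyTorch, which EasyOCR builds on. The only EasyOCR call used is easyocr.Reader with its gpu argument.

```python
import importlib.util


def cuda_available() -> bool:
    """Return True only if PyTorch is installed and reports a usable CUDA device."""
    if importlib.util.find_spec("torch") is None:
        return False
    import torch  # imported lazily so the helper works without PyTorch
    return bool(torch.cuda.is_available())


def resolve_gpu(requested: bool) -> bool:
    """Keep gpu=False as-is; downgrade gpu=True to False when no CUDA device exists."""
    return requested and cuda_available()


def make_reader(languages, gpu=True):
    """Build an EasyOCR reader (hypothetical wrapper; requires the easyocr package)."""
    import easyocr
    return easyocr.Reader(languages, gpu=resolve_gpu(gpu))
```

Calling make_reader(["en"], gpu=True) on a machine without CUDA would then create a CPU reader instead of failing on the GPU request.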
The detect parameter lets you choose between two options: choose and input. If set to choose, the node uses the language selected in the language_list parameter. If set to input, it uses the languages specified in the language_name parameter. The default value is choose.
The language_list parameter provides a list of languages that the OCR process can recognize. You can select from a predefined list of languages such as English, Simplified Chinese, Traditional Chinese, Arabic, and many more. The default language is English. This parameter is used when detect is set to choose.
The language_name parameter is a string that specifies the OCR languages by their codes, separated by commas (e.g., ch_sim,en). This parameter is used when detect is set to input. The default value is ch_sim,en.
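How detect, language_list, and language_name interact can be illustrated with a small sketch. The function resolve_languages and the LANGUAGE_CODES table are hypothetical and are not taken from the node's source. The table covers only the four languages named above and assumes the dropdown shows them under those labels; the codes themselves (en, ch_sim, ch_tra, ar) are EasyOCR language codes.

```python
# Assumed mapping from dropdown labels to EasyOCR codes (illustrative subset only).
LANGUAGE_CODES = {
    "English": "en",
    "Simplified Chinese": "ch_sim",
    "Traditional Chinese": "ch_tra",
    "Arabic": "ar",
}


def resolve_languages(detect, language_list="English", language_name="ch_sim,en"):
    """Return the list of language codes to pass to the OCR reader."""
    if detect == "choose":
        # Use the single language picked from the predefined list.
        return [LANGUAGE_CODES[language_list]]
    if detect == "input":
        # Split the comma-separated codes and drop blanks and stray spaces.
        codes = [code.strip() for code in language_name.split(",")]
        return [code for code in codes if code]
    raise ValueError(f"detect must be 'choose' or 'input', got {detect!r}")


print(resolve_languages("choose", language_list="English"))
print(resolve_languages("input", language_name="ch_sim, en"))
```

With the defaults described above, choose resolves to a single code for English, while input resolves to the two codes in ch_sim,en.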
This output parameter returns the original image with bounding boxes drawn around the detected text areas. The bounding boxes help visualize where the text has been detected in the image, making it easier to verify the OCR results.
This output parameter provides a mask image where the detected text areas are highlighted. The mask can be useful for further image processing tasks, such as extracting the text regions or applying additional filters.
This output parameter returns a JSON object containing detailed information about the detected text, including the text content, bounding box coordinates, and confidence scores. This structured data can be used for further analysis or integration into other applications.
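As a rough illustration of working with this output downstream, the sketch below keeps only the text whose confidence meets a threshold. The exact schema of the JSON output is not documented here, so the field names text, bbox, and confidence, the list-of-objects layout, and the sample values are all assumptions; inspect the node's real output and adjust the keys before relying on this.

```python
import json
from typing import List


def confident_text(json_payload: str, min_confidence: float = 0.5) -> List[str]:
    """Return the text of every entry whose confidence is at least min_confidence."""
    entries = json.loads(json_payload)
    return [e["text"] for e in entries if e["confidence"] >= min_confidence]


# Made-up sample in the assumed layout: one object per detected text region.
sample = json.dumps([
    {"text": "HELLO", "bbox": [[10, 10], [90, 10], [90, 40], [10, 40]], "confidence": 0.98},
    {"text": "w0rld", "bbox": [[10, 50], [90, 50], [90, 80], [10, 80]], "confidence": 0.31},
])

print(confident_text(sample))
```

Raising min_confidence trades recall for precision, which can help when low-quality images produce many spurious detections.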
Enable the gpu parameter to leverage GPU acceleration for faster processing, especially when working with large images or multiple images.
Set the language_list or language_name parameter to match the text in your images for better recognition results.
Use the MASK output to isolate text regions for additional processing or to create visual effects in your AI art projects.
Check model_storage_directory and confirm that the directory path is correct and accessible.
Ensure that the language specified in language_name or selected in language_list is correct and supported by EasyOCR. Refer to the list of supported languages in the documentation.
The gpu parameter is set to True, but a compatible GPU is not available. Set the gpu parameter to False to use the CPU for OCR processing, or ensure that a compatible GPU is properly installed and configured on your system.
© Copyright 2024 RunComfy. All Rights Reserved.