[tesseract-ocr] Reading image from Rubber

Taresh Chaudhari Mon, 25 Nov 2024 02:22:19 -0800

Hi,
I am trying to read the characters from an image in which the characters 
appear against a black background. I have attached the code I used for the 
extraction; currently it gives only partial output. Can you guide me on 
how to make it more accurate?



import pytesseract
from PIL import Image
pytesseract.pytesseract.tesseract_cmd = 'C:\\Users\\M562765\\AppData\\Local\\Programs\\Tesseract-OCR\\tesseract.exe'
# Paths to your images
image_paths = [
   'C:/Users/M562765/Downloads/Unable-images/Unable/crop1.jpg']

# Function to process an image and extract text
def extract_text_from_image(image_path):
    # Open the image
    img = Image.open(image_path)
    
    # Use pytesseract to perform OCR; PSM 6 assumes a single uniform block of text
    extracted_text = pytesseract.image_to_string(img, config='--psm 6')
    return extracted_text.strip()

# Process all images and print results
for img_path in image_paths:
    text = extract_text_from_image(img_path)
    print(f"Text extracted from {img_path}: {text}")
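For reference, one direction that is often tried with dark-background images is to preprocess the image before OCR. The following is only a minimal sketch under assumptions, not a confirmed fix for this image: the helper name `preprocess_for_ocr` and its `scale` and `threshold` parameters are hypothetical, and the default values are guesses that would need tuning. It converts to grayscale, stretches contrast, inverts the image when it is mostly dark (so the text ends up dark on a light background), upscales, and binarizes.

```python
from PIL import Image, ImageOps


def preprocess_for_ocr(img, scale=2, threshold=128):
    """Hypothetical helper: return an upscaled, binarized copy of img.

    scale and threshold are illustrative defaults, not tuned values.
    """
    # Drop color information and stretch the contrast to the full 0-255 range
    gray = ImageOps.autocontrast(ImageOps.grayscale(img))

    # If most pixels are dark, assume a dark background and invert the image
    hist = gray.histogram()
    if sum(hist[:128]) > sum(hist[128:]):
        gray = ImageOps.invert(gray)

    # Upscale so that small characters cover more pixels
    width, height = gray.size
    gray = gray.resize((width * scale, height * scale), Image.LANCZOS)

    # Binarize: pixels above the threshold become white, the rest black
    return gray.point(lambda p: 255 if p > threshold else 0)
```

The result can be passed to the same call as above, for example `pytesseract.image_to_string(preprocess_for_ocr(img), config='--psm 6')`. Whether this helps depends on the image; low-contrast or embossed characters may need a different threshold or other preprocessing.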

