Solved issue #115 #525

arkya-art · 2021-08-26T19:55:55Z

Hi @rkcosmos, this PR resolves issue #115, posted by @apiszcz on Jul 11, 2020. I created a function named readtextlang() that produces its result in the same way as readtext(). It stores all the text detected in the image in a list and then uses two helper functions:

one that searches for a given character in the character text files
a second that adds the detected language to the result obtained from readtext()
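The two helpers described above can be sketched in isolation, roughly as follows. This is a minimal illustration and not the code in this PR: the names contains_char and add_language are hypothetical, and the sample entry only assumes the (bounding box, text, confidence) tuple shape that the code below indexes into.

```python
from typing import Iterable, Tuple


def contains_char(charset: Iterable[str], ch: str) -> bool:
    """Return True if ch is one of the characters in charset (hypothetical helper)."""
    return ch in set(charset)


def add_language(entry: Tuple, lang: str) -> Tuple:
    """Return a copy of a readtext()-style tuple with a language code appended."""
    return entry + (lang,)


# Hypothetical readtext()-style entry: (bounding box, text, confidence)
entry = ([[0, 0], [10, 0], [10, 10], [0, 10]], "A", 0.98)

# Tag the entry only when its text is found in the (made-up) character set
tagged = add_language(entry, "en") if contains_char("ABC", entry[1]) else entry
```

Returning a new tuple instead of modifying the entry keeps the original readtext() output untouched, so the caller can still use it as before.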

Output Image File

https://drive.google.com/file/d/1KTV6l1wMg2dc5IIQZZhYySvi9AUmH7Md/view?usp=sharing

Methodology

The function iterates over each file in the characters/ directory and reads the characters it contains into a list. Each detected entry is then compared against those characters, and matches are reported for two of the most widely used languages, English and Chinese (files whose names begin with "en" or "ch").
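As a rough sketch of this methodology, the character files can be loaded into one set per language prefix and each detected string checked against those sets. This is an assumption-laden variation, not the PR's implementation: load_charsets and detect_languages are hypothetical names, the demo files are throwaway stand-ins written to a temporary directory, and the check requires every character of a string to belong to the set rather than comparing whole entries.

```python
import os
import tempfile


def load_charsets(directory, langs=("en", "ch")):
    """Map each two-letter filename prefix in langs to the characters in its .txt files."""
    charsets = {}
    for filename in sorted(os.listdir(directory)):
        prefix = filename[:2]
        if filename.endswith(".txt") and prefix in langs:
            with open(os.path.join(directory, filename), encoding="utf8") as f:
                chars = set(f.read()) - {"\n", "\r"}  # drop line separators
            charsets.setdefault(prefix, set()).update(chars)
    return charsets


def detect_languages(text, charsets):
    """Return the sorted prefixes whose character set covers every character of text."""
    if not text:
        return []
    return sorted(lang for lang, chars in charsets.items() if set(text) <= chars)


# Demo with stand-in files; the real character files are assumed to list one character per line
with tempfile.TemporaryDirectory() as tmp:
    samples = {"en_char.txt": "a\nb\nc\n", "ch_sim_char.txt": "中\n文\n"}
    for name, content in samples.items():
        with open(os.path.join(tmp, name), "w", encoding="utf8") as f:
            f.write(content)
    demo_charsets = load_charsets(tmp)

demo_en = detect_languages("abc", demo_charsets)
demo_ch = detect_languages("中文", demo_charsets)
```

Using sets makes each membership test constant time, which matters when a character file holds several thousand entries.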

Code

 `def readtextlang(self, image, decoder = 'greedy', beamWidth= 5, batch_size = 1,\
             workers = 0, allowlist = None, blocklist = None, detail = 1,\
             rotation_info = None, paragraph = False, min_size = 20,\
             contrast_ths = 0.1,adjust_contrast = 0.5, filter_ths = 0.003,\
             text_threshold = 0.7, low_text = 0.4, link_threshold = 0.4,\
             canvas_size = 2560, mag_ratio = 1.,\
             slope_ths = 0.1, ycenter_ths = 0.5, height_ths = 0.5,\
             width_ths = 0.5, y_ths = 0.5, x_ths = 1.0, add_margin = 0.1, output_format='standard'):
    '''
    Parameters:
    image: file path or numpy-array or a byte stream object
    '''
    img, img_cv_grey = reformat_input(image)

    horizontal_list, free_list = self.detect(img, min_size, text_threshold,\
                                             low_text, link_threshold,\
                                             canvas_size, mag_ratio,\
                                             slope_ths, ycenter_ths,\
                                             height_ths,width_ths,\
                                             add_margin, False)
    # get the 1st result from hor & free list as self.detect returns a list of depth 3
    horizontal_list, free_list = horizontal_list[0], free_list[0]
    result = self.recognize(img_cv_grey, horizontal_list, free_list,\
                            decoder, beamWidth, batch_size,\
                            workers, allowlist, blocklist, detail, rotation_info,\
                            paragraph, contrast_ths, adjust_contrast,\
                            filter_ths, y_ths, x_ths, False, output_format)
   
    char = []
    directory = 'characters/'
    for i in range(len(result)):
        char.append(result[i][1])
    
    def search(arr, x):
        # Return 1 if any element of arr equals x, otherwise -1
        for item in arr:
            if item == x:
                return 1
        return -1
    def tupleadd(i):
        a = result[i]
        b = a + (filename[0:2],)
        return b
    
    for filename in os.listdir(directory):
        if filename.endswith(".txt"):
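            with open(os.path.join(directory, filename), 'rt', encoding="utf8") as myfile:
                # Join the file's lines into one plain string of characters
                # (str() on the list would also add brackets, quotes and commas)
                chartrs = myfile.read().replace('\n', '')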
                for i in range(len(char)):
                    res = search(chartrs,char[i])
                    if res != -1:
                        if filename[0:2]=="en" or filename[0:2]=="ch":
                            print(tupleadd(i))`

Solved issue JaidedAI#115

1673b67

rkcosmos merged commit 78297cc into JaidedAI:master Sep 11, 2021

thuc-moreh pushed a commit to moreh-dev/EasyOCR that referenced this pull request Jul 5, 2023

Merge pull request JaidedAI#525 from arkya-art/master

f1996ae

Solved issue JaidedAI#115
