Monday, August 15, 2016

Telugu OCR timeline

Rajasekaran and Deekshatulu (1977)
Rao and Ajitha (1995)
Sukhaswami, Seetharamulu and Pujari (1995)
Negi, Bhagvati and Krishna (2001)
Lakshmi and Patvardhan (2002)
Jawahar, Kumar and Kiran (2003)
Pujari et al. (2004)
Kumar et al. (2011)
By Rakesh Achanta and Trevor Hastie

Telugu తెలుగు


Just an aside:

"So all the Telugu guNintam letters combined will be the abugida of the Telugu language."

"Abugida as a term in linguistics was proposed by Peter T. Daniels in his 1990 typology of writing systems. ’Abugida is an Ethiopian name for the Ge‘ez script, taken from four letters of that script, 'ä bu gi da, in much the same way that abecedary is derived from Latin a be ce de, and alphabet is derived from the names of the two first letters in the Greek alphabet, alpha and beta."

In the Telugu language, "There are 16 vowels and 37 consonants which combine to give over 500 simple syllables. Of them about 400 are commonly used. There are nearly sixty other symbols including vowel-less consonants (m in Figure 1), punctuation and numbers. This brings the total number of symbols used in common writing to approximately 460."
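To make the "consonants combine with vowels" idea concrete, here is a minimal sketch of how one consonant's guNintam can be spelled out in Unicode, where each syllable is the consonant followed by a dependent vowel sign. The helper name `guninta` is hypothetical, and the list leaves out the anusvara and visarga forms that a traditional guNintam usually also includes. It only illustrates the encoding, not the OCR method from the paper.

```python
import unicodedata

# Dependent vowel signs, looked up by their official Unicode character names.
VOWEL_SIGNS = [
    unicodedata.lookup("TELUGU VOWEL SIGN " + suffix)
    for suffix in ("AA", "I", "II", "U", "UU", "VOCALIC R", "VOCALIC RR",
                   "E", "EE", "AI", "O", "OO", "AU")
]


def guninta(consonant):
    """Hypothetical helper: the bare consonant (inherent 'a') plus one form per vowel sign."""
    return [consonant] + [consonant + sign for sign in VOWEL_SIGNS]


KA = unicodedata.lookup("TELUGU LETTER KA")
print(" ".join(guninta(KA)))
```

Each printed syllable is a single glyph on paper but two code points in Unicode text, which is part of what an OCR system has to map back.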


"Our goal is to develop an end-to-end system that takes a page of Telugu text and converts it to Unicode text."
With 100% accuracy! This is the part where most researchers give up, and only a few people, like the late Steve Jobs, obsess.
