The Portal recognition utility is one part of a larger solution that consists of a Crawler, an OCR utility, and a Website. The Crawler scrapes data from a website on a regular basis and saves the scraped data to the Crawler database. The OCR utility processes the data in the Crawler database and converts it to text. The text is then uploaded to the database of a separate Website, where users can search for a keyword or a set of words and get back all text documents related to the search terms.
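The data flow described above can be illustrated with a minimal sketch. It is not the actual implementation: the table names (scraped, documents), the function names, and the use of in-memory SQLite databases are all assumptions made for illustration, and the OCR step is replaced by a placeholder that simply decodes bytes. The search also assumes that a document must contain every search term, which the description above does not specify.

```python
import sqlite3


def fake_ocr(content: bytes) -> str:
    # Placeholder for the real OCR step (assumption): the "scan" here is
    # just UTF-8 encoded text, so decoding stands in for recognition.
    return content.decode("utf-8")


def make_demo_databases():
    # Hypothetical schemas; the real Crawler and Website databases may differ.
    crawler_db = sqlite3.connect(":memory:")
    crawler_db.execute("CREATE TABLE scraped (id INTEGER PRIMARY KEY, content BLOB)")
    crawler_db.executemany(
        "INSERT INTO scraped (id, content) VALUES (?, ?)",
        [
            (1, b"Annual budget report"),
            (2, b"Meeting minutes, budget committee"),
            (3, b"Public tender notice"),
        ],
    )
    website_db = sqlite3.connect(":memory:")
    website_db.execute("CREATE TABLE documents (id INTEGER PRIMARY KEY, body TEXT)")
    return crawler_db, website_db


def run_pipeline(crawler_db, website_db) -> int:
    # Read scraped items from the Crawler database, convert each one to text,
    # and upload the text to the Website database.
    rows = crawler_db.execute("SELECT id, content FROM scraped").fetchall()
    for doc_id, content in rows:
        website_db.execute(
            "INSERT INTO documents (id, body) VALUES (?, ?)",
            (doc_id, fake_ocr(content)),
        )
    website_db.commit()
    return len(rows)


def search(website_db, terms):
    # Return the ids of documents that contain every search term
    # (case-insensitive substring match; an assumption, see above).
    if not terms:
        return []
    where = " AND ".join("LOWER(body) LIKE ?" for _ in terms)
    params = [f"%{term.lower()}%" for term in terms]
    query = f"SELECT id FROM documents WHERE {where} ORDER BY id"
    return [row[0] for row in website_db.execute(query, params)]
```

With the demo data, running the pipeline and then calling search(website_db, ["budget"]) returns the ids of the two documents that mention "budget". A production system would more likely use a full-text index than substring matching, but the three stages stay the same.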
Tools and Technologies