Little Things: Google’s Sinhala OCR and a Lankan URL shortener


“Want to convert your Sinhala pdfs or images into texts? Sinhala OCR is now available with Google Docs. Simply open your image/pdf with Google Docs and scroll down to see the magic!”

This little snippet of tech news was shared with us by Googler Keshan Sodimana, who’s known for breaking all the cool Google stuff to the Sri Lankan social media communities long before media get around to doing it. While Sinhala OCR has been worked on for some time, especially by the UCSC’s Language Technology Research Laboratory, current implementations tend to be of the offline variety. Don’t expect this solution to be perfect, but it’s a huge step up from what we had before. 

“If you’ve got a URL as long as an anaconda…”

Thambaru Wijesekara of Ceylon Systems blogged about a local URL shortener that they’ve come up with called Twt.lkIt’s really quite simple – paste URL, shorten, done. The cool thing is that it lets you track visits to that URL by signing up with Facebook, your email or G+. We shortened and ran it through GTmetrix to see just how much of a lag is introduced, and we got a page load time of 3.92 seconds versus our usual 2.7-2.8 time. Not bad for use on Twitter. 



  1. Proper Sinhala OCR: This is the best news I’ve heard all day. Truly a game changer.

    I remember trying out the offline implementations done by UCSC back then but never got them to work properly. Just tried this off a screenshot of some text off my blog and it was 100% perfect. I guess the output quality might change with scan quality and stuff but this is just amazing!

  2. Do not abuse the trust of one’s mentor by constantly requesting information that you might get by reading.

    Be sure to avoid horizontal scrolling, which
    is never recommended by web page design standards. Some realtors wish to take additional classes or subjects so they really will likely be updated and still have a good edge in the business.


Please enter your comment!
Please enter your name here