Wikisource talk:Google OCR

Proprietary OCR

Latest comment: 8 years ago2 comments2 people in discussion

Where were the Google terms cleared for this usage? There was some discussion on Wikisource-l about the generic Google API terms and the situation was not so clear. Nemo 17:42, 16 September 2016 (UTC)Reply

@Nemo_bis: That's a good question. I assume this usage was cleared, because Google have donated some number of Vision API requests for this purpose. Whether that extends to other APIs I don't know. It's sounding like it'd be good to be able to get a similar thing going for the Google Drive API (to use its OCR system as well, which appears to be a bit different to the Cloud Vision one). —SWilson (WMF) (talk) 11:20, 18 September 2016 (UTC)Reply

Punjabi Wikisoure

Latest comment: 7 years ago1 comment1 person in discussion

The OCR is not working for Punjabi Wikisource--Parveer Grewal (talk) 12:55, 25 February 2017 (UTC)Reply

Kannada Wikisource

Latest comment: 6 years ago3 comments3 people in discussion

The OCR button is not working in Kannada Wikisource. Whenever we trying using it we get an error saying undefined Invalid language hints.--Ananth subray (talk) 09:15, 18 October 2017 (UTC)Reply

@Ananth subray: Unfortunately, it doesn't yet support Kannada. :-( The list of available languages is at https://cloud.google.com/vision/docs/languages — there's a note a the top of this wiki page about it. Unsupported languages are: Malayalam, Telugu, Oriya, Gujrati, and Kannada. We're still trying to get Google to use the same OCR engine for this as they use for Google Drive; we'll be sure to post here when we get some news! Sam Wilson 05:25, 19 October 2017 (UTC)Reply

They really should be using the Google Drive OCR engine 🤔 The current one doesn't treat vertical Chinese text properly. Suzukaze-c (talk) 06:55, 30 June 2018 (UTC)Reply

Commons user script

Latest comment: 6 years ago1 comment1 person in discussion

I've been experimenting with a OCR button at Commons, that uses this same service: commons:User:Samwilson/GoogleOCR.js. It can be used to populate the inscription template there. Sam Wilson 04:41, 2 August 2018 (UTC)Reply

Raw OCR Results?

Latest comment: 5 years ago1 comment1 person in discussion

I'm working on a project to extract genes and other biological entities from scientific figures/diagrams like this WNT Pathway, and I'd like to feed the OCR results from your tool into my code that identifies bioentities.

Would it be possible to get the raw OCR results from your tool? There are two reasons the raw results would be useful: 1) positional information and 2) better ability to extract bio-entities. (Because the text on the diagrams doesn't have the typical page/paragraph/sentence/word structure, sometimes the list of words will actually split a single bio-entity or merge multiple bioentities.) Ariutta (talk) 19:23, 25 June 2019 (UTC)Reply

Using Google OCR for old English text

Latest comment: 4 years ago1 comment1 person in discussion

Hi, I'm running a project to upload 3,000 chapbooks from the National Library of Scotland's digitised collections and we're interested in using the Google OCR function instead of Tesseract because it identifies the long f/s letter (ſ) really well. Do you think this would be an acceptable use? https://en.wikisource.org/wiki/Wikisource:WikiProject_NLS Gweduni (talk) 06:57, 30 April 2020 (UTC)Reply