public class Tesseract3TextExtractor
extends org.apache.jackrabbit.extractor.AbstractTextExtractor
Constructor and Description |
---|
Tesseract3TextExtractor() Creates a new TextExtractor instance. |
Modifier and Type | Method and Description |
---|---|
String | doOcr(File tmpFileIn) Performs OCR on an image file. |
String | doOcr(String ocr, File tmpFileIn) Performs OCR on an image file. |
String | extractText(File input) Extracts text from an image using Tesseract OCR. |
Reader | extractText(InputStream stream, String type, String encoding) |
String | extractText(String ocr, File input) Extracts text from an image using Tesseract OCR. |
Reader | extractText(String ocr, InputStream stream, String type, String encoding) |
public Tesseract3TextExtractor()
Creates a new TextExtractor instance.
public Reader extractText(InputStream stream, String type, String encoding) throws IOException
Throws: IOException
public String extractText(File input) throws IOException
Throws: IOException
public Reader extractText(String ocr, InputStream stream, String type, String encoding) throws IOException
Throws: IOException
public String extractText(String ocr, File input) throws IOException
Throws: IOException
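
The relationship between the stream-based and file-based extractText overloads is not described on this page. One plausible arrangement, offered only as a sketch, is that the stream is spooled to a temporary file which is then handed to a file-based OCR call, with the recognized text wrapped in a Reader. The class StreamToFileSketch, the FileOcr interface, and the method extractViaTempFile below are hypothetical names introduced for illustration; they are not part of Tesseract3TextExtractor.

```java
import java.io.ByteArrayInputStream;
import java.io.File;
import java.io.IOException;
import java.io.InputStream;
import java.io.Reader;
import java.io.StringReader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.StandardCopyOption;

final class StreamToFileSketch {

    /** Hypothetical stand-in for a file-based call such as extractText(File). */
    interface FileOcr {
        String apply(File image) throws IOException;
    }

    /**
     * Copies the stream to a temporary file, runs the file-based OCR step on it,
     * and returns the recognized text as a Reader. This is an assumed design,
     * not the documented behavior of the extractor.
     */
    static Reader extractViaTempFile(InputStream stream, FileOcr ocr) throws IOException {
        File tmp = File.createTempFile("ocr-input", ".img");
        try {
            // createTempFile already created the file, so it must be replaced.
            Files.copy(stream, tmp.toPath(), StandardCopyOption.REPLACE_EXISTING);
            return new StringReader(ocr.apply(tmp));
        } finally {
            Files.deleteIfExists(tmp.toPath());
        }
    }

    public static void main(String[] args) throws IOException {
        InputStream in = new ByteArrayInputStream("fake image bytes".getBytes(StandardCharsets.UTF_8));
        // A fake OCR step that only reports the spooled file size.
        Reader reader = extractViaTempFile(in, image -> "bytes=" + image.length());
        StringBuilder text = new StringBuilder();
        for (int c = reader.read(); c != -1; c = reader.read()) {
            text.append((char) c);
        }
        System.out.println(text);
    }
}
```

Passing the OCR step in as a small interface keeps the sketch runnable without Tesseract installed; in real use that argument would be a call into the extractor itself.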
public String doOcr(File tmpFileIn) throws IOException
Throws: IOException
public String doOcr(String ocr, File tmpFileIn) throws IOException
Throws: IOException
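
This page does not say how doOcr invokes Tesseract or what the ocr argument holds. The sketch below assumes that ocr is the path to the tesseract executable and that the work is done by launching it as an external process. It relies on the Tesseract command-line convention `tesseract <image> <outputbase>`, which writes the recognized text to `<outputbase>.txt`. The class TesseractCommandSketch and its methods are hypothetical and are not taken from the extractor's source.

```java
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.util.Arrays;
import java.util.List;

final class TesseractCommandSketch {

    /** Builds the command line: executable, input image, output base name. */
    static List<String> buildCommand(String ocr, File image, File outputBase) {
        return Arrays.asList(ocr, image.getPath(), outputBase.getPath());
    }

    /** Tesseract appends ".txt" to the output base when writing plain text. */
    static File resultFile(File outputBase) {
        return new File(outputBase.getPath() + ".txt");
    }

    /** Runs the (assumed) executable and returns the recognized text. */
    static String run(String ocr, File image) throws IOException, InterruptedException {
        File outputBase = File.createTempFile("ocr-out", "");
        File result = resultFile(outputBase);
        try {
            Process process = new ProcessBuilder(buildCommand(ocr, image, outputBase))
                    .inheritIO() // avoid blocking on unread process output
                    .start();
            int exitCode = process.waitFor();
            if (exitCode != 0) {
                throw new IOException("OCR process exited with code " + exitCode);
            }
            return new String(Files.readAllBytes(result.toPath()), StandardCharsets.UTF_8);
        } finally {
            Files.deleteIfExists(result.toPath());
            Files.deleteIfExists(outputBase.toPath());
        }
    }

    public static void main(String[] args) {
        // Only prints the command; calling run(...) requires Tesseract to be installed.
        List<String> command = buildCommand("tesseract", new File("scan.png"), new File("scan-out"));
        System.out.println(String.join(" ", command)); // prints "tesseract scan.png scan-out"
    }
}
```

Separating command construction from process execution makes the command easy to inspect or log before anything is launched.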
Copyright © 2016. All rights reserved.