|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectnet.firstpartners.rp.back.extractor.GenericDataExtractor
net.firstpartners.rp.back.extractor.PdfDataExtractor
public class PdfDataExtractor
Extracts information from pdf files in a format that can be stored and added to the Index.
Field Summary | |
---|---|
protected org.apache.log4j.Logger |
logger
Logger for this class and subclasses |
Constructor Summary | |
---|---|
PdfDataExtractor()
|
Method Summary | |
---|---|
int |
canHandle(INewInformation info)
How well the plugin thinks it can handle a new piece of information |
void |
convert(INewInformation info)
Convert the file information into tuples |
java.lang.String |
getOriginalUri()
The original place where we got this data |
void |
onLoad()
Carry out any initiation tasks |
Methods inherited from class net.firstpartners.rp.back.extractor.GenericDataExtractor |
---|
getListExtensions, getMaxLengthSummary, getMinLengthWord, getNotIgnoreChars, getReplaceChars, getType, setListExtensions, setMaxLengthSummary, setMinLengthWord, setNotIgnoreChars, setReplaceChars, setType |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
protected final org.apache.log4j.Logger logger
Constructor Detail |
---|
public PdfDataExtractor()
Method Detail |
---|
public java.lang.String getOriginalUri()
getOriginalUri
in interface IDataExtractor
getOriginalUri
in class GenericDataExtractor
public void onLoad()
onLoad
in interface IPlugin
onLoad
in class GenericDataExtractor
public int canHandle(INewInformation info)
canHandle
in interface IDataExtractor
canHandle
in class GenericDataExtractor
info
- Information to be handled
public void convert(INewInformation info) throws RpException
convert
in interface IDataExtractor
convert
in class GenericDataExtractor
info
- Information to be converted
RpException
- If an error occur in processing the file
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |