|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectnet.firstpartners.rp.back.extractor.GenericDataExtractor
public class GenericDataExtractor
Extracts information from a binary files in a format that can be added to an Index.
Field Summary | |
---|---|
private java.lang.String[] |
listExtensions
Supported extension list |
protected org.apache.log4j.Logger |
logger
Logger for this class and subclasses |
private int |
maxLengthSummary
Maxim length for summary |
private int |
minLengthWord
Minim length for the words |
private java.lang.String |
notIgnoreChars
String pattern which defines the characters not to ignore |
private java.lang.String |
replaceChars
String pattern which defines the characters to be replaced |
private java.lang.String |
type
Extractor type |
Constructor Summary | |
---|---|
GenericDataExtractor()
|
Method Summary | |
---|---|
int |
canHandle(INewInformation info)
How well the plugin thinks it can handle a new piece of information |
void |
convert(INewInformation info)
Convert the file information into tuples |
java.lang.String[] |
getListExtensions()
List of the supported extensions |
int |
getMaxLengthSummary()
|
int |
getMinLengthWord()
|
java.lang.String |
getNotIgnoreChars()
|
java.lang.String |
getOriginalUri()
The original place where we got this data |
java.lang.String |
getReplaceChars()
|
java.lang.String |
getType()
The type of the extractor |
void |
onLoad()
Carry out any initiation tasks |
void |
setListExtensions(java.lang.String[] listExtensions)
|
void |
setMaxLengthSummary(int maxLengthSummary)
|
void |
setMinLengthWord(int minLengthWord)
|
void |
setNotIgnoreChars(java.lang.String notIgnoreChars)
|
void |
setReplaceChars(java.lang.String replaceChars)
|
void |
setType(java.lang.String type)
|
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
protected final org.apache.log4j.Logger logger
private java.lang.String type
private java.lang.String[] listExtensions
private int minLengthWord
private int maxLengthSummary
private java.lang.String notIgnoreChars
private java.lang.String replaceChars
Constructor Detail |
---|
public GenericDataExtractor()
Method Detail |
---|
public java.lang.String getOriginalUri()
getOriginalUri
in interface IDataExtractor
public void onLoad()
onLoad
in interface IPlugin
public int canHandle(INewInformation info)
canHandle
in interface IDataExtractor
info
- Information to be handled
public void convert(INewInformation info) throws RpException
convert
in interface IDataExtractor
info
- Information to be converted
RpException
- If an error occur in processing the filepublic int getMaxLengthSummary()
public void setMaxLengthSummary(int maxLengthSummary)
maxLengthSummary
- The maxLengthSummary to set.public int getMinLengthWord()
public void setMinLengthWord(int minLengthWord)
minLengthWord
- The minLengthWord to set.public java.lang.String getNotIgnoreChars()
public void setNotIgnoreChars(java.lang.String notIgnoreChars)
notIgnoreChars
- The notIgnoreChars to set.public java.lang.String getReplaceChars()
public void setReplaceChars(java.lang.String replaceChars)
replaceChars
- The replaceChars to set.public java.lang.String getType()
IDataExtractor
getType
in interface IDataExtractor
public void setType(java.lang.String type)
type
- The type to set.public java.lang.String[] getListExtensions()
IDataExtractor
getListExtensions
in interface IDataExtractor
public void setListExtensions(java.lang.String[] listExtensions)
listExtensions
- The listExtensions to set.
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |