org.apache.nutch.parse
Class ParseImpl
java.lang.Object
org.apache.nutch.parse.ParseImpl
- All Implemented Interfaces:
- Configurable, Writable, Parse
public class ParseImpl
- extends Object
- implements Parse, Writable, Configurable
The result of parsing a page's raw content.
- See Also:
Parser.getParse(Content)
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
ParseImpl
public ParseImpl()
ParseImpl
public ParseImpl(Parse parse)
ParseImpl
public ParseImpl(String text,
ParseData data)
ParseImpl
public ParseImpl(ParseText text,
ParseData data)
getText
public String getText()
- Description copied from interface:
Parse
- The textual content of the page. This is indexed, searched, and used when
generating snippets.
- Specified by:
getText
in interface Parse
getData
public ParseData getData()
- Description copied from interface:
Parse
- Other data extracted from the page.
- Specified by:
getData
in interface Parse
write
public final void write(DataOutput out)
throws IOException
- Specified by:
write
in interface Writable
- Throws:
IOException
readFields
public void readFields(DataInput in)
throws IOException
- Specified by:
readFields
in interface Writable
- Throws:
IOException
read
public static ParseImpl read(DataInput in,
Configuration conf)
throws IOException
- Throws:
IOException
setConf
public void setConf(Configuration conf)
- Specified by:
setConf
in interface Configurable
getConf
public Configuration getConf()
- Specified by:
getConf
in interface Configurable
Copyright © 2006 The Apache Software Foundation