org.apache.nutch.indexer
Class Indexer
java.lang.Object
org.apache.hadoop.util.ToolBase
org.apache.nutch.indexer.Indexer
- All Implemented Interfaces:
- Configurable, Closeable, JobConfigurable, Mapper, Reducer, Tool
public class Indexer
- extends ToolBase
- implements Reducer, Mapper
Create indexes for segments.
Nested Class Summary
static class Indexer.OutputFormat
Unwrap Lucene Documents created by reduce and add them to an index.
Fields inherited from class org.apache.hadoop.util.ToolBase
conf
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
DONE_NAME
public static final String DONE_NAME
- See Also:
- Constant Field Values
LOG
public static final org.apache.commons.logging.Log LOG
Indexer
public Indexer()
Indexer
public Indexer(Configuration conf)
configure
public void configure(JobConf job)
- Specified by:
configure
in interface JobConfigurable
close
public void close()
- Specified by:
close
in interface Closeable
reduce
public void reduce(WritableComparable key,
Iterator values,
OutputCollector output,
Reporter reporter)
throws IOException
- Specified by:
reduce
in interface Reducer
- Throws:
IOException
index
public void index(Path indexDir,
Path crawlDb,
Path linkDb,
Path[] segments)
throws IOException
- Throws:
IOException
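The signature above takes one index directory, one crawl database, one link database, and any number of segments. As a rough illustration only, the sketch below splits a command-line style argument array into those four groups. The class `IndexerArgs`, its `parse` method, the example paths, and the assumed argument order (index, crawldb, linkdb, then segments) are hypothetical and not part of the Nutch API; plain strings stand in for `Path` so the sketch compiles without the Nutch and Hadoop jars.

```java
import java.util.Arrays;

/**
 * Hypothetical helper, not part of Nutch: groups arguments in the same
 * order as the parameters of Indexer.index(indexDir, crawlDb, linkDb, segments).
 */
final class IndexerArgs {
    final String indexDir;
    final String crawlDb;
    final String linkDb;
    final String[] segments;

    IndexerArgs(String indexDir, String crawlDb, String linkDb, String[] segments) {
        this.indexDir = indexDir;
        this.crawlDb = crawlDb;
        this.linkDb = linkDb;
        this.segments = segments;
    }

    /** Assumed layout: indexDir, crawlDb, linkDb, then one or more segments. */
    static IndexerArgs parse(String[] args) {
        if (args.length < 4) {
            throw new IllegalArgumentException(
                "expected: <indexDir> <crawlDb> <linkDb> <segment> ...");
        }
        // Everything after the first three arguments is treated as a segment.
        String[] segments = Arrays.copyOfRange(args, 3, args.length);
        return new IndexerArgs(args[0], args[1], args[2], segments);
    }

    public static void main(String[] args) {
        // Example paths are made up for illustration.
        IndexerArgs a = parse(new String[] {
            "crawl/indexes", "crawl/crawldb", "crawl/linkdb",
            "crawl/segments/s1", "crawl/segments/s2"
        });
        System.out.println(a.indexDir + " <- " + a.segments.length + " segment(s)");
        // With Nutch and Hadoop on the classpath, each string would be wrapped
        // in a Path and passed to index(indexDir, crawlDb, linkDb, segments).
    }
}
```

Keeping the segments as a trailing variable-length group mirrors the `Path[] segments` parameter, so a single call can cover several segments at once.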
main
public static void main(String[] args)
throws Exception
- Throws:
Exception
run
public int run(String[] args)
throws Exception
- Specified by:
run
in interface Tool
- Throws:
Exception
map
public void map(WritableComparable key,
Writable value,
OutputCollector output,
Reporter reporter)
throws IOException
- Specified by:
map
in interface Mapper
- Throws:
IOException
Copyright © 2006 The Apache Software Foundation