org.apache.nutch.crawl
Class PartitionUrlByHost
java.lang.Object
org.apache.nutch.crawl.PartitionUrlByHost
- All Implemented Interfaces:
- JobConfigurable, Partitioner
public class PartitionUrlByHost
- extends Object
- implements Partitioner
Partition urls by hostname.
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
PartitionUrlByHost
public PartitionUrlByHost()
configure
public void configure(JobConf job)
- Specified by:
configure
in interface JobConfigurable
close
public void close()
getPartition
public int getPartition(WritableComparable key,
Writable value,
int numReduceTasks)
- Hash by hostname.
- Specified by:
getPartition
in interface Partitioner
Copyright © 2006 The Apache Software Foundation