org.apache.nutch.crawl
Class PartitionUrlByHost

java.lang.Object
  extended by org.apache.nutch.crawl.PartitionUrlByHost
All Implemented Interfaces:
JobConfigurable, Partitioner

public class PartitionUrlByHost
extends Object
implements Partitioner

Partition urls by hostname.


Constructor Summary
PartitionUrlByHost()
           
 
Method Summary
 void close()
           
 void configure(JobConf job)
           
 int getPartition(WritableComparable key, Writable value, int numReduceTasks)
          Hash by hostname.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

PartitionUrlByHost

public PartitionUrlByHost()
Method Detail

configure

public void configure(JobConf job)
Specified by:
configure in interface JobConfigurable

close

public void close()

getPartition

public int getPartition(WritableComparable key,
                        Writable value,
                        int numReduceTasks)
Hash by hostname.

Specified by:
getPartition in interface Partitioner


Copyright © 2006 The Apache Software Foundation