org.apache.nutch.parse.oo
Class OOParser

java.lang.Object
  extended by org.apache.nutch.parse.oo.OOParser
All Implemented Interfaces:
Configurable, Parser, Pluggable

public class OOParser
extends Object
implements Parser

Parser for OpenOffice and OpenDocument formats. It is intended to handle text, spreadsheet, and presentation documents, along with their corresponding templates and "master" documents.
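
For orientation, the following is a minimal sketch of how plain text can be pulled out of an OpenDocument file using only the JDK. It is not OOParser's implementation, which this page does not show. It relies on the fact that an OpenDocument file is a ZIP package whose main body lives in an entry named content.xml. The class name OdfTextSketch and both method names are hypothetical, and the sample document is built in memory so the example needs no external file.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;

// Hypothetical helper, not part of Nutch: illustrates the ZIP + content.xml layout only.
final class OdfTextSketch {
    static final String OFFICE_NS = "urn:oasis:names:tc:opendocument:xmlns:office:1.0";
    static final String TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0";

    // Builds a tiny in-memory package holding only content.xml.
    // Paragraph text is inserted as-is, so it must not contain XML markup characters.
    static byte[] buildSampleOdt(String... paragraphs) {
        StringBuilder xml = new StringBuilder();
        xml.append("<?xml version=\"1.0\" encoding=\"UTF-8\"?>")
           .append("<office:document-content xmlns:office=\"").append(OFFICE_NS)
           .append("\" xmlns:text=\"").append(TEXT_NS).append("\">")
           .append("<office:body><office:text>");
        for (String p : paragraphs) {
            xml.append("<text:p>").append(p).append("</text:p>");
        }
        xml.append("</office:text></office:body></office:document-content>");

        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            zip.putNextEntry(new ZipEntry("content.xml"));
            zip.write(xml.toString().getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    // Finds content.xml in the package and joins the text of every text:p element.
    static String extractText(byte[] odfBytes) {
        try (ZipInputStream zip = new ZipInputStream(new ByteArrayInputStream(odfBytes))) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (!"content.xml".equals(entry.getName())) {
                    continue;
                }
                // Copy the current entry into memory before handing it to the XML parser.
                ByteArrayOutputStream xml = new ByteArrayOutputStream();
                byte[] buffer = new byte[4096];
                int n;
                while ((n = zip.read(buffer)) != -1) {
                    xml.write(buffer, 0, n);
                }
                DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
                factory.setNamespaceAware(true);
                Document doc = factory.newDocumentBuilder()
                        .parse(new ByteArrayInputStream(xml.toByteArray()));
                NodeList paragraphs = doc.getElementsByTagNameNS(TEXT_NS, "p");
                StringBuilder text = new StringBuilder();
                for (int i = 0; i < paragraphs.getLength(); i++) {
                    if (i > 0) {
                        text.append('\n');
                    }
                    text.append(paragraphs.item(i).getTextContent());
                }
                return text.toString();
            }
        } catch (IOException | ParserConfigurationException | SAXException e) {
            throw new IllegalStateException("Could not read OpenDocument package", e);
        }
        throw new IllegalArgumentException("No content.xml entry found");
    }

    public static void main(String[] args) {
        System.out.println(extractText(buildSampleOdt("Hello", "World")));
    }
}
```

A real parser would also need to handle metadata, outlinks, spreadsheet and presentation body elements, and malformed input, none of which this sketch attempts.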

Author:
Andrzej Bialecki

Field Summary
static org.apache.commons.logging.Log LOG
           
 
Fields inherited from interface org.apache.nutch.parse.Parser
X_POINT_ID
 
Constructor Summary
OOParser()
           
 
Method Summary
 Configuration getConf()
           
 Parse getParse(Content content)
          Creates the parse for some content.
static void main(String[] args)
           
 void setConf(Configuration conf)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

LOG

public static final org.apache.commons.logging.Log LOG
Constructor Detail

OOParser

public OOParser()
Method Detail

setConf

public void setConf(Configuration conf)
Specified by:
setConf in interface Configurable

getConf

public Configuration getConf()
Specified by:
getConf in interface Configurable
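
The setConf and getConf pair above follows a common setter/getter pattern: the caller creates the object with the no-argument constructor, hands it a configuration, and the object keeps it for later use. The sketch below illustrates that pattern with stand-in types only. SketchConfigurable, SketchParser, the use of java.util.Properties in place of Configuration, and the property name sketch.default.encoding are all assumptions made for illustration and are not taken from Nutch.

```java
import java.util.Properties;

// Stand-in for a Configurable-style contract; not the real interface.
interface SketchConfigurable {
    void setConf(Properties conf);
    Properties getConf();
}

// Hypothetical parser-like class that stores whatever configuration it is given.
final class SketchParser implements SketchConfigurable {
    private Properties conf;

    @Override
    public void setConf(Properties conf) {
        this.conf = conf;
    }

    @Override
    public Properties getConf() {
        return conf;
    }

    // Reads a hypothetical setting, falling back to UTF-8 when no value is configured.
    String defaultEncoding() {
        return conf == null ? "UTF-8" : conf.getProperty("sketch.default.encoding", "UTF-8");
    }

    public static void main(String[] args) {
        SketchParser parser = new SketchParser();
        Properties conf = new Properties();
        conf.setProperty("sketch.default.encoding", "ISO-8859-1");
        parser.setConf(conf);
        System.out.println(parser.defaultEncoding());
    }
}
```

Keeping configuration out of the constructor lets a plug-in framework instantiate the class reflectively and configure it afterwards.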

getParse

public Parse getParse(Content content)
Description copied from interface: Parser
Creates the parse for some content.

Specified by:
getParse in interface Parser

main

public static void main(String[] args)
                 throws Exception
Throws:
Exception


Copyright © 2006 The Apache Software Foundation