org.apache.nutch.parse.mspowerpoint
Class MSPowerPointParser

java.lang.Object
  extended by org.apache.nutch.parse.ms.MSBaseParser
      extended by org.apache.nutch.parse.mspowerpoint.MSPowerPointParser
All Implemented Interfaces:
Configurable, Parser, Pluggable

public class MSPowerPointParser
extends MSBaseParser

Nutch-Parser for parsing MS PowerPoint slides ( mime type: application/vnd.ms-powerpoint).

It is based on org.apache.poi.*.

Author:
Stephan Strittmatter - http://www.sybit.de, Jérôme Charron
See Also:
Jakarta POI

Field Summary
static String MIME_TYPE
          Associated Mime type for PowerPoint files (application/vnd.ms-powerpoint).
 
Fields inherited from class org.apache.nutch.parse.ms.MSBaseParser
LOG
 
Fields inherited from interface org.apache.nutch.parse.Parser
X_POINT_ID
 
Constructor Summary
MSPowerPointParser()
           
 
Method Summary
 Parse getParse(Content content)
          Creates the parse for some content.
static void main(String[] args)
          Main for testing.
 
Methods inherited from class org.apache.nutch.parse.ms.MSBaseParser
getConf, getParse, main, setConf
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

MIME_TYPE

public static final String MIME_TYPE
Associated Mime type for PowerPoint files (application/vnd.ms-powerpoint).

See Also:
Constant Field Values
Constructor Detail

MSPowerPointParser

public MSPowerPointParser()
Method Detail

getParse

public Parse getParse(Content content)
Description copied from interface: Parser
Creates the parse for some content.


main

public static void main(String[] args)
Main for testing. Pass a powerpoint document as argument



Copyright © 2006 The Apache Software Foundation