How can I read an MS Word document in Java without displaying junk
(unnecessary) data?
Answer / nivas
There are a lot of third-party APIs available. Please try
the one below, available on the Jakarta site:
Apache POI - HWPF - Java API to Handle Microsoft Word Files
Answer / rajdeo
Hi, this is rajdeo.
I am trying to read a Word file using the POI API, but I get
an error on the import org.apache.poi.hwpf.extractor.*;
I have put the POI JAR file in WEB-INF, but I don't know
which JAR file is needed (that is, the name of the JAR file).
I put in the JAR file that came with the
poi-bin-3.6-20091214.tar archive. Can anybody please help me?
On the page, I also get an error on this line:
WordExtractor we = new WordExtractor(doc);
This is the code I used. Please take a look and reply.
Thanks.
import org.apache.poi.poifs.filesystem.*;
import org.apache.poi.hwpf.*;
import org.apache.poi.hwpf.extractor.*;
import java.io.*;
public class readDoc
{
    public static void main( String[] args )
    {
        String filesname = "Hello.doc";
        // try-with-resources closes the stream even if parsing fails
        try ( FileInputStream in = new FileInputStream( filesname ) )
        {
            POIFSFileSystem fs = new POIFSFileSystem( in );
            HWPFDocument doc = new HWPFDocument( fs );
            WordExtractor we = new WordExtractor( doc );
            // one array element per paragraph of the document
            String[] paragraphs = we.getParagraphText();
            System.out.println( "Word Document has " +
                paragraphs.length + " paragraphs" );
            for ( int i = 0; i < paragraphs.length; i++ ) {
                System.out.println( "Length:" + paragraphs[i].length() );
            }
        }
        catch ( Exception e ) {
            e.printStackTrace();
        }
    }
}
Answer / arumugam
import org.apache.poi.poifs.filesystem.*;
import org.apache.poi.hwpf.*;
import org.apache.poi.hwpf.extractor.*;
import java.io.*;
public class readDoc
{
    public static void main( String[] args )
    {
        String filesname = "Hello.doc";
        // try-with-resources closes the stream even if parsing fails
        try ( FileInputStream in = new FileInputStream( filesname ) )
        {
            POIFSFileSystem fs = new POIFSFileSystem( in );
            HWPFDocument doc = new HWPFDocument( fs );
            WordExtractor we = new WordExtractor( doc );
            // one array element per paragraph of the document
            String[] paragraphs = we.getParagraphText();
            System.out.println( "Word Document has " +
                paragraphs.length + " paragraphs" );
            for ( int i = 0; i < paragraphs.length; i++ ) {
                System.out.println( "Length:" + paragraphs[i].length() );
            }
        }
        catch ( Exception e ) {
            e.printStackTrace();
        }
    }
}
Note: Before running this program, make sure you have added
the supporting JARs, which are available at this link:
http://poi.apache.org/download.html#POI-3.6
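The program above only prints paragraph lengths, so it does not yet deal with the "junk" the question asks about. One possible approach, assuming the junk is stray control characters left in the extracted paragraph strings, is to filter them out before printing. This is only a rough sketch using the plain JDK: ParagraphCleaner, clean and cleanAll are made-up names for illustration and are not part of Apache POI.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper (not part of Apache POI): strips control characters from extracted text. */
final class ParagraphCleaner {

    private ParagraphCleaner() {}

    /** Removes ISO control characters (keeping tabs) and trims surrounding whitespace. */
    static String clean(String paragraph) {
        StringBuilder sb = new StringBuilder(paragraph.length());
        for (int i = 0; i < paragraph.length(); i++) {
            char c = paragraph.charAt(i);
            // Assumption: anything in the control range except a tab counts as junk
            if (c == '\t' || !Character.isISOControl(c)) {
                sb.append(c);
            }
        }
        return sb.toString().trim();
    }

    /** Cleans every paragraph and drops the ones that end up empty. */
    static List<String> cleanAll(String[] paragraphs) {
        List<String> result = new ArrayList<>();
        for (String p : paragraphs) {
            String cleaned = clean(p);
            if (!cleaned.isEmpty()) {
                result.add(cleaned);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up sample strings standing in for the paragraphs array
        String[] sample = { "Hello\u0007\r\n", "\r\n", "Second\tparagraph\r\n" };
        for (String line : cleanAll(sample)) {
            System.out.println(line);
        }
    }
}
```

In the program above, the loop could print the result of ParagraphCleaner.cleanAll(paragraphs) instead of the lengths. Note that this only removes the control characters themselves; any visible text that sits between them stays in the output.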
Answer / prabhu
Download the binary zip file of any version from Apache.
poi.jar contains org.apache.poi.poifs.*
poi-scratchpad.jar contains org.apache.poi.hwpf.*
If the names of the JARs you downloaded are different, do not
worry: open each JAR with WinZip and check for the classes
that you have used in the program. Then select those JARs and
put them in your project's WEB-INF/lib directory.
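The manual check described above (opening each JAR and looking for a class) can also be done with a few lines of plain JDK code. A minimal sketch, assuming all the downloaded JARs sit in one directory; JarClassFinder and its find method are hypothetical names used only for this illustration.

```java
import java.io.File;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;
import java.util.jar.JarFile;

/** Hypothetical utility: reports which JARs in a directory contain a given class. */
final class JarClassFinder {

    private JarClassFinder() {}

    /** Returns the file names of the JARs in dir that contain className (e.g. "a.b.C"). */
    static List<String> find(File dir, String className) throws IOException {
        // Inside a JAR, a class is stored as a path such as a/b/C.class
        String entryName = className.replace('.', '/') + ".class";
        List<String> matches = new ArrayList<>();
        File[] jars = dir.listFiles((d, name) -> name.endsWith(".jar"));
        if (jars == null) {
            return matches; // not a directory, or it could not be read
        }
        for (File jar : jars) {
            try (JarFile jf = new JarFile(jar)) {
                if (jf.getEntry(entryName) != null) {
                    matches.add(jar.getName());
                }
            }
        }
        return matches;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("usage: java JarClassFinder <jar-directory> <class-name>");
            return;
        }
        for (String jarName : find(new File(args[0]), args[1])) {
            System.out.println(jarName);
        }
    }
}
```

For example, running it with the directory of the unpacked download and org.apache.poi.hwpf.extractor.WordExtractor as arguments should list the JAR that has to go into WEB-INF/lib for that import to resolve.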
Answer / tamina
Use one of the Java libraries available online. One such
library is Aspose.Words for Java, which lets you perform
many operations on Word documents, including reading an MS
Word document without showing junk data.
http://www.aspose.com/java/word-component.aspx