.NET Questions (CLOSED)

Questions and Answers on any aspect of .NET. Now closed.

This discussion group is now closed.

Have a question about .NET development? Try stackoverflow.com, a worldwide community of great developers asking and answering questions 24 hours a day.

The archives of .NET Questions contain years of Q&A. Even older .NET Questions are still online, too.

Turning binary files of multiple types into text files

I'm looking for some way to take PDF files and multiple Office files of different applications (Word, Excel) and versions (XP, 2002, 2007) and turn them into text files.

Formatting does not have to be great -- the text files are just going to get sucked up by a search indexer.  The text just has to all be there.

I've found some components that will read Office documents, which would probably work, but I was curious if I could harness Microsoft Index Server to do this?  Since Index Server can read all these files types to index them, could I retrieve the content from Index Server?

Just curious if anyone has some advice here.
Deane Send private email
Friday, November 14, 2008
 
 
one word: ifilter .

Google it.
Totally Agreeing
Saturday, November 15, 2008
 
 
Excellent.  Thank you.
Deane Send private email
Sunday, November 16, 2008
 
 

This topic is archived. No further replies will be accepted.

Other recent topics Other recent topics
 
Powered by FogBugz