Extract Text (string) from doc, xls, pdf ... I will pay for

  • Thread starter Thread starter Frank Uray
  • Start date Start date
F

Frank Uray

Hi all

I am looking for a C# class to read all text informations
(including spaces but without formatting and structure) out of files.

I need a C# class with just one function like this:
public string GetTextFromFile(string local_PathFile, string
local_FileType)

I will call this function like:
string SomeText = GetTextFromFile(@"C:\temp\somePDF.pdf", "PDF");
string SomeText = GetTextFromFile(@"C:\temp\someDOC.doc", "DOC");

The File Types are: .pdf, .doc, .xls, .txt, .csv, .rtf

The Text, out of this files need to be read without using COM.
There will be no Office or Acrobar Reader installed.

Please offer what this solution would cost.

Additionaly (please offer):
If somebody can provide a function to read all Text and Spaces
out of picture files, this woul dby great :-)

Thanks and best regards
Frank Uray
 
Back
Top