Ryan Ternier
Hello,
We're about to start a new web application in which users scan printed
documents and upload them to the website.
When these docs hit the website, they will be run through an online OCR
utility to get the raw text from them.
From that point, they will be stripped down and shoved (maybe even stomped)
into a SQL DB.
I've never attempted this before, and was looking for some advice with the
DB section.
Once the document is uploaded and stripped into its text form, what is the
best way to index it so users can search it? We could just put the
whole thing into a large text field, but indexing seems faster.
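To make the two options concrete, here is a rough sketch of what I have in mind. It assumes SQLite through Python's built-in sqlite3 module purely for illustration (our real DB may differ), and the table and function names (documents, word_index, build_db, search_text_field, search_index) are made up for this example.

```python
import re
import sqlite3


def build_db(docs):
    """Store each document's raw text plus a word-to-document index table."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical schema: one table for the raw OCR text, one for the index.
    conn.execute("CREATE TABLE documents (id INTEGER PRIMARY KEY, body TEXT)")
    conn.execute(
        "CREATE TABLE word_index ("
        "word TEXT, doc_id INTEGER, PRIMARY KEY (word, doc_id))"
    )
    for body in docs:
        cur = conn.execute("INSERT INTO documents (body) VALUES (?)", (body,))
        doc_id = cur.lastrowid
        # Split the text into lowercase words and index each distinct word once.
        words = set(re.findall(r"[a-z0-9]+", body.lower()))
        conn.executemany(
            "INSERT INTO word_index (word, doc_id) VALUES (?, ?)",
            [(w, doc_id) for w in words],
        )
    conn.commit()
    return conn


def search_text_field(conn, term):
    """Option 1: scan the large text field with LIKE on every search."""
    rows = conn.execute(
        "SELECT id FROM documents WHERE body LIKE ? ORDER BY id",
        (f"%{term}%",),
    )
    return [r[0] for r in rows]


def search_index(conn, term):
    """Option 2: look the word up in the index table via its primary key."""
    rows = conn.execute(
        "SELECT doc_id FROM word_index WHERE word = ? ORDER BY doc_id",
        (term.lower(),),
    )
    return [r[0] for r in rows]
```

One difference I can already see: the LIKE version matches substrings (searching "paper" would also hit "newspaper"), while the index version only matches whole words.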
If anyone could shed some light or point me in the right direction, it'd be appreciated.
Thanks.
Ryan Ternier
Code Monkey