General Question

docguru's avatar

How can I build a distributed document processing platform in the cloud (like scribd.com)?

Asked by docguru (1points) September 25th, 2009

A user will upload large amounts of documents, like email, spreadsheets, images, pdfs, etc.

These documents will need metadata and text extraction, they will then be converted to .PDF documents for viewing.

Searching by keyword will be very important in the application.

Observing members: 0 Composing members: 0

1 Answer

jrpowell's avatar

Start in small chunks. I’m going to assume you know nothing.

#1: get a environment to work in. This can be local or hosted.
#2: Google how to upload files in html (I would start here). You will need a server-side solution to deal with what people upload (I suggest PHP). You will also really need to look into security. Allowing people to upload stuff can be scary. Really, be careful.

Get my drift? This is going to be a lot of work. You can’t click a button and be done. Baby steps. Solve one problem and then work on the next.

Answer this question

Login

or

Join

to answer.

This question is in the General Section. Responses must be helpful and on-topic.

Your answer will be saved while you login or join.

Have a question? Ask Fluther!

What do you know more about?
or
Knowledge Networking @ Fluther