Online service that takes a Project Gutenberg text, strips off the bumf, and makes it available in different formats such as plain text, HTML, and PDF.

  * Ticket: http://knowledgeforge.net/okfn/tasks/ticket/283
  * Mercurial repo: http://bitbucket.org/rgrp/textize

Copyright 2010 Open Knowledge Foundation. Licensed under AGPL.


= Plan =

== System Structure ==

1. Frontend: catalogue and links to texts in various formats
  * openbiblio
2. Store: holds the texts (maybe the same as the frontend)
3. Messaging system
4. Listeners which do cleaning and PDFizing
  * Could plug in other cleaners ...

== Steps ==

=== Frontend ===

1. I come to the Frontend and request Madame Bovary in PDF
2. Frontend looks up in the store whether the PDF exists
  * Yes: return it
  * No: proceed
3. Add a PDF task for Madame Bovary to the task queue and tell the user to check back in 30 minutes
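
The frontend steps above can be sketched roughly as follows. This is only an illustration, not the project's code: `request_text`, the dict used as the store, and the in-process `queue.Queue` used as the task queue are all hypothetical stand-ins for the real storage server and messaging system.

```python
import queue


def request_text(title, fmt, store, tasks):
    """Return the stored file if it exists; otherwise queue a conversion task.

    `store` and `tasks` are hypothetical stand-ins (a dict and a Queue) for
    the storage server and the messaging system described in the plan.
    """
    key = (title, fmt)
    if key in store:
        # Step 2, "Yes": the requested format already exists, so return it.
        return {"status": "ready", "data": store[key]}
    # Step 3: no stored copy yet, so add a task and ask the user to come back.
    tasks.put({"title": title, "format": fmt})
    return {"status": "queued", "message": "Check back in 30 minutes"}


store = {}             # stand-in for the storage server
tasks = queue.Queue()  # stand-in for the task queue
first = request_text("Madame Bovary", "pdf", store, tasks)
```

Note that this sketch would queue a duplicate task if the same text were requested twice before a worker finished; whether and how to deduplicate is left open here.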

=== Task queue ===

=== Storage server ===

=== Worker ===

1. Worker checks for a PDF conversion message and finds a PDF item
2. Worker locates and downloads the source text
3. Cleans up the source text and converts it to PDF
4. Uploads results to holding/storage server
5. Adds message that this has been done
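
One pass of the worker loop above might look something like the sketch below. Everything named here is an assumption made for illustration: `clean`, `run_once`, the `fetch` and `convert` callables, and the use of a dict and in-process queues in place of the real storage server and messaging system. The cleaner also assumes the text carries `*** START` and `*** END` marker lines, which not every Gutenberg file uses in exactly that form.

```python
import queue


def clean(text):
    """Hypothetical cleaner: keep only lines between the START and END markers."""
    lines = text.splitlines()
    start = next((i for i, l in enumerate(lines) if l.startswith("*** START")), -1)
    end = next((i for i, l in enumerate(lines) if l.startswith("*** END")), len(lines))
    return "\n".join(lines[start + 1:end]).strip()


def run_once(tasks, done, store, fetch, convert):
    """Process at most one task; return True if a task was handled."""
    try:
        task = tasks.get_nowait()            # 1. check for a conversion message
    except queue.Empty:
        return False
    source = fetch(task["title"])            # 2. locate and download the source
    result = convert(clean(source), task["format"])  # 3. clean and convert
    store[(task["title"], task["format"])] = result  # 4. upload to the store
    done.put(dict(task, status="done"))      # 5. announce that the work is done
    return True


# Usage with fake fetch/convert functions (no network access, no real PDF library).
sample = "Header\n*** START OF EBOOK ***\nBody line\n*** END OF EBOOK ***\nLicence"
tasks, done, store = queue.Queue(), queue.Queue(), {}
tasks.put({"title": "Madame Bovary", "format": "pdf"})
handled = run_once(tasks, done, store,
                   fetch=lambda title: sample,
                   convert=lambda text, fmt: (fmt + ":" + text).encode("utf-8"))
```

Passing `fetch` and `convert` in as arguments keeps the sketch open to the "could plug in other cleaners" idea: a different converter or source locator can be swapped in without changing the loop.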
