openDIAS


openDIAS (Document Imaging Archive System) provides document imaging with OCR. You can scan documents (with SANE) or import ODF documents and then assign tags. It can store all your letters, bills, statements, etc. in a convenient, safe, and easily retrievable way.

Recent releases

  •  19 Nov 2012 23:33

    Release Notes: This release implements issue #7 (document linkage), issue #12 (better testing), and issue #13 (device locking). PDF, ODF, and image imports now correctly get a thumbnail and have OCR performed. It now works on 64-bit machines. OCR was migrated to tesseract v3. All I/O is UTF-8. The front end now times out rather than hanging on an error. The document list is now auto-loaded rather than shown in a paged table. A localization framework has been put in place (with English and German added). Defunct speech functionality has been removed. There are also various bugfixes and cleanups.

  •  11 Jan 2012 22:31

    Release Notes: This release introduces major new functionality and polishes the code and user interface. Overall, it is a solid increase in functionality and quality.

  •  15 May 2011 15:47

    Release Notes: The software was rewritten from the ground up. It is now a web-based client that talks to a server backend, which controls the SANE devices and the database.

  •  02 Jul 2008 11:02

    Release Notes: Threading was added to smooth out the UI in places. Many memory leaks were fixed. The case where the loading image is not available is now handled. Compiler flags are now used to set the data_dir. A "no OCR libs" error was fixed. Bind variables are now used for database updates and inserts. The "should we OCR" setting now defaults to on. Image processing was added to allow paged views of scanned images. An icon was added to the application. A build failure that occurred when tesseract is not installed was fixed. General cleanup was done.

  •  27 May 2008 14:54

    Release Notes: A bug that prevented new users from creating their repository was fixed.

