  • "Drop OpenOffice for text extraction"
    Fixed. Disabled the starOffice extractor and commented out the OpenOffice code in the base extractor.
    
    Committed by: Megan Watson
    Reviewed by: Donald Jackson
    
    
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9602 c91229c3-7414-0410-bfa2-8a42b809f60b
    Megan Watson authored
     
  • "Implement PHP wrapper for the text extractors"
    Fixed. Added a new Apache Tika extractor and disabled the old extractors (word, excel, powerpoint, pdf). Added a new text extraction function to the XmlRpcLucene class.
    
    KTS-3894
    "Implement PHP interface for GUID inserter"
    Fixed. Added metadata read and write functions to the XmlRpcLucene class.
    
    Committed by: Megan Watson
    Reviewed by: Donald Jackson
    
    
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9600 c91229c3-7414-0410-bfa2-8a42b809f60b
    Megan Watson authored
     
  • "Design new Document Processor architecture"
    Fixed. Implemented the new architecture, added a scheduled task, removed the old indexing task and the indexing dashlet tasks, and refactored the indexDocuments() function.
    
    Committed by: Megan Watson
    Reviewed by: Donald Jackson
    
    
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9599 c91229c3-7414-0410-bfa2-8a42b809f60b
    Megan Watson authored
     

  • "The number of documents in the indexing queue is incorrect on the Document Indexer Statistics dashlet"
    Fixed. Adjusted the SQL to ignore problem documents.
    
    Committed by: Megan Watson
    Reviewed by: Conrad Vermeulen
    
    
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9511 c91229c3-7414-0410-bfa2-8a42b809f60b
    Megan Watson authored
     

  • "A postscript file not indexed."
    Fixed. The IgnorePS default value was set to true. It now defaults to false.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9486 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Doing a search for anything via the Search text box at the top of the browser page results in a DB error."
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9466 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Search results must return the permissions similar to KTAPI for use by client tools"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9448 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "Navigating to a folder after it has been found in a search (Advanced Search) takes the user to the Root folder and not the folder itself."
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9440 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Search should return folders"
    Implemented.
    
    KTS-3671 
    "Support for searching of symbolic links"
    Implemented.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9400 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "Search should return folders"
    Implemented.
    
    KTS-3671 
    "Support for searching of symbolic links"
    Implemented.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9399 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "A postscript file not indexed."
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9283 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Indexer dashlet may take considerable time to show stats on a large repository"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9271 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Uno module required for pdf generation"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9226 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "After indexing, the cached index stats should be updated so that dashboard is up to date."
    Fixed. Added indexStats.
    
    Committed by: Megan Watson
    Reviewed by: Conrad Vermeulen
    
    
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9126 c91229c3-7414-0410-bfa2-8a42b809f60b
    Megan Watson authored
     
  • "Allow indexing to not be dependent on open office"
    Fixed.  Unfortunately, we have to keep it for Windows: catdoc and similar tools do not work properly on Windows.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9097 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Allow indexing to not be dependent on open office"
    Fixed.  When catdoc fails and OO must stand in, OO returns a new file containing the text.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9075 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Allow indexing to not be dependent on open office"
    Fixed.  Unzip requires a file to have an extension on Windows. Renamed tmp directories so they are cleaned up by the cleanup script.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9047 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Indexing a large XML document (97 meg) resulted in the indexer throwing an error relating to heap not being big enough"
    Updated. Optimised how content is read so that the text extractor does not crash the indexer.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9017 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Allow indexing to not be dependent on open office"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8871 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "Allow reindexing of folders and their contents"
    Implemented.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8870 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Uno module appears to require editing document conversion scripts"
    Fixed. Allow ooProgramPath to be set as an environment variable.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8836 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
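    The ooProgramPath change above can be illustrated with a minimal sketch. It is written in Python for illustration only and is not the project's code. The helper name, the environment variable name (assumed to match the setting name), and the rule that the environment wins over a configured value are all assumptions.

```python
import os


def resolve_oo_program_path(config_value=None, environ=None):
    """Pick the OpenOffice program path (hypothetical helper).

    Assumption: an environment variable named ooProgramPath, when set and
    non-empty, takes precedence over the configured value.
    """
    environ = os.environ if environ is None else environ
    return environ.get("ooProgramPath") or config_value
```

    Passing the environment as a parameter keeps the lookup easy to exercise without touching the real process environment.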

  • "not indexed documents are not flagged to the admin"
    Fixed. reportUnindexedDocuments.php iterates through the document IDs in the database and identifies documents that have no record in the index.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8693 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
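    The check described for reportUnindexedDocuments.php amounts to a set difference between database IDs and index entries. The following is a rough Python sketch of that idea, not the script itself. The function name and the is_indexed callback are hypothetical stand-ins for the database query and the index lookup.

```python
def find_unindexed(document_ids, is_indexed):
    """Return the IDs from document_ids that have no record in the index.

    document_ids: iterable of IDs as read from the database.
    is_indexed:   callable returning True when the index holds that ID
                  (a hypothetical stand-in for the real index lookup).
    """
    return [doc_id for doc_id in document_ids if not is_indexed(doc_id)]


# Usage: a plain set stands in for the search index here.
index_contents = {1, 2, 4}
unindexed = find_unindexed([1, 2, 3, 4, 5], index_contents.__contains__)
```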

  • "Background task can test Document Indexer diagnostics and cache results"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8669 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "Indexing needs to be more resilient when encountering errors with open office"
    Fixed. Rewrote the expression used to resume processing of documents that failed indexing.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8654 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Indexing needs to be more resilient when encountering errors with open office"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Philip Arkoll
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8652 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Restart open office periodically"
    Fixed. A count is incremented each time a document is indexed (successfully or not); after 50 documents, OpenOffice is restarted.
    
    Committed by: Megan Watson
    Reviewed by: Conrad Vermeulen
    
    
    
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8634 c91229c3-7414-0410-bfa2-8a42b809f60b
    Megan Watson authored
     
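    The periodic restart described in the commit above can be sketched as a counter wrapped around the indexing call. This Python sketch is illustrative only and is not the project's implementation. The class name and the index_fn and restart_fn callbacks are hypothetical; only the threshold of 50 and the rule that failed attempts also count come from the commit message.

```python
class RestartingIndexer:
    """Count indexing attempts and restart a helper service every N documents."""

    def __init__(self, index_fn, restart_fn, restart_every=50):
        self.index_fn = index_fn        # hypothetical: indexes one document
        self.restart_fn = restart_fn    # hypothetical: restarts OpenOffice
        self.restart_every = restart_every
        self.count = 0

    def index(self, document):
        try:
            return self.index_fn(document)
        finally:
            # Count the attempt whether or not indexing succeeded.
            self.count += 1
            if self.count >= self.restart_every:
                self.restart_fn()
                self.count = 0
```

    Doing the bookkeeping in a finally clause means a document that raises an error still moves the counter, which matches the "successfully or not" wording.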