• "Drop OpenOffice for text extraction"
    Fixed. Disabled the starOffice extractor, commented out OO code in the base extractor.
    
    Committed by: Megan Watson
    Reviewed by: Donald Jackson
    
    
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9602 c91229c3-7414-0410-bfa2-8a42b809f60b
    Megan Watson authored
     
  • "Implement PHP wrapper for the text extractors"
    Fixed. Added a new Apache Tika extractor and disabled the old extractors (word, excel, powerpoint, pdf). Added a new text extraction function to the XmlRpcLucene class.
    
    KTS-3894
    "Implement PHP interface for GUID inserter"
    Fixed. Added metadata read and write functions to the XmlRpcLucene class.
    
    Committed by: Megan Watson
    Reviewed by: Donald Jackson
    
    
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9600 c91229c3-7414-0410-bfa2-8a42b809f60b
    Megan Watson authored
     

  • "A postscript file not indexed."
    Fixed. The IgnorePS default value was set to true. It now defaults to false.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9486 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Allow indexing to not be dependent on open office"
    Fixed. Unzip requires a file to have an extension on Windows. Renamed tmp directories so that they are cleaned up by the cleanup script.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@9047 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Allow indexing to not be dependent on open office"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8871 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Indexing needs to be more resilient when encountering errors with open office"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Philip Arkoll
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8652 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "When installing stack as root, OpenOffice stores files as root in the KT temp folder. The indexing scripts cannot clean these up."
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8448 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     


  • "CLONE -3.5.2 Upgrade - INFO: Scheduler - Background tasks errors(SUP-636)"
    Fixed. 
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8266 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Security properties on the PDF document prevent text from being extracted."
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8181 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Support for Office 2007 Documents"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8159 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "Support for Office 2007 Documents"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8158 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Support for Office 2007 Documents"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8141 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Search2: Documents are not being indexed in Windows XP for OSS"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8121 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Document Indexing Diagnostics shows that there is a problem indexing a PDF file"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Jonathan Byrne
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8027 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "Document Indexing Diagnostics shows that there is a problem indexing a PDF file"
    Fixed.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Jonathan Byrne
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@8026 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Support for Office 2007 Documents"
    Implemented.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7992 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     


  • "PowerPoint files not being indexed"
    Fixed. PowerPoint was missing.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7800 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Documents are not being indexed in the queue"
    
    Fixed. Using new exec system.
    
    Committed By: Kevin Fourie
    Reviewed By: Conrad Vermeulen
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7768 c91229c3-7414-0410-bfa2-8a42b809f60b
    Kevin Fourie authored
     

  • "PostScript extractor is disabled on Linux"
    Fixed. The issue was that the full path to the application was not used and that the parameters were incorrect.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Megan Watson
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7588 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "Open office document text extractor doesn't have application/msword associated with it"
    Fixed. Added missing MIME types.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7568 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "The search algorithm needs some work"
    Updated. Added licensing.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7548 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     

  • "The search algorithm needs some work"
    Updated.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7472 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "The search algorithm needs some work"
    Updated.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7471 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "The search algorithm needs some work"
    Updated.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7470 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "The search algorithm needs some work"
    Updated. MailMime is not tested; it will be implemented again when there is more time.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7467 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "The search algorithm needs some work"
    Updated.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7466 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "The search algorithm needs some work"
    Updated.
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7465 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "Test RTF Indexer"
    Implemented. 
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7464 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "The search algorithm needs some work"
    Updated. Integrated with DocumentConverter.py
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7463 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     
  • "The search algorithm needs some work"
    Updated. Removed experimental indexer
    
    Committed By: Conrad Vermeulen
    Reviewed By: Kevin Fourie
    
    git-svn-id: https://kt-dms.svn.sourceforge.net/svnroot/kt-dms/trunk@7460 c91229c3-7414-0410-bfa2-8a42b809f60b
    Conrad Vermeulen authored
     