Skip to content

Latest commit

 

History

History
12 lines (11 loc) · 319 Bytes

File metadata and controls

12 lines (11 loc) · 319 Bytes

TODO list

  • Put code into real repository.
  • Test running on Linux.
  • Release vs debug builds.
  • Test converting large corpus files.
  • Measure conversion time.
  • Progress indicator.
  • Corpus statistics class
    • Counts of files, documents, streams per document
    • Byte size of files.
  • Java documentation comments.