Tools

Text Denoiser

(October 6, 2011 by Rushdi Shams)

Text Denoiser is a tool that extracts content-rich sentences from a text by removing insignificant or unwanted sentences according to their reading difficulty score (here, the Fog Index). The 30% of sentences that are most difficult to read form the denoised text; the remaining sentences are treated as noise text.

DOWNLOAD
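
The selection rule described above can be sketched in a few lines of Python. This is only an illustration, not the tool's actual code: it assumes the Fog Index is applied to each sentence on its own (the index is normally defined over a whole passage), it uses a rough vowel-group heuristic to count syllables, and the names count_syllables, fog_index, denoise and keep_ratio are hypothetical.

```python
import math
import re


def count_syllables(word):
    """Rough syllable estimate from vowel groups (an assumption, not exact)."""
    lowered = word.lower()
    count = len(re.findall(r"[aeiouy]+", lowered))
    # Treat a trailing silent "e" as non-syllabic, except in "-le" endings.
    if lowered.endswith("e") and not lowered.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def fog_index(sentence):
    """Gunning Fog score of one sentence: 0.4 * (words + 100 * complex / words)."""
    words = re.findall(r"[A-Za-z]+", sentence)
    if not words:
        return 0.0
    complex_words = sum(1 for w in words if count_syllables(w) >= 3)
    return 0.4 * (len(words) + 100.0 * complex_words / len(words))


def denoise(text, keep_ratio=0.3):
    """Split text into (denoised, noise) sentence lists, keeping original order."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    n_keep = math.ceil(len(sentences) * keep_ratio)
    # Rank sentence positions from most to least difficult to read.
    ranked = sorted(range(len(sentences)),
                    key=lambda i: fog_index(sentences[i]), reverse=True)
    keep = set(ranked[:n_keep])
    denoised = [s for i, s in enumerate(sentences) if i in keep]
    noise = [s for i, s in enumerate(sentences) if i not in keep]
    return denoised, noise
```

Calling denoise(text) returns the sentences with the highest scores as the denoised text and everything else as noise text. The naive sentence splitter and the rounding up of the 30% cut-off are simplifications made for this sketch.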

Noun Extractor

(October 12, 2011 by Rushdi Shams)

Noun Extractor is a tool that extracts nouns from the sentences of a text file using Genia POS Tagger (version 1.0). Key advantages of the tool are:

  • A separate tagger is not required. The tool uses Genia POS Tagger to get the POS tags associated with every word
  • Reports the total number of nouns in a text file
  • Formats its output so that the nouns of every sentence can be identified
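
The extraction and reporting steps listed above might look roughly like the following Python sketch. It does not call the tagger itself. It assumes the tagger output is already available as text with one token per line, tab-separated fields with the word first and the POS tag third, and a blank line between sentences; that layout is an assumption of this sketch, as are the names extract_nouns and format_report.

```python
def extract_nouns(tagged_text):
    """Return one list of nouns per sentence from tab-separated tagger output."""
    sentences, current, seen_token = [], [], False
    for line in tagged_text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence (assumed layout).
            if seen_token:
                sentences.append(current)
            current, seen_token = [], False
            continue
        fields = line.split("\t")
        if len(fields) < 3:
            continue  # skip lines that do not match the assumed layout
        seen_token = True
        word, pos = fields[0], fields[2]
        # Penn Treebank noun tags all start with "NN" (NN, NNS, NNP, NNPS).
        if pos.startswith("NN"):
            current.append(word)
    if seen_token:
        sentences.append(current)
    return sentences


def format_report(sentences):
    """List the nouns sentence by sentence, followed by the overall count."""
    lines = [f"Sentence {i}: {', '.join(nouns)}"
             for i, nouns in enumerate(sentences, 1)]
    lines.append(f"Total nouns: {sum(len(nouns) for nouns in sentences)}")
    return "\n".join(lines)
```

Passing the tagged text through extract_nouns and then format_report gives one line of nouns per sentence plus a final total, which mirrors the per-sentence output and noun count described in the list.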

DOWNLOAD