Social Science Consulting - WordStat
WordStat - Content Analysis and Text Mining Software

Last update: 8. June 2006


Overview

Name of the program WordStat
Version 5.1
type of software computer aided content analysis, text mining
development Normand Peladeau, Provalis Research
distribution Social Science Consulting and others
languages menu and help system English, French
manual languages English
hardware requirements 64 MB RAM, 5 MB free disk space
software requirements Windows 9x or better, requires either Simstat or QDA-Miner to run
test version yes
test version restrictions full version for 30 days


Applications

The following applications can be performed by Wordstat:


Features

The following applications can be performed by WordStat:

  • list of words, sorted by alphabet or by frequency, ascending or descending, also with exclusion lists (STOP-words) that come with the program (for English, German, and French)
  • list of word sequences
  • list of word permutations
  • KWICs - key word in context with variable line length
  • SITs - search patterns in text unit
  • content analysis with powerful features like interactive coding, control files, and negation detection
  • control of multiple search patterns
  • readability analysis for English, Spanish, French, Dutch, Danish, Swedish and German texts


Short description


WordStat is a text analysis module specifically designed to study textual information such as responses to open-ended questions, interviews, titles, journal articles, public speeches, electronic communications, etc. WordStat may be used for automatic categorization of text using a dictionary approach or various text mining as well as for manual coding. WordStat can apply existing categorization dictionaries to a new text corpus. It also may be used in the development and validation of new categorization dictionaries or taxonomies. When used in conjunction with manual coding, this module can provide assistance for a more systematic application of coding rules, help uncover differences in word usage between subgroups of individuals, assist in the revision of existing coding using KWIC (Keyword-In-Context) tables, and assess the reliability of coding by the computation of inter-raters agreement statistics.

WordStat includes numerous exploratory data analysis and graphical tools that may be used to explore the relationships between the content of documents and information stored in categorical or numeric variables such as the gender or the age of the respondent, year of publication, etc. Relationships among words or categories as well as document similarity may be identified using hierarchical clustering and multidimensional scaling analysis. Correspondence analysis and heatmap plots may be used to explore relationships between keywords and different groups of individuals. More ...


© Social Science Consulting, 1999-2006