Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • The "Quantity" selection allows you to choose how the numerical values are counted:Depending on what option you choose, the label of the y-axis of the graph is changed accordingly, and the chart values adjusted.
    •  "% of words" shows how frequently the unigram or bigram is used relative to all other tokens in the corpus, for the given year shown. This lets you see how often a word is used relative to the size of the corpus, without having to worry about things like whether there are more books in 1850 than 1900.

...

    • When "word count" is selected, the actual count of the searched word is used for creating the plot.

    • When "text count" is selected, only those volumes in which the searched word occurs, are used for creating the plot. So, each such volume registers as only a single count. 

    • For "Case", selecting the "Insensitive" button will ignore the distinction between lowercase and uppercase characters when counting words while selecting "Sensitive" will maintain the distinction between lowercase and uppercase.

    • "Smoothing" is a means to create a moving average over the data and to identify overall trends by removing jagged and discontinuous data points. To see the raw data points, set smoothing to 0. To average one point on each side of a data point, set smoothing to 1, which counts previous one, current one, and next one and divides that sum by 3. A smoothing setting of 5 means that 11 values will be averaged, 5 values on each side of the data point. The leftmost and rightmost data points will average fewer values. So with a smoothing of 1, only 2 data points will be averaged. Often trends become more apparent when data is viewed as a moving average. A smoothing of 1 means that the data shown for 1950 averages the raw count for 1950 plus 1 year on either side. Smoothing windows are weighted: the year shown is weighted the most heavily, and the weights decrease in each direction until the smoothing span is reached. A smoothing of 0 means no smoothing at all: just raw data. After setting "smoothing" to 5 years, the same graph looks more continuous and less jagged, as shown below.