Wiki

Clone wiki

r-text-tools / plotEccoTcpTexts

Takes a csv filename and column number and reads TCP IDs from the designated column, then loops through all IDs and goes out over network and gets the full text for each and concatenates all texts returns into a single large text string. Once it has all of the text, it calculates word frequency and plots it on a bar graph

Digital Approaches to Bibliography and Book History - Rare Book School - University of Virginia - Carl Stahmer and Benjamin Pauley

USAGE:

text <- the name of a csv file
field <- the column to look in for TCP ids
size <- The number of texts to grab
stops <- A comma separated list of stop words to
to ignore in calculation and plot. No
spaces should appear between words.
maxnum <- The number of words to chart

plotEccoTcpTexts(text, field, size, stops, maxnum)

examples:

r> plotEccoTcpTexts("ecco-tcp-ids.csv", 2, 10, "the,a", 5)

This work is licensed under a Creative Commons Attribution 4.0 International License.

Updated