Wiki

Clone wiki

r-text-tools / getEccoTcpWordFrequency

Takes a csv filename and column number and reads TCP IDs from the designated column, hen loops through all IDs and goes out over network and gets the full text for each and concatenates all texts returns into a single large text string. Once it has all of the text, it calculates word frequency and returns it as a list

Digital Approaches to Bibliography and Book History - Rare Book School - University of Virginia - Carl Stahmer and Benjamin Pauley

USAGE:

text <- the name of a csv file
field <- the column to look in for TCP ids
size <- The number of texts to grab
stops <- A comma separated list of stop words to
to ignore in calculation and plot. No
spaces should appear between words.

makeEccoTcpWordle(text, field, size, stops)

examples:

r> makeEccoTcpWordle("ecco-tcp-ids.csv", 2, 10, "the,and")

This work is licensed under a Creative Commons Attribution 4.0 International License.

Updated