Wiki

Clone wiki

r-text-tools / getEccoTcpWordAssociation

Takes a csv filename and column number and reads TCP IDs from the designated column, then loops through all IDs and goes out over network and gets the full text for each and concatenates all texts returns into a single large text string. Once it has all of the text, it calculates word frequency and returns it as an ordered list.

Digital Approaches to Bibliography and Book History - Rare Book School - University of Virginia - Carl Stahmer and Benjamin Pauley

USAGE:

text <- the name of a csv file
field <- the column to look in for TCP ids
size <- The number of texts to grab
seed <- The seed word for which you want to check
associations.
num <- a number of matches above which you wqnt to
see asscoations

getEccoTcpWordAssoc(text, field, size, stops)

examples:

r> getEccoTcpWordAssoc("ecco-tcp-ids.csv", 2, 10, "his", 0)

This work is licensed under a Creative Commons Attribution 4.0 International License.

Updated