1. Giovanni Dall'Olio
  2. query_pubmed

Commits

Author Commit Message Date Builds
Giovanni Dall'Olio
ADD: plotting mean sentence lenght
Giovanni Dall'Olio
ADD: plots of withfreq and adj+adv by Journal
Giovanni Dall'Olio
FIX: Plot titles
Giovanni Dall'Olio
DATA: removed a few "unknown" nationalities
Giovanni Dall'Olio
FIX: giving up manually reviewing the data. I will filter out USA and merge Europe and latin
Giovanni Dall'Olio
ADD: RESULTS: new column "country_reviewed"
Giovanni Dall'Olio
RESULTS: checked other entries
Giovanni Dall'Olio
FIX: plotting script adapted to new data format
Giovanni Dall'Olio
RESULTS: other entries reviewed manually
Giovanni Dall'Olio
revied other ~100 entries
Giovanni Dall'Olio
RESULTS: abstracts file sorted by nationality and name. Manually curated all Asian entries
Giovanni Dall'Olio
RESULTS: manually reviewed some ~100 entries
Giovanni Dall'Olio
RESULTS: starting manual review of abstracts file. Added a column named "liability" This column indicates, on a scale from 0 to 3, the liability of the nationality guess. A 0 means that the guess is not liable. A 3 means that it is ok
Giovanni Dall'Olio
ADD: RESULT: added Country and URL fields in the results output file
Giovanni Dall'Olio
ADD: RESULTS: added abstract, authors and affiliation fields to freqs_by_nationality. Added a manually reviewed file
Giovanni Dall'Olio
ADD: RESULTS: some scripts to generate plots
Giovanni Dall'Olio
DATA: added new journals, from various disciplines (geology, psichology...)
Giovanni Dall'Olio
DATA: downloading 300 abstracts instead of 100. Fixed Database(Oxford)
Giovanni Dall'Olio
ADD: calculating mean sentence length
Giovanni Dall'Olio
ADD: pretty print function + more regex to identify nationality
Giovanni Dall'Olio
ADD: more regex to identify nationality
Giovanni Dall'Olio
FIX: correctly iterating on each journal
Giovanni Dall'Olio
ADD: stub for guessing nationality
Giovanni Dall'Olio
REFACT:ADD: splitting sentences; added main() function
Giovanni Dall'Olio
MINOR: using logging library
Giovanni Dall'Olio
ADD: main function; parsing all journals
Giovanni Dall'Olio
ADD: added a check in download_abstracts.py to skip downloading journals that have already been downloaded
Giovanni Dall'Olio
ADD: included pickle file in the repo
Giovanni Dall'Olio
ADD: using cPickle to store tagger :-)
Giovanni Dall'Olio
ADD: a function to clean the entries dict
  1. Prev
  2. Next