Overview

Wikitalk:

This a repository for data/scripts used in the paper described at
https://sites.google.com/site/rmyeid/projects/english-writing-styles

data.json.txt.details:
----------------------- 
This is the description of the columns:
autoid: Index that is unique for each row
text: The actual comment
lang: native user language
user_name: wikipedia username
page_id: The ID of the wikipedia page that the comment appeared in.
page_title: Title of the wikipedia page that the comment appeared in.
time_stamp: The time stamp of the comment, as mentioned in the comment signature. Could be null
level: English proficiency of the user.



users_props.json.txt:
---------------------
This is the description of the fields:
username:
langs: set of the languages that the user claims knowledge of.
comm_size: number of (characters/wrods, not sure!) that is used in all the user comments.
comm_num: number of comments that the user contributed.