dataset-guide, commit 86aff1df, authored Sep 30, 2018 by Ian Dennis Miller
Update README.md
parent 2cca4c6c
Showing 1 changed file with 33 additions and 1 deletion: README.md
# dataset-guide
This project provides a guide for working with the data housed at SISR Lab.
## Location
Data are accessible from `~/Data` and `/mnt/data/share`.
## Command Line Data Exploration
The data can be inspected quickly from the command line, without performing a full import.
- JSON
  - query with `/usr/bin/jq`
  - https://stedolan.github.io/jq/
- XML
  - query with `/usr/bin/xmlstarlet`
  - http://xmlstar.sourceforge.net/
  - https://www.ibm.com/developerworks/library/x-starlet/index.html
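
As a rough sketch of what this kind of inspection can look like, the commands below run `jq` and `xmlstarlet` against two small sample files. The sample files, their field and element names (`id`, `author`, `text`, `posts`, `post`), and the temporary directory are hypothetical and exist only so the commands can be run as-is; they do not describe the layout of the actual data sets. Both tools are assumed to be on the `PATH`.

```shell
# Hypothetical sample data, created only for illustration;
# the real files live under ~/Data or /mnt/data/share.
tmp=$(mktemp -d)
printf '%s\n' \
  '{"id": 1, "author": "alice", "text": "hello"}' \
  '{"id": 2, "author": "bob", "text": "world"}' > "$tmp/sample.json"
cat > "$tmp/sample.xml" <<'EOF'
<posts>
  <post id="1"><author>alice</author></post>
  <post id="2"><author>bob</author></post>
</posts>
EOF

# JSON: list the keys of the first record to get a feel for the schema.
head -n 1 "$tmp/sample.json" | jq 'keys'

# JSON: pull one field from every record, printed as raw (unquoted) strings.
jq -r '.author' "$tmp/sample.json"

# XML: print the element structure of the document.
xmlstarlet el "$tmp/sample.xml"

# XML: print the text of every node matching an XPath expression.
xmlstarlet sel -t -v '//post/author' -n "$tmp/sample.xml"
```

For large files, piping through `head` before `jq` (as in the first JSON command) keeps the inspection fast, provided the file stores one JSON record per line.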
## Data Set Descriptions
### 4chan
### gold
### government corpus
### reddit
### twitter
### wikipedia
### youtube