Commit 57bd0dd2 authored by Ian Dennis Miller's avatar Ian Dennis Miller

- change xmlstarlet to xml-coreutils

- include q data as text for CSV work
parent 86aff1df
......@@ -6,17 +6,20 @@ This project provides a guide for working with the data housed at SISR Lab.
Data are accessible from `~/Data` and `/mnt/data/share`.
## Command Line Data Exploration
## Peeking at the Data
It is possible to quickly inspect the data without the need to perform a full import.
It is possible to quickly inspect the data on the command line.
There is no need to create a full analysis environment if you just want to quickly look at the data.
- JSON
- query with `/usr/bin/jq`
- https://stedolan.github.io/jq/
- XML
- query with `/usr/bin/xmlstarlet`
- http://xmlstar.sourceforge.net/
- https://www.ibm.com/developerworks/library/x-starlet/index.html
- query with `/usr/local/bin/xml-ls`
- http://www.lbreyer.com/xml-coreutils.html
- CSV
- query with `/usr/bin/q`
- http://harelba.github.io/q/
## Data Set Descriptions
......@@ -32,4 +35,10 @@ It is possible to quickly inspect the data without the need to perform a full im
### wikipedia
```
xml-ls --attributes \
~/Data/wikipedia/wikidatawiki-20180701-pages-articles.xml \
:/mediawiki/page/title |less
```
### youtube
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment