Data Preparation - shellywhen/measure-flow GitHub Wiki
Following Vistorian, MeasureFlow imports data as tables stored in CSV (comma-separated values) files. Each CSV file contains one table. You can export a CSV file from:
- MS Excel (File → Save As → Select file type 'CSV')
- Apple's Numbers (File → Export to → CSV)
- Google Sheets (File → Download as → Comma-separated values)
- a text editor; you can use the online tool CSVLint to check that your CSV file is properly formatted
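If you would rather check a file locally, the sketch below shows one basic well-formedness test: every record should have the same number of fields as the header row. It uses only Python's standard csv module. The function name check_csv is hypothetical, and the check is far less thorough than a dedicated tool such as CSVLint.

```python
import csv
import io


def check_csv(text):
    """Return a list of problems found in CSV text (an empty list means none were found)."""
    rows = list(csv.reader(io.StringIO(text)))
    if not rows:
        return ["file is empty"]
    problems = []
    width = len(rows[0])  # the header row defines the expected number of fields
    for number, row in enumerate(rows[1:], start=2):
        if len(row) != width:
            problems.append(
                f"row {number}: expected {width} fields, found {len(row)}"
            )
    return problems


# Hypothetical sample: the second data row has one field too many.
sample = "Source,Target,Time\n1,2,2020-01-15\n2,3,2020-02-01,extra\n"
print(check_csv(sample))
```

To check a file on disk, read it with open(path, newline="", encoding="utf-8") and pass the resulting text to the function.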
This page explains what information you can put in your CSV files so that MeasureFlow can help you explore it with visualizations. A few rules determine where each piece of information goes.
Terminology
This document and the Manual follow general network terminology. Several terms exist for the same concepts: a network (or graph) is a set of nodes (also called actors or vertices) and links (also called relations, edges, or arcs). Attributes are values associated with nodes and links. Our network model currently supports the following information:
- multiple links between the same pair of nodes
- time, i.e. network topology and attributes that change over time
Data Tables
MeasureFlow accepts two types of tables, depending on your network.
Link Table
A link table contains one row per link in the network while columns specify link attributes.
Source | Target | Time | Attr_0 | ... |
---|---|---|---|---|
Note that the source, target, and time fields are required. The source and target fields hold the ids of the two nodes that the link connects; ids are strings, preferably integers. The time column must use one consistent format that can be described with moment.js format tokens (for example, YYYY-MM-DD), and you specify that format on the data upload page.
Attribute columns are not required; however, you may include additional information such as
- link weight
- link type
- link id
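As an illustration, the following sketch writes a small link table with Python's standard csv module. The link data, the Weight column, and the function name link_table_csv are made up for the example. The header names are taken from the table above; this page does not state whether MeasureFlow requires these exact names. The dates use the pattern that moment.js writes as YYYY-MM-DD.

```python
import csv
import io

# Hypothetical links: (source id, target id, time, weight).
links = [
    ("1", "2", "2020-01-15", 3),
    ("1", "2", "2020-02-01", 1),  # a second link between the same pair of nodes
    ("2", "3", "2020-02-01", 2),
]


def link_table_csv(rows):
    """Return CSV text with one header row followed by one row per link."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["Source", "Target", "Time", "Weight"])
    writer.writerows(rows)
    return buffer.getvalue()


print(link_table_csv(links))
```

Saving the returned text to a .csv file gives one table per file, as described at the top of this page.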
Node Table
Id | Attr_0 | ... |
---|---|---|
Similarly, attribute columns are not required, but you may want to assign
- node type
- alias
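A node table can be derived from the ids used in a link table. The sketch below assumes that the Id column should contain the same ids that appear in the Source and Target columns, which this page implies but does not state outright. The function name node_table_csv, the Alias column name, and the sample data are hypothetical.

```python
import csv
import io


def node_table_csv(link_rows, aliases):
    """Return node-table CSV text with one row per node id used in the links.

    link_rows: sequences whose first two items are the source and target ids.
    aliases:   dict mapping a node id to a display name; missing ids get "".
    """
    # Collect every id that appears as a source or a target, without duplicates.
    ids = sorted({row[i] for row in link_rows for i in (0, 1)})
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["Id", "Alias"])
    for node_id in ids:
        writer.writerow([node_id, aliases.get(node_id, "")])
    return buffer.getvalue()


# Hypothetical data: three nodes, only one of which has an alias.
print(node_table_csv([("1", "2"), ("2", "3")], {"1": "Alice"}))
```

Note that the ids are sorted as strings, so "10" sorts before "2"; this affects only the row order, not the content.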