Analysis Workflows Cheatsheet

Repeatable, step-by-step analysis patterns for the most common VisiData use cases.

Workflow 1: Frequency Distribution

Goal: Count how many rows exist for each value in a column.

1. Open file: vd data.csv
2. Move to column: h/l or arrow keys
3. Cast if needed: # (int) or ~ (str)
4. Open freq table: Shift+F
5. Sort by count: ] (descending)
6. Drill into group: Enter
7. Return: q
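The same count can be reproduced in plain Python when you need it in a script; the `status` column and the inline sample data are hypothetical, and `Counter.most_common` gives the descending sort that `]` provides in VisiData:

```python
import csv
import io
from collections import Counter

# Hypothetical sample data standing in for data.csv
data = io.StringIO("status\nok\nerror\nok\nok\n")
counts = Counter(row["status"] for row in csv.DictReader(data))

# Sort by count, descending (VisiData's ] on the count column)
for value, n in counts.most_common():
    print(value, n)
# ok 3
# error 1
```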

Workflow 2: Pivot Table (Cross-Tabulation)

Goal: Sum/mean of a value column grouped by two categorical columns.

1. Open file: vd data.csv
2. Cast value column: % or #
3. Add aggregator: + → enter: sum
4. Mark row key: ! on first category column
5. Mark col key: ! on second category column
6. Move to value col: h/l
7. Open pivot: Shift+W
8. Explore results: arrow keys, Enter for drill-down
9. Return: q
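A pivot is a nested sum keyed by the two categories. This sketch uses hypothetical `region`/`quarter`/`amount` columns to show what `!` + `Shift+W` computes:

```python
from collections import defaultdict

# Hypothetical rows: region and quarter are the two key columns,
# amount is the value column with the sum aggregator
rows = [
    {"region": "east", "quarter": "Q1", "amount": 10.0},
    {"region": "east", "quarter": "Q2", "amount": 5.0},
    {"region": "west", "quarter": "Q1", "amount": 7.0},
    {"region": "east", "quarter": "Q1", "amount": 3.0},
]

# pivot[row_key][col_key] -> summed value
pivot = defaultdict(lambda: defaultdict(float))
for r in rows:
    pivot[r["region"]][r["quarter"]] += r["amount"]

print(dict(pivot["east"]))  # {'Q1': 13.0, 'Q2': 5.0}
```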

Workflow 3: Data Profiling (Instant Audit)

Goal: See nulls, distinct counts, min/max for all columns.

1. Open file: vd data.csv
2. Open Describe: Shift+I
3. Read: nulls column → which columns have missing data?
4. Read: distinct → is a column constant? (distinct=1 → useless)
5. Read: min/max → spot outliers
6. Drill into column: Enter → opens frequency table for that column
7. Return: q
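The Describe Sheet's core statistics are simple per-column reductions. A minimal sketch over hypothetical sample data (note: it compares raw strings for min/max, whereas VisiData compares typed values):

```python
import csv
import io

# Hypothetical sample: one missing score
data = io.StringIO("id,score\n1,10\n2,\n3,10\n")
reader = csv.DictReader(data)
cols = {name: [] for name in reader.fieldnames}
for row in reader:
    for name, val in row.items():
        cols[name].append(val)

profile = {}
for name, vals in cols.items():
    non_null = [v for v in vals if v != ""]
    profile[name] = {
        "nulls": len(vals) - len(non_null),      # missing data?
        "distinct": len(set(non_null)),          # constant column?
        "min": min(non_null),                    # outliers (string compare here)
        "max": max(non_null),
    }

print(profile["score"])  # {'nulls': 1, 'distinct': 1, 'min': '10', 'max': '10'}
```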

Workflow 4: Filter and Export

Goal: Extract a subset of rows matching a condition.

1. Open file: vd data.csv
2. Select matching: | regex (by regex in current column)
   z| expr (by Python expression)
   , (by matching current cell value)
3. Verify count: check status bar (N rows selected)
4. Open filtered: "
5. Save filtered: Ctrl+S → new_filename.csv
6. Return: q
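Select-then-save is a filter-and-write in script form. A sketch with hypothetical `name`/`env` columns, matching rows by regex as `|` does on the current column:

```python
import csv
import io
import re

# Hypothetical input standing in for data.csv
src = io.StringIO("name,env\napi,prod\nweb,dev\ndb,prod\n")
out = io.StringIO()  # stands in for new_filename.csv

reader = csv.DictReader(src)
writer = csv.DictWriter(out, fieldnames=reader.fieldnames)
writer.writeheader()

# "Select matching" (| prod) then "Save filtered" (Ctrl+S)
selected = [row for row in reader if re.search(r"prod", row["env"])]
writer.writerows(selected)

print(len(selected))  # 2
```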

Workflow 5: Log Parsing

Goal: Parse unstructured log lines into columns.

1. Open log: vd /var/log/nginx/access.log
2. Parse with regex: ; (semicolon)
   Enter: (?P<ip>\S+) \S+ \S+ \[(?P<date>[^\]]+)\] "(?P<method>\S+) (?P<path>\S+) \S+" (?P<status>\d+) (?P<bytes>\d+)
3. Type columns: Move to status → #
   Move to bytes → #
   Move to date → @
4. Analyze: Shift+F on status → distribution
5. Drill into 500s: Enter on 500 row in freq table
6. Export: Ctrl+S → report.csv
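The same named-capture regex works directly in Python's `re`; the sample log line below is made up for illustration:

```python
import re

# The regex from step 2, split for readability
pattern = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<date>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) \S+" (?P<status>\d+) (?P<bytes>\d+)'
)

# Hypothetical access-log line
line = '1.2.3.4 - - [10/Oct/2024:13:55:36 +0000] "GET /index.html HTTP/1.1" 200 512'
m = pattern.match(line)

print(m.group("status"), m.group("path"))  # 200 /index.html
```

Each named group becomes a new column in VisiData; here they become match groups you can read by name.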

Workflow 6: Data Cleaning

Goal: Clean a messy CSV for import or analysis.

1. Open file: vd raw_data.csv
2. Inspect: Shift+I (Describe Sheet)
3. Fix types: # % @ on numeric/date columns
4. Fill nulls: f on columns with propagating values
   ge "default" on columns needing a fixed fill (select the null rows first)
5. Normalize text: gs → g* → (?i)ACTIVE<Tab>active
6. Remove invalids: z| expr → gd
7. Check duplicates: Shift+F on key column → look for count > 1
8. Verify: Shift+I again → confirm nulls=0, types correct
9. Export: Ctrl+S → clean_data.csv
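Steps 3–5 amount to casting, filling, and case-normalizing each row. A sketch with hypothetical `state`/`score` columns, mirroring the `(?i)` regex replace from step 5:

```python
import re

# Hypothetical messy rows from raw_data.csv
rows = [
    {"state": "ACTIVE", "score": "10"},
    {"state": "Active", "score": ""},
]

for r in rows:
    # Normalize text case, like gs → g* → (?i)ACTIVE → active
    r["state"] = re.sub(r"(?i)^active$", "active", r["state"])
    # Fixed fill for nulls plus an int cast, like ge "0" then #
    r["score"] = int(r["score"]) if r["score"] else 0

print(rows[1])  # {'state': 'active', 'score': 0}
```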

Workflow 7: Sysadmin Process Inspection

Goal: Interactively inspect running processes.

1. Launch: ps aux | vd -f fixed --skip 1
2. Type CPU: % on %CPU column
3. Type MEM: % on %MEM column
4. Sort by CPU: ] (descending)
5. Filter high-CPU: z| → CPU > 5 (rename %CPU to CPU first with ^; Python expressions need valid identifiers)
6. Open filtered: "
7. Inspect PIDs: note the PID column values

Workflow 8: Batch File Conversion

Goal: Convert data format without interactive UI.

# CSV → JSON
vd -b input.csv -o output.json

# JSONL → CSV
vd -b events.jsonl -o events.csv

# Excel → TSV
vd -b report.xlsx -o report.tsv

# With row limit (sample conversion)
vd -b --max-rows 10000 input.csv -o sample.json

# Replay a recorded session on new input
vd --play clean_workflow.vdj --batch new_input.csv -o cleaned.csv

Workflow 9: Join Two Datasets

Goal: Enrich one dataset with columns from another.

1. Open both files: vd file1.csv file2.csv
2. Set key on file1: ! on 'id' column
3. Set key on file2: ! on 'id' column (switch with Ctrl+^)
4. Open Sheets Sheet: Shift+S
5. Select both: s on each sheet row
6. Join: & → choose jointype (inner/outer/full)
7. Result sheet opens: verify row count and columns
8. Save: Ctrl+S → joined.csv
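An inner join on a shared key is a dict lookup. This sketch uses hypothetical `orders`/`customers` data joined on `id`, matching what `&` with jointype inner produces:

```python
# Hypothetical file1.csv rows and file2.csv indexed by its key column
orders = [{"id": 1, "total": 9.5}, {"id": 2, "total": 3.0}]
customers = {1: {"name": "Ada"}, 2: {"name": "Lin"}}

# Inner join: keep only rows whose key exists on both sheets
joined = [
    {**o, **customers[o["id"]]}
    for o in orders
    if o["id"] in customers
]

print(joined[0])  # {'id': 1, 'total': 9.5, 'name': 'Ada'}
```

An outer join would instead keep unmatched rows and fill the missing columns with nulls.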

Workflow 10: CommandLog Replay (Reproducible Analysis)

Goal: Save and replay an analysis for documentation or automation.

During interactive session:
1. Perform all your analysis steps
2. Open CommandLog: Shift+D
3. Save: Ctrl+S → workflow.vdj

Replay later:
vd --play workflow.vdj # interactive replay
vd --play workflow.vdj --batch -o result.csv # batch replay
vd --play workflow.vdj --replay-wait 2 # demo (2s between steps)

Quick Analysis Decision Matrix

What you need                VisiData command
Count by category            Shift+F (frequency table)
Sum/mean by category         + aggregator, then Shift+F
Cross-tab two categories     key columns with !, then Shift+W
All column statistics        Shift+I (Describe Sheet)
Filter to matching rows      | to select, then " to open selected
Plot a numeric column        . (after setting key column with !)
Compare two variables        ! on x column, move to y column, . (scatterplot)
Join two files               Shift+S → select both → &
Convert file format          vd -b input.ext -o output.ext
Parse log fields             ; with named-capture regex

Common Command Sequences

# Open → frequency → drill → export
vd f.csv → Shift+F → ] → Enter → Ctrl+S

# Open → type → pivot → export
vd f.csv → # on value → + sum → ! on keys → Shift+W → Ctrl+S

# Open → parse log → analyze → export
vd log → ; regex → # on status → Shift+F → | 500 → " → Ctrl+S

# Open → filter → clean → export
vd f.csv → Shift+I → # % @ → f → gs → g* → gd → Ctrl+S

Next Steps

You have completed the VisiData curriculum. Return to any module for deeper practice: