Inner, outer, right, cross, anti & semi joins. Automatically creates a simple, in-memory hash index to make it fast.
Table of Contents | Source: src/cmd/join.rs | 😣👆
Description | Usage | Arguments | Join Options | Join Key Transformation Options | Common Options
Description ↩
Joins two sets of CSV data on the specified columns.
The default join operation is an 'inner' join. This corresponds to the intersection of rows on the keys specified.
Joins are always done by ignoring leading and trailing whitespace. By default, joins are done case sensitively, but this can be disabled with the --ignore-case flag.
For examples, see https://github.com/dathere/qsv/blob/master/tests/test_join.rs.
Usage ↩
qsv join [options] <columns1> <input1> <columns2> <input2>
qsv join --helpArguments ↩
| Argument | Description |
|---|---|
<input1> |
is the first CSV data set to join. |
<input2> |
is the second CSV data set to join. |
<columns1> |
& are the columns to join on for each input. |
Join Options ↩
| Option | Type | Description | Default |
|---|---|---|---|
‑‑left |
flag | Do a 'left outer' join. This returns all rows in first CSV data set, including rows with no corresponding row in the second data set. When no corresponding row exists, it is padded out with empty fields. | |
‑‑left‑anti |
flag | Do a 'left anti' join. This returns all rows in first CSV data set that has no match with the second data set. | |
‑‑left‑semi |
flag | Do a 'left semi' join. This returns all rows in first CSV data set that has a match with the second data set. | |
‑‑right |
flag | Do a 'right outer' join. This returns all rows in second CSV data set, including rows with no corresponding row in the first data set. When no corresponding row exists, it is padded out with empty fields. (This is the reverse of 'outer left'.) | |
‑‑right‑anti |
flag | This returns only the rows in the second CSV data set that do not have a corresponding row in the first data set. The output schema is the same as the second dataset. | |
‑‑right‑semi |
flag | This returns only the rows in the second CSV data set that have a corresponding row in the first data set. The output schema is the same as the second data set. | |
‑‑full |
flag | Do a 'full outer' join. This returns all rows in both data sets with matching records joined. If there is no match, the missing side will be padded out with empty fields. (This is the combination of 'outer left' and 'outer right'.) | |
‑‑cross |
flag | USE WITH CAUTION. This returns the cartesian product of the CSV data sets given. The number of rows return is equal to N * M, where N and M correspond to the number of rows in the given data sets, respectively. | |
‑‑nulls |
flag | When set, joins will work on empty fields. Otherwise, empty fields are completely ignored. (In fact, any row that has an empty field in the key specified is ignored.) | |
‑‑keys‑output |
string | Write successfully joined keys to . This means that the keys are written to the output file when a match is found, with the exception of anti joins, where keys are written when NO match is found. Cross joins do not write keys. |
Join Key Transformation Options ↩
| Option | Type | Description | Default |
|---|---|---|---|
‑i,‑‑ignore‑case |
flag | When set, joins are done case insensitively. | |
‑z,‑‑ignore‑leading‑zeros |
flag | When set, leading zeros are ignored in join keys. |
Common Options ↩
| Option | Type | Description | Default |
|---|---|---|---|
‑h,‑‑help |
flag | Display this message | |
‑o,‑‑output |
string | Write output to instead of stdout. | |
‑n,‑‑no‑headers |
flag | When set, the first row will not be interpreted as headers. (i.e., They are not searched, analyzed, sliced, etc.) | |
‑d,‑‑delimiter |
string | The field delimiter for reading CSV data. Must be a single character. (default: ,) |
Source: src/cmd/join.rs
| Table of Contents | README