- Plugin type: output
- Load all or nothing: no
- Resume supported: no
- Cleanup supported: yes
-
path_prefix: A prefix of output path. (string, required)
- support:
file,s3,s3nands3a.
- support:
-
file_ext: An extension of output file. (string, default:
.orc) -
sequence_format: (string, default:
.%03d) -
buffer_size: Set the ORC buffer size (integer, default:
262144(256KB)) -
strip_size: Set the ORC strip size (integer, default:
67108864(64MB)) -
block_size: Set the ORC block size (integer, default:
268435456(256MB)) -
compression_kind: description (string, default:
'ZLIB')NONE,ZLIB,SNAPPY,LZO,LZ4
-
overwrite: Overwrite if output files already exist. (boolean, default:
false)- Support:
LocalFileSystem,S3(s3, s3a, s3n)
- Support:
-
default_from_timezone Time zone of timestamp columns. This can be overwritten for each column using column_options (DateTimeZone, default:
UTC) -
auth_method: name of mechanism to authenticate requests (basic, env, instance, profile, properties, anonymous, or session. default: basic)
see: https://github.com/embulk/embulk-input-s3#configurationenv,basic,profile,default,session,anonymous,properties
out:
type: orc
path_prefix: "/tmp/output"
compression_kind: ZLIB
overwrite: true- Bump
orclibrary to1.5.4 - bugfix
- bugfix
- Bump
orclibrary to1.4.4
- Update
orclibraries to1.4.3
-
Change default value : (block_size, buffer_size, strip_size)
- default value is Hive's default value.
(see: https://orc.apache.org/docs/hive-config.html)
- default value is Hive's default value.
-
support: output to s3
s3n,s3aprotocol
- initial release
$ ./gradlew gem # -t to watch change of files and rebuild continuously