cascalog.cascading.io documentation
TODO: Some of these things need to move into hadoop-util.
delete-all
(delete-all bindings)
delete-file-recursively is preemptive delete on exiting the code
block for repl and tests run in the same process.
delete-all-fs
(delete-all-fs fs paths)
delete-file-recursively
(delete-file-recursively f & [silently])
Delete file f. If it's a directory, recursively delete all its contents.
Raise an exception if any deletion fails unless silently is true.
get-bytes
(get-bytes bytes)
Extracts a byte array from a Hadoop BytesWritable object. As
mentioned in the [BytesWritable javadoc](http://goo.gl/cjjlD), only
the first N bytes are valid, where N = `(.getLength byteswritable)`.
temp-dir
(temp-dir sub-path)
1) creates a directory in System.getProperty("java.io.tmpdir")
2) calls tempDir.deleteOn Exit() so the file is deleted by the jvm.
reference: ;http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=4735419
deleteOnExit is last resort cleanup on jvm exit.
temp-path
(temp-path sub-path)
tmp-dir-property
Use this variable as key in JobConf if you want to override the
root of temporary paths. See with-fs-tmp.
with-fs-tmp
macro
(with-fs-tmp [fs-sym & tmp-syms] & body)
Generates unique, temporary path names as subfolders of <root>/cascalog_reserved.
<root> by default will be '/tmp', but you can configure it via the
JobConf property `cascalog.io/tmp-dir-property`.
with-log-level
macro
(with-log-level level & body)
with-tmp-files
macro
(with-tmp-files bindings & body)
write-lines
(write-lines f lines)
Writes lines (a seq) to f, separated by newlines. f is opened with
writer, and automatically closed at the end of the sequence.