BigData Home

Ebulk tool

Tool description

Ebulk makes it easy to exchange or archive very large data sets. It performs data set ingestions or downloads from different protocols, to Wendelin Big Data platform. It also allows to perform local changes in data sets and to upload added and modified files. One key feature of Ebulk is to be able to resume and recover from errors happening with interrupted transfers.

Requirements

Ebulk tool relies on Embulk Java application (see docs). Please make sure that Java 8 is installed.

Installation

Debian / Ubuntu Linux OS

Please follow these instructions.

gpg --keyserver hkp://keyserver.ubuntu.com:80 --recv-keys B38DB8D777BB9026

gpg --output /usr/share/keyrings/ebulk-ebulk-ppa-archive-keyring.gpg --export B38DB8D777BB9026

echo "deb [signed-by=/usr/share/keyrings/ebulk-ebulk-ppa-archive-keyring.gpg] http://ppa.launchpad.net/ebulk/ebulk-ppa/ubuntu xenial main" > /etc/apt/sources.list.d/ebulk-ebulk-ppa.list

apt update

apt install ebulk curl

Linux Source

Git project repository