Skip to content

Latest commit

 

History

History
18 lines (11 loc) · 982 Bytes

README.md

File metadata and controls

18 lines (11 loc) · 982 Bytes

data-capture-module

Data Capture Module to recieve uploaded datasets, and validate client-side checksums.

In more general terms, this is an external module designed to allow users to upload large datasets to a repository (designed for Dataverse) without going through http.

The presentation slides from the 2017 Dataverse Community Meeting may provide some additional information. The design is intented to be agnostic to transfer protocol, and currently implements rsync over ssh.

DCM installation

See installation instructions for DCM installation instructions, and the Dataverse Guides for configuring the two systems together.

general organization

  • api/ : external interface that repository software will call
  • gen/ : transfer script generation for rsync+ssh uploads
  • scn/ : scanning for completed uploads, and handling related tasks