Wiki » History » Version 6

Version 5 (Marco Fabiani, 2012-05-29 12:16 PM) → Version 6/12 (Marco Fabiani, 2012-05-29 12:17 PM)

h1. Wiki

h2. Usage

SWORD2 DSpace bulk uploader

A python script to submit large numbers of files to a SWORD2-compatible repository, specifically DSpace 1.8x.
Built on the SWORD2 python client library:


- no installation required, simply copy the script to a suitable location. The first time you run the script, it will create the sword2_logging.conf file.
- a server.cfg file is also available. If the --servicedoc option is not used, sworduploader will read the first line of server.cfg and use it as the server's URL. If the server.cfg is missing, it will default to C4DM's server.


sworduploader[-h] [--username USER_NAME] [--title TITLE]
[--author AUTHOR [AUTHOR ...]] [--date DATE]
[--servicedoc DSPACEURL]

Bulk upload to DSpace using SWORDv2.

positional arguments:
data Accepts: METSDSpaceSIP and BagIt packages, simple zip
files, directories, single files. NOTE: METSDSpaceSIP
packages are only accepted by Collections with a

optional arguments:
-h, --help show this help message and exit
--username USER_NAME DSpace username.
--title TITLE Title (ignored for METS packages).
--author AUTHOR [AUTHOR ...]
Author(s) (ignored for METS packages). Accepts
multiple entries in the format "Surname, Name"
--date DATE Date of creation (string) (ignored for METS packages).
--zip If "data" is a directory, compress it and post it as a
single file. The zip file will be saved along with the
individual files.
--servicedoc SD Url of the SWORDv2 service document (default: use
server.cfg if available, otherwise http://c4dm.eecs.qm
If the submission is created successfully, it will remain open to be completed
with the necessary metadata and licenses, using the DSpace web interface. The
submission can be found in the "My Account -> Submissions" section of the
user's area.

h2. Updates

Version 0.6:
- Uploading a directory will maintain the path structure (i.e. subdirectories).
- A different server can be specified in the server.cfg file. This is overridden if the --servicedoc option is used. If the file is missing, sworduploader will default to C4DM's repository.

h2. Possible problems

Version 0.4 is based on the modified python-sword2 library that can be found on bitbucket/marcofabiani.

The script has been tested with the version now on github (richardjones), and it works IF DSpace is patched with the latest version of the sword-server-2.0 which corrects a mistake in the service document.

Issue 1: Service document verification fails (DSpace server)

richardjones on Fri, 20 Apr 2012 12:45:26 +0200:

The bug was in the Java common library used by default

See for details of multipart support requirements.

This has now been fixed in the java server library, which is hosted at

status: new -> resolved