Compared Against Existing Systems
Using Tranche has several advantages over other common data sharing methods. Below are some examples of the existing methods of sharing data and the flaws researchers may encounter. None of these systems were designed and built for scientific data sharing as the core function. No other file sharing method offers the same ability to annotate your data, sercure file storage and publishing options as Tranche.
Email
- Most data generated from scientific research is much too large for emailing.
- Does not allow for sharing or disseminating data in the public realm.
- No built-in method of annotating data sets.
- Not an efficient method for transferring data.
- Not secure.
- No method of validating data as original.
File Transfer Protocol (FTP)
- Single source point for downloading creates bandwidth .
- Must set up an FTP server (or have access to one).
- No built-in method of annotating data sets.
- Less secure.
- No method of validating data as original.
BitTorrent (or similar file sharing networks)
- Unmonitored and unrestricted users.
- Was not built with the needs for scientific data in mind.
- No built-in method of annotating data sets.
Publishing Options
There are several similarities and differences between how author's traditionally publish data (hyperlinks) versus using a Hash as described by this document.
- The hash is based on the data itself. It is not an arbitrary URL.
- Anyone with the data can use the hash to verify that the data is identical to what was published.
- Software results are more reproducible because you'll know if, or if not, the software or data has changed since publication.
- Any server on the network can look up the data based on its hash.
- Hashes don't change. You will never have a 'broken link', which often occurs with URLs.
- You have many choices for downloading the data as the network isn't restricted to HTTP and web browsers.
- You can still download the data straight from a web browser (as shown in the demo application)
- You can also download data from the network using many other tools, including customized programs.
- The hash is based on a standard algorithms -- no new scheme for referencing is being made up.
Tips
Related Guides
Troubleshooting