Licenses


Introduction

In order to make data and software more accessible, licenses are an important tool to ensure what can be used for what. By default any creator of data, software, writing or any other content involving a sufficient amount of creativity is the copyright owner of that content without having to declare the copyright explicitly. Defining or using a suitable license for published content usually has the benefit of giving all parties legal certainty and understanding of permission to use.

In sciences, two categories of licenses can applied to either software or data and results that explicitly describe whether and how others can use it.

Properties of recommendable licenses

  • Standardized, i.e. machine-readable, text
  • Available in an easy-to-read version
  • No transfer of exclusive rights
  • Compatible with many jurisdictions
  • Common

Licenses for data and other creative works

Also scientific data and output are subject to copyright if their generation requires a sufficient amount of creativity (this might be contested in some contexts). However, since scientists conduct experiments mostly within their employment of their funding institution, the copyright lies with both and should ideally be discussed with the employer (the universities legal department) in high profile cases.

Similar to the publication of source code, a license communicates who can use the data or results for what and should be distributed with the publication in repositories or databases.

Publishing figures and articles in journals, usually requires accepting the license agreement of the publisher and involves either a complete transfer of rights on your own work or picking an open access journal with acceptable permissive licenses.

Recommendations

Because publicly funded sciences should make their data and results publicly available (often required by the employer or funder), choosing a permissive license is highly encouraged.

The same criteria as for software apply and the Creative Commons (CC) licenses are most widely used as well as easily understandable.

There are different versions of CC that consist of the core license with further extensions to choose from that can be combined to achieve the desired permissions:

  • Public dedication (Zero or 0): Allows re-users to distribute, remix, adapt, and build upon the material in any medium or format, with no conditions. Excludes the following extensions.
  • Attribution (BY): Credit must be given to the creator.
  • Share-Alike (SA): Adaptations must be shared under the same terms.
  • Non-commercial (NC): Only non-commercial uses of the work are permitted.
  • Non-derivative (ND): No derivatives or adaptations of the work are permitted (not compatible with share-alike).

ccsprectrum

(Source, CC-BY Shaddim; original CC license symbols by Creative Commons)

Usage

Creators can declare a works license by selecting it during upload to a website/database/repository or attaching a line stating the title, author and license to the published work.

Software licenses

When software sources are distributed, it is considered good practice to specify an already established license under which it can be used. Software developed in the context of science and funded by public money, usually needs to be made available free of charge and in open source by requirements of the funder or the governing institution’s policies.

Recommendations

When choosing a license, multiple aspects should be regarded depending on its application and intention:

  • Standardized: While you can formulate your own license or freely modify most available licenses to better suit your needs, keep in mind that anyone interested in using/supporting/modifying your software needs to be familiar with the terms and in doubt has to read it. Choosing a popular license communicates the terms to third parties extremely efficiently, because the terms and consequences are widely known. Using fringe licenses or wording your own can unintentionally put you or others in hot legal waters down the line.
  • Easy to understand: While legal texts (including licenses) can be worded quite awkwardly in order to be legally precise, you should try to choose a license that you yourself can read and understand easily. This also implies that short licenses are favorable, since a single paragraph can be comprehended more easily than a multi-page document of clauses and eventualities.
  • Permissive: Unless you have a certain intention or conviction that your software should not be used in specific circumstances where you do not want somebody to modify your software or want them to use it for commercial purposes, you should choose permissive licenses over restrictive. Prominent somewhat restrictive examples are GNU GPL conform licenses that employ so-called “copyleft”, a restriction that requires derivative works to use the same license. What sounds like a good, viral mechanism to enforce free and open software can result in problems down the line. Once a work has been put under such a license, it is often considered impractical and thus untouchable by commercial or more permissively minded entities.

Going by the above criteria, the following licenses are particularly recommended:

Both basically allow anyone to do whatever with the software (sell, modify) as long as the original license and copyright is redistributed along with the software/sources. The text is also a disclaimer that states software is distributed “as is” without warranties.

An exhaustive list of generally recommended licenses for Open Source is curated by the Open Source Initiative.

Usage

Typically, developers or distributors add a plain-text file called LICENSE to the source code or binary of their software that contains the chosen license text. Especially source repositories like github or gitlab will allow you to choose a license per project and automatically adding such a LICENSE file to the source code. The benefit of selecting a license on the code hosting platform is the machine readable interpretation of your permissions which can potentially increase visibility in search results across the platform.

Further resources

Edit this page
On this Page