Frequently Asked Questions - Submitting and using Data
What types of data will the eAtlas publish?
The eAtlas publishes environmental datasets about tropical Australia, with a particular focus on the Great Barrier Reef, its catchments, Torres Strait and North West Australia. The eAtlas automatically accepts data from NESP TWQ projects, but will also publish other relevant datasets that complement the existing catalogue.
How do I submit data to the eAtlas?
Please follow the Submitting data to the eAtlas guide.
In what file format(s) should I submit my data?
Due to the diversity of datatypes hosted by the repository, the eAtlas does not require a single file format, or even set of file formats, but rather encourages users to follow accepted community standards where they exist. Users are strongly encouraged to ensure that data can be easily extracted from the files provided (e.g., by using CSV rather than PDF).
To maximize accessibility and preservability, the eAtlas always prefers non-proprietary, openly documented formats. However, data submitted to the eAtlas should also be optimized for reanalysis and reuse.
For example, for a table of values, the eAtlas's preferred format is CSV; however, a spreadsheet that was created and formatted using Microsoft Excel may optimize reuse by remaining in XLS format. For this reason the eAtlas accepts more than just the preservation-friendly formats listed below.
Text: plain text (ASCII, UTF-8), Open Office formats
Tabular data: Comma-separated (or otherwise delimited) values for tabular data or XLS
Images: JPEG, PNG, TIFF, SVG
Compressed/archived formats: GZIP/TAR, ZIP
Files may be compressed and/or archived when necessary due to large file size, a large number of files (more than 10), or the need to gather files together in a particular directory structure in order for them to be understood.
GIS files: Shapefile, GeoTiff, NetCDF
The eAtlas also welcomes the submission of data in multiple formats, to help anticipate various reuse scenarios.
Data citation practices are actively evolving and vary among journals. The eAtlas does not have a recommendation for the placement of data citations. Some publishing organizations, such as CrossRef, recommend reporting the data both in the text (e.g., within the Methods or in a dedicated Data Availability section) and in the Bibliography.
When referencing data in the text, we recommend the following as a template (substitute your UUID suffix for the xxxxx):
Data available from the eAtlas Repository: http://eatlas.org.au/data/uuid/xxxxx
In the Bibliography, we recommend a citation similar to:
Grech A., Sheppard J. and Marsh H. (2011). Dataset: Torres Strait Dugong distribution and relative density - Spatial model of aerial surveys from 1987 - 2011 (NERP TE 2.1, JCU) [Dataset].eAtlas Repository. http://eatlas.org.au/data/uuid/70e21d20-cc5e-4d1d-9d2b-7b08f4b061a2
Catterall C., Freebody K. and Shoo L. (2012). Aerial photo mosaic of Atherton Tablelands in June 1978 (NERP TE 12.2, Griffith, source: DERM) [Dataset]. eAtlas Repository. http://eatlas.org.au/data/uuid/ce9d3908-9e7f-487d-9b58-bab5f2dfaff7
Lawrey E. (2013). Queensland Ship Vessel Tracking (AIS) May - Aug 2013 (NERP TE 13.1 eAtlas, source: AMSA) [Dataset]. eAtlas Repository. http://eatlas.org.au/data/uuid/5dc65f37-13c8-4a15-a69b-9a046e30a31d
Beaman, R. (2012). Project 3DGBR: Great Barrier Reef and Coral Sea Geomorphic Features (JCU) [Dataset]. eAtlas Repository. http://eatlas.org.au/data/uuid/25685ba5-6583-494f-974d-cce2f3429b78
How may data from eAtlas be reused?
The datasets in the eAtlas are made available under a range of licenses depending on the source of the data. Some datasets are restricted and not available for download, in which case you would need to contact the author of the data to access it. In other cases the data is available for download, but there are licensing restrictions that must be complied with. To know what you can do with a dataset you should check the Data Usage Constraints section of the dataset page.
By default most datasets on the eAtlas are made available under a Creative Commons Attribution license which allows you to use the dataset in any manner you like as long as the dataset is properly attributed. For how to do this see how to cite datasets.
Do I have to pay to download or use the data?
No, all the content in eAtlas is free to download and reuse.
For a dataset to be published as a full record (metadata record and downloadable data) it must meet the following minimum standard. This includes:
- Is the dataset relevant for understanding the environment (or how people use or value the environment) of Australia?
- Is the dataset documented sufficiently so that others can understand and use the data? This includes basic questions such as
- What is the dataset about?
- What methods were used in its collection and creation?
- Is a data dictionary provided that describes the data fields, any non-obvious codes and data units?
- Who created the dataset?
- When was the data collected/created?
- What is the license that it is made available under?
- Is the dataset free of logical inconsistencies or errors due to poor data cleaning and preparation?
- Is the data in a format that can be reused by others? If not, can it be easily converted?
- Is the dataset likely to be useful in applications other than the original research?
Datasets that do not meet this standard may not be published as full records, but will be recorded as a "Dataset teaser", which is in effect a simplified metadata record.
What is the review process applied to datasets submitted to the eAtlas?
Data submitted to the eAtlas is reviewed by the editors to ensure that the datasets meet a minimum level of quality prior to publication (see What is the requirements for a dataset to be published in the eAtlas?).
The eAtlas team provides a curated service and will review and attempt to resolve and improve the dataset documentation issues from associated papers and technical reports, clean obvious data errors and reformat the data into open formats for publication. Any remaining issues and questions will be sent back to the researcher. Once the researchers has provided a response to this questions then the dataset will be published.
Any remaining unresolved issues with the data will be noted in the metadata record to assist others with using the dataset.
Datasets are not peer reviewed, but are checked for anomalies.
Sections of text in this FAQ were based on the the FAQ page from Dryad.