
Notebooks have been an excellent tool for prototyping and writing examples to showcase computational analyses (Rule et al., 2019). EDS book complements the science and methodological development of academic journals by translating research outputs into FAIR notebooks using open infrastructure and open review (Figure 1).

The Environmental Data Science project aims to engage the wider scientific research community on information extraction and analysis from environmental sensors using innovative data science.

Figure 1: The EDS book community supports and connects environmental scientists and practitioners to communicate their science through customisable interactive notebooks which benefit from a collaborative reviewing process. Illustration by Scriberia as part of The Turing Way book dash in November 2022 (Community & Scriberia, 2023).

Vision

Environmental scientists work collaboratively to demonstrate and communicate their science through FAIR executable notebooks and have gained significant skills to publish in notebook-based scholarly publication systems.

Mission

Educate and leverage good scientific software and data management practices among environmental scientists through peer-reviewed findable, accessible, interoperable and reusable (FAIR) executable notebooks.

In addition to the book, our goal is to build a computational notebook community putting open science into practice towards collaborative, reusable and transparent environmental research.

Who is the book for?

While the scientific community is broad, we think the target audience of this book is:

For more information on who can contribute, please see the contributors section.

Why EDS book notebooks?

EDS book notebooks contribute to open, collaborative and transparent environmental science. A shared goal among our community-driven notebooks is to increase awareness of, and demonstrate, open-source software developments and open data suited to environmental science.

EDS book notebooks support the FAIR principles for research software (FAIR4RS) (Barker et al., 2022), as described below:

In addition to following FAIR principles, we expand the pedagogical possibilities of Jupyter Notebook and community-centred open infrastructure services. Furthermore, we contribute to building and connecting the international community of environmental data scientists and open-source enthusiasts at all levels of seniority.

Since the official launch of EDS book in December 2021, the community has published numerous notebooks covering the exploration, preprocessing and modelling of environmental datasets.

Technologies

This section describes the main technology stack behind EDS book and its published notebooks. We largely use and contribute to open infrastructure and open-source software to provide a collaborative and reproducible environment for our community.

Binder

The Binder project offers an easy place to share computing environments with everyone. It allows users to specify custom environments and share them with a single link. Use cases include workshops, scientific workflows and streamlined sharing among teams.

Binder is entirely powered by an open-source infrastructure stack. Its two main tools are BinderHub, which deploys the Binder service in the cloud, and repo2docker, which generates reproducible Docker images from a Git repository. The Binder team also runs a public BinderHub deployment at mybinder.org as a free service for the community.

For EDS book notebooks, we use the public BinderHub to provide a cloud-based service for users to run notebooks without installing any software on their local machine. Besides the public service, we suggest launching notebooks in the EGI BinderHub deployment, a private BinderHub instance hosted by the European Grid Infrastructure (EGI) and the European Open Science Cloud (EOSC) initiative.
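The "single link" mentioned above can be sketched in a few lines of Python. This is a minimal illustration, assuming the `/v2/gh/<org>/<repo>/<ref>` launch URL scheme used by mybinder.org and its `labpath` query parameter for opening a file in JupyterLab. The organisation, repository and notebook names are hypothetical placeholders, and another BinderHub deployment can be targeted by passing its base URL as `hub`, on the assumption that it follows the same scheme.

```python
from urllib.parse import quote, urlencode


def binder_launch_url(org, repo, ref="HEAD", notebook_path=None,
                      hub="https://mybinder.org"):
    """Build a launch link for a GitHub repository on a BinderHub deployment.

    `org`, `repo` and `ref` identify the GitHub repository and the branch,
    tag or commit to build; `notebook_path` optionally names a file to open.
    """
    # Percent-encode each path segment so that e.g. a branch name
    # containing "/" does not break the URL structure.
    segments = [quote(part, safe="") for part in (org, repo, ref)]
    url = f"{hub.rstrip('/')}/v2/gh/" + "/".join(segments)
    if notebook_path:
        # `labpath` asks Binder to open the given file in JupyterLab.
        url += "?" + urlencode({"labpath": notebook_path})
    return url


# Hypothetical repository and notebook names, for illustration only.
print(binder_launch_url("example-org", "example-notebook",
                        ref="main", notebook_path="notebook.ipynb"))
```

Keeping the hub base URL as a parameter means the same helper can produce links for the public service and for a private deployment without changing the rest of the code.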

You can find out more about Project Binder on their About mybinder.org page.

MyST

MyST is an open-source, community-driven markup language project building upon CommonMark (a standard form of Markdown) with special syntax extensions (Cockett et al., 2024). In 2022, the Executable Books project, which hosts Jupyter Book and MyST, started work on the mystmd command line interface (CLI), which was initially developed as the Curvenote CLI and later transferred to the Executable Books project. In June 2024, MyST Markdown officially became part of Project Jupyter. For EDS book, Jupyter Book, a distribution of the MyST Document Engine, is used to render the content of the book and notebooks.

For more about MyST, see https://mystmd.org/.

Netlify

Netlify is a cloud-based platform that provides hosting and serverless backend services for web applications and static websites. EDS book uses Netlify to host the static website of the book, which is built using Jupyter Book and MyST. Netlify provides a free tier for open-source projects, which allows EDS book to host the website without any cost.

For more about Netlify, see https://www.netlify.com/.

Quay.io

Quay.io is a container registry that allows users to host and share container images. It is a service provided by Red Hat and is part of the OpenShift Container Platform. Quay.io is used to host the Docker images for the EDS book notebooks, which are built for executing the notebooks in GitHub and Binder.

For more about Quay.io, see https://quay.io/.

ReviewNB

The interaction between authors and reviewers in notebook submissions for EDS book is facilitated through ReviewNB, a third-party GitHub plugin for displaying and commenting on Jupyter Notebooks. ReviewNB is a GitHub-verified marketplace app that renders all interactive HTML/JavaScript notebook outputs, except bokeh-related widgets. The app allows commenting on rich diffs and renders even the largest notebooks without timing out.

The main features of ReviewNB according to its official documentation are:

For more about ReviewNB, see https://blog.reviewnb.com/.

RoHub

RoHub (Garcia-Silva et al., 2018) is a Research Object management platform that enables researchers to collaboratively manage, share and preserve their research work (data, software, workflows, models, presentations, videos, articles, etc.). RoHub implements the full RO model and paradigm: resources associated with a particular research work are aggregated into a single FAIR digital object, and metadata relevant for understanding and interpreting the content is represented as semantic metadata that is both user- and machine-readable. All EDS book notebooks are findable in RoHub, which allows tracking the full research life cycle and derivative work of each notebook.

For more about RoHub, see https://reliance.rohub.org/.

References
  1. Rule, A., Birmingham, A., Zuniga, C., Altintas, I., Huang, S.-C., Knight, R., Moshiri, N., Nguyen, M. H., Rosenthal, S. B., Pérez, F., & Rose, P. W. (2019). Ten simple rules for writing and sharing computational analyses in Jupyter Notebooks. PLOS Computational Biology, 15(7), 1–8. 10.1371/journal.pcbi.1007007
  2. Community, T. T. W., & Scriberia. (2023). Illustrations from The Turing Way: Shared under CC-BY 4.0 for reuse. Zenodo. 10.5281/zenodo.7587336
  3. Barker, M., Chue Hong, N. P., Katz, D. S., Lamprecht, A.-L., Martinez-Ortiz, C., Psomopoulos, F., Harrow, J., Castro, L. J., Gruenpeter, M., Martinez, P. A., & Honeyman, T. (2022). Introducing the FAIR Principles for research software. Scientific Data, 9(1), 622. 10.1038/s41597-022-01710-x
  4. Cockett, R., Purves, S., Koch, F., & Morrison, M. (2024). Continuous Tools for Scientific Publishing: Using MyST Markdown and Curvenote to encourage continuous science practices. Proceedings of the 23rd Python in Science Conference, 121–136. 10.25080/nkvc9349
  5. Garcia-Silva, A., Gomez-Perez, J. M., Palma, R., Krystek, M., Mantovani, S., Foglini, F., Grande, V., De Leo, F., Salvi, S., Trasati, E., Romaniello, V., Albani, M., Silvagni, C., Leone, R., Marelli, F., Albani, S., Lazzarini, M., Napier, H. J., Glaves, H. M., … Altintas, I. (2018). Enabling FAIR Research in Earth Science through Research Objects. arXiv. 10.48550/arXiv.1809.10617