Gito

A lightweight and safe container for bioinformatics


Full Description:


GITO, a lightweight and safe container docker image based on Alpine containing bioinformatics tools ready to be used. The base image is only 4.41 MB in size and contains only the OS libraries and language de-pendencies required to run an application, but robust enough to run bioinformatics pipeline with security. GITO is an open source project Docker and provides an easy and modular method for building, distributing, and replicating pipelines, which has a base image (GITO-base) that forms the single foundation layer on which each container application is built.

GITO adopts "start with minimum and add dependencies as needed" approach, which have separate containers for each bioinformatics tool with the minimum to run. GITO use musl as the standard C library for Linux-based systems, called musl-libc. Musl-libc is free, lightweight and smaller in size compared to the Glibc library. The source code for bioinformatics tools, written in C, C ++ and Java, has been compiled for native Alpine executables.


Guide to downloading and running this Docker image




Step 1: Install Docker

To download and run this Docker image, you first need to set up Docker on your machine. The easiest way to start with Docker is to install the Docker Toolbox by simply downloading and clicking the installer which is available for both Mac OSX and Windows. For Linux users, follow the instructions here.

Step 2: Download and run the Docker image

Option 1: Through a Command Line Interface (CLI)
The image can be downloaded and executed through the CLI of Docker's Docker Quickstart Terminal in the Docker Toolbox with the following commands:
1. Pull(download) the Docker images:
$ docker pull gitobioinformatics/fastqc
$ docker pull gitobioinformatics/trimmomatic
$ docker pull gitobioinformatics/trinity
$ docker pull gitobioinformatics/bowtie2
2. Change the working directory to a project, and run the following commands:
# Run FastQC
$ docker run -v $PWD:$PWD --rm gitobioinformatics/fastqc
# Run Trimmomatic
$ docker run -v $PWD:$PWD --rm gitobioinformatics/trimmomatic
# Run Trinity
$ docker run -v $PWD:$PWD --rm gitobioinformatics/trinity
# Run Bowtie2
docker run -v $PWD:$PWD --rm gitobioinformatics/bowtie2

Deploy this Docker image onto your cloud

You can download and deploy this Docker image with your cloud provider such as DigitalOcean, Amazon Web Services, HP Enterprise, IBM, Microsoft Azure Cloud or others.



Reproducing a pipeline example



GITO was used to reproduce the pipeline present in the work of Hernández-Fernández (2017), which is the first de novo transcriptome assembly of Eretmochelys imbricate published (https://doi.org/10.1016/j.dib.2017.10.015), instructions used can be accessed here.

License



MIT License. See License.txt (#Texto) for more information
MIT License
Copyright (c) 2018 Marcos Paulo Alves de Sousa, Franklin Gonçalves Sobrinho, Lucas Peres Oliveira, Kayke Correa Conceição Lins Pereira.
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Contributions


GITO is open-source (see License), and we welcome contributions from anyone who is interested in contributing. To contribute code, please make a pull request on github_. The issue tracker for GITO is also available on github.

Meaning and inspiration



The word "gito" is used by populations of the North of Brazil when people refer to something "very small, smaller than normal". The logo of the project was inspired by the Boto Tucuxi (Sotalia fluviatilis), which is one of the smallest dolphins in the Delphinidae family and the only one of this family that lives in rivers. It is a very agile dolphin, although it is an endangered species, it is still possible to find it in the rivers of the Amazon jumping in the air. MORE INFORMATION.

Acknowledgements



We thank the following institutions, which contributed to ensuring the success of our work:
• Ministério da Ciência, Tecnologia, Inovação e Comunicação (MCTIC)
• Museu Paraense Emílio Goeldi (MPEG)
• Centro Universitário do Estado do Pará (CESUPA)

Funding



This work has been supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico - CNPq (grants 149985/2018-5; 129954/2018-7)

Authors
Marcos Paulo Alves de Sousa
Franklin Gonçalves Sobrinho
Lucas Peres Oliveira
Kayke Correa Conceição Lins Pereira

Contact



Dr. Marcos Paulo Alves de Sousa (Project leader)
Email: msousa@museu-goeldi.br
Laboratório de Biologia Molecular
Museu Paraense Emílio Goeldi
Av. Perimetral 1901. CEP 66077- 530. Belém, Pará, Brazil.

© Copyrights GITO. All Rights Reserved