EDGE bioinformatics

EDGE COVID-19

Action

No text

Job Progress

No project loaded
Last check: Not available

EDGE Server Usage
CPU
MEM
DISK

Action
View live log
Force this project to rerun
Reconfig project (BETA)
Interrupt running project
Delete entire project
Empty project outputs
Move to the archive storage
Share project
Unshare project
Make project public
Rename Project
Rerun Pangolin

Metadata Aciton
Update Metadata
Upload to GISAID and NCBI
Upload to NCBI SRA

2.0.0

New to EDGE?

2.4.0

Run EDGE

EDGE bioinformatics

1.0.0

Upload Data

Allowed File types are fastq, fasta, genbank, gff, xlsx and text (txt,config,ini) and can be in gzip format.

Nano-EDGE

Run EDGE for Nanopore dataset.

1.0.0

Project List

sra-EDGE

Run EDGE for Sequence Read Archive

1.0.0

Reports

Create summary tables for multiple projects.

Empowering the Development of Genomics Expertise

EDGE COVID-19 is a tailored bioinformatics platform based on the more flexible and fully open-source EDGE Bioinformatics software (Li et al. 2017). This mini-version consists of a user-friendly GUI that drives standardized workflows for genome reference-based 'assembly' and preliminary analysis of Illumina or Nanopore data for SARS-CoV-2 genome sequencing projects. The result is a final SARS-CoV-2 genome ready for submission to GISAID and/or GenBank.

The default workflow in EDGE COVID-19 includes:

data quality control (QC) and filtering,
alignment of reads to the original (first available) reference genome (NC_045512.2, we removed the PolyA tail from the 3' end (33 nt)),
creation of a consensus genome sequence based on the read alignments, and
a Single Nucleotide Polymorphism and Variant analyses, with some detail such as location and resulting coding differences if any.

The EDGE COVID-19 platform can accommodate Illumina or ONT data, including ONT data from the SARS-CoV-2 ARTIC network sequencing protocols. Users can input/upload Illumina or Nanopore sequencing FASTQ files (and/or download from NCBI SRA). For Illumina data, default analyses include only read QC, read mapping to the reference, and SNP/variant analysis. For ONT data, the data must be demultiplexed prior to uploading; the samples will be processed individually. The SNP/variant calling is not on by default for ONT. However, other functions (e.g. de novo assembly for whole genome data) are also available for both sequencing platforms. While command line execution is possible (see here and here), the GUI provides an easy data submission and results viewing platform, with the graphical and tabular views of variant/SNP data and a genome browser to view read coverage and location of SNPs or variants, as well as the reference annotations.

This light-weight version is a Docker container, able to run on any local hardware infrastructure or in the cloud. We have tested this Docker container on laptops and cloud, using several Illumina (e.g. SRR11177792) and ONT (e.g. SRR11300652) datasets.

Note: For EDGE Bioinformatics users who would also like to use the phylogeny or read- and assembly-based taxonomy classification tools to identify all organisms that may be present within complex samples, we recommend using the original EDGE Bioinformatics platform which harbors several tools and associated (large) databases that enable such a search. In initial tests of taxonomy classification of SARS-CoV-2 samples (with no SARS-CoV-2 genomes in any of the databases), we recover SARS coronavirus and Bat Coronavirus as the nearest neighbor.

Features of EDGE

No need for high-level bioinformaticists
Allow users to address a wide range of use cases including the assembly/annotation and comparison of novel genomes, and the characterization of complex clinical or environmental samples
Focus on accurate and rapid analysis
Enables sequencing as a solution in facilities where human-resources, space, bandwidth, and time are limited

Implementation

EDGE Bioinformatics is built around a collection of publicly available, open-source software packaged or in-house developed tools/algorithms/scripts to process FASTQ data
The EDGE bioinformatics web-based graphic user interface is primarily implemented using the JQuery Mobile javascript framework and HTML5 on the client-side, and implements perl CGI using Apache or Python on the server-side
Due to the involvement of several memory/time consuming steps, we normally recommend computers with at least 8GB memory and 8 CPUs, though we typically use on servers with a minimum of 256GB memory with 64 CPUs

Download & Updates

EDGE COVID-19 Docker image: location and instructions.
The Source code: LANL-Bioinformatics GitHub site.

Tutorial & Help

The detailed user guide can be found in here
The EDGE tutorial video series for the original EDGE platform hosted in Youtube.
User discussion group can be found here or you can contact us at edge-covid19@lanl.gov.

Publication

Chien-Chi Lo, Migun Shakya, Ryan Connor, Karen Davenport, Mark Flynn, Adán Myers y Gutiérrez, Bin Hu, Po-E Li, Elais Player Jackson, Yan Xu, Patrick S G Chain, EDGE COVID-19: A Web Platform to generate submission-ready genomes from SARS-CoV-2 sequencing efforts, Bioinformatics, 2022;, btac176, https://doi.org/10.1093/bioinformatics/btac176

This research was supported by LANL (20200732ER), by DTRA (CB10152 and CB10623) and by the DOE Office of Science (KP160101), through the National Virtual Biotechnology Laboratory, a consortium of DOE national laboratories focused on response to COVID-19, with funding provided by the Coronavirus CARES Act.

Input Your Sample

EDGE requires sequence data files in FASTQ format. EDGE accepts both paired-end and single-end sequence data files. User Guide

The Qiime2 pipeline requires sequence data files in FASTQ format and a mapping file. The sequence file is either paired-end or single-end sequences.Or directory with demultiplexed fastq files. Please see the documentation for more information.

The DETEQT is a pipeline for diagnostic targeted sequencing adjudication. Please see the documentation for more information.

The PiReT is a pipeline for Reference based Transcriptomics analysis. Please see the documentation for more information.

Batch Project Submission

Run EDGE with Multiple projects using a tools set configuration. Click Download [Sample File] to see the example.

Batch Excel File

file

Input Metadata

Virus detail

Virus name

Passage details/history

Sample information

Collection date

Location

Host

Gender

Patient age

Patient status

Sequencing Technology

Choose Processes / Analyses

EDGE provides many modules to do various analyses. You can choose to run or skip a specific process. Parameters/options are provided for most of the analyses. You can click here to turn all on, expand all sections or close all sections.

Taxonomy Classification

Read-based Taxonomy Classification

EDGE will use all reads by default. You can change the behavior to use reads that are unmapped to the reference if Reference-based Analysis is on.

Always Use All Reads Yes No

Classification Tools Run With Following Databases

| additional options |

Splitrim Quality Level

Custom DB Tool Add

Contig-based Taxonomy Classification

Contigs Classification Yes No

Phylogenetic Analysis

EDGE supports 5 pre-computed databases for SNP phylogeny analysis and two tree builders. FastTree is faster and RAxML is slower but more accurate.

Tree Build Method FastTree RAxML

Pre-built SNP DB

Select/Add Genomes or SRA Reads: The same species or at least within the same genus are recommended.

Select Genome(s)

Select A Reference Genome from Selected Genomes

Add Genome(s)

file Add

SRA Accessions

Bootstrap Yes No

Bootstrap Number

Gene Family Analysis

Read-based Gene Family Analysis

EDGE will use ShortBRED to search the reads for Antibiotic Resistance genes from ARDB and Resfams and for Virulence genes from VFDB.

Reads Gene Family Analysis Yes No

Contig-based (CDS) Gene Family Analysis

EDGE will use ShortBRED to search the CDSs on the contigs for Virulence genes from VFDB.

EDGE will use RGI (Resistance Gene Identifier) to search the CDSs on the contigs for Antibiotic Resistance genes from CARD.

CDS Gene Family Analysis Yes No

| additional options |

ShortBRED Minimum Percent Identity

ShortBRED Minimum Percent Length

PCR Primer Analysis

a. Primer Validation

b. Primer Design

Run Primer Validation Yes No

Given a primer file, EDGE will run validation of the primer pair to the reference and/or assembled contigs, as available.

Primer Fasta Sequences

file

Run Primer Design Yes No

EDGE will design primers based on the assembled contigs.

Tm Optimum (C)

Tm Range (C)

Length Optimum (bp)

Length Range (bp)

Background Tm Differential (C)

Number of Primer Pairs

Parameters

Barcode Options

Barcode Fastq File

file

Reads Quality Control and Feature Table Construction

Quality Offset Phred+33 Phred+64

Quality Control Method DADA2 Deblur

Trim 5'end Forward

Trim 5'end Reverse

Truncation Len Forward

Truncation Len Reverse

SE Trim Length

PHRED Quality

Max "N" base

Min Per Read Length Fraction

SE Truncation Len

Sampling

Sampling Depth

Auto-Adjust Sampling Depth Yes No

Parameters

Platform Illumina Nanopore

Mode Paired-End Single-End

Quality Calculation Cutoff

Depth Filter

| additional options |

Read Length Filter

The following parameters will affect how the Quality Calcuation derived. The four weight parameters should sum up to 1. Mouse over the label to see the notes.

Expected Coverage

Expected Identity

Expected BaseQ

Expected Mapping Quality

Coverage Weight

Identity Weight

BaseQ Weight

MapQ Weight

Parameters

Required arguments

Kingdom Prokarya Eukarya Both

Prokaryotic Reference Fasta

file

Prokaryotic Reference GFF

file

Eukaryotic Reference Fasta

file

Eukaryotic Reference GFF

file

Optional arguments

Strandedness Not Stranded Forward Reverse

Method

HISAT2 index file

file

Q-value

Warning

Select a file

Proceeding action

Cancel Confirm

Live log

Auto-scroll

The log is not available at this moment.

Log Out
Update Profile
System
Clean-up MyUploads

Login to EDGE

Email Address Password Remember my email

Forgot your password? Reset it here!

New to EDGE? Sign up now!

Contiue as GUEST: GO!

Login Failed

Did you enter the right credentials?

Try again

MyUploads Files

Delete Cancel

System Properties

Update Cancel

User Profile

First Name Last Name New Password Update Cancel

Upload files

Max file size is 1gb and total user space up to 1gb. Allowed File types are fastq, fasta, genbank, gff, xlsx and text (txt,bed,config,ini) and can be in gzip format. Files will be kept for 7 days.

You are not logged in or your browser doesn't have Flash, Silverlight or HTML5 support.

Alternative uploading methods

Web uploader is designed to upload small files. When the sizes of uploading files are >1000MB, please use one of following options:

Directly copy files to user's MyUploads directory from the host OS (mounted EDGE_input directory while docker run):

EdgeSite

Please fill out this form before using 'Run EDGE'.

Organization Full Name

Acronym

Location

Enable Sample Metadata? Yes No

Auto Submit Sample Metadata/Pathogens? Yes No

Questions? Please contact us at edge-covid19@lanl.gov.

EDGE COVID-19

Reportsclick to expand contents

Projectsclick to expand contents

Empowering the Development of Genomics Expertise

Features of EDGE

Implementation

Download & Updates

Tutorial & Help

Publication

Input Your Sample

Input Raw Reads

Batch Project Submission

Input Metadata

Choose Processes / Analyses

Pre-processing

Assembly and Annotation

Reference-Based SARS-CoV-2 Genome Analysis

Taxonomy Classification

Phylogenetic Analysis

Gene Family Analysis

PCR Primer Analysis

Parameters

Parameters

Parameters

Warning

Select a file

Proceeding action

Live log

Login to EDGE

Login Failed

MyUploads Files

System Properties

User Profile

Upload files

Alternative uploading methods

EdgeSite

EDGE COVID-19

Reportsclick to expand contents

Projectsclick to expand contents

Empowering the Development of Genomics Expertise

Features of EDGE

Implementation

Download & Updates

Tutorial & Help

Publication

Input Your Sample

Input Raw Reads

Batch Project Submission

Input Metadata

Choose Processes / Analyses

OffOn Pre-processing

OffOn Assembly and Annotation

OffOn Reference-Based SARS-CoV-2 Genome Analysis

OffOn Taxonomy Classification

OffOn Phylogenetic Analysis

OffOn Gene Family Analysis

OffOn PCR Primer Analysis

Parameters

Parameters

Parameters

Warning

Select a file

Proceeding action

Live log

Login to EDGE

Login Failed

MyUploads Files

System Properties

User Profile

Upload files

Alternative uploading methods

EdgeSite

Pre-processing

Assembly and Annotation

Reference-Based SARS-CoV-2 Genome Analysis

Taxonomy Classification

Phylogenetic Analysis

Gene Family Analysis

PCR Primer Analysis