Tripal BLAST Module Description

This module provides a basic interface to allow your users to utilize your server's NCBI BLAST+.

Setup | Functionality | Large jobs | Genome visualization

Setup Instructions

  1. Install NCBI BLAST+ on your server (Tested with 2.2.26+). There is a package available for Ubuntu to ease installation. Optionally you can set the path to your BLAST executable in the settings.
  2. Optionally, create Tripal External Database References to allow you to link the records in your BLAST database to further information. To do this simply go to Tripal> Chado Modules > Databases > Add DB and make sure to fill in the Database prefix which will be concatenated with the record IDs in your BLAST database to determine the link-out to additional information. Note that a regular expression can be used when creating the BLAST database to indicate what the ID is.
  3. Create "BLAST Database" nodes for each dataset you want to make available for your users to BLAST against. BLAST databases should first be created using the command-line makeblastdb program with the -parse_seqids flag.
  4. It's recommended that you also install the Tripal Job Daemon to manage BLAST jobs and ensure they are run soon after being submitted by the user. Without this additional module, administrators will have to execute the tripal jobs either manually or through use of cron jobs.

Highlighted Functionality

Protection Against Large Jobs

Depending on the size and nature of your target databases, you may wish to constrain use of this module.
  1. Limit the number of results displayed via admin page. The recommended number is 500.
  2. Limit the maximum upload file size in php settings. This is less useful because some very large queries may be manageable, and others not.
  3. Repeat-mask your targets, or provide repeat-masked versions. Note that some researchers may be looking for repeats, so this may limit the usefulness of the BLAST service.

Whole Genome Visualization

This module can be configured to use CViTjs to display BLAST hits on a genome image.

CViTjs Setup

  1. Download CViTjs and copy the code to your webserver. It needs to be placed in [your drupal root]/sites/all/libraries. To download, execute the git command inside the libraries/ directory:
    git clone https://github.com/LegumeFederation/cvitjs.git
  2. CViTjs will have a config file in its root directory named cvit.conf. This file provides information for whole genome visualization for each genome BLAST target. Make sure the config file can be edited by your web server.
  3. Enable CViTjs from the BLAST module administration page.
  4. Edit the configuration file to define each genome target. These will look like:
    [data.Cajanus cajan - genome]
    conf = data/cajca/cajca.conf
    defaultData = data/cajca/cajca.gff
    Where:
    At the top of the configuration file there must be a [general] section that defines the default data set. For example:
    [general]
    data_default = data.Cajanus cajan - genome
  5. Edit the nodes for each genome target (nodes of type "BLAST Database") and enable whole genome visualization. Remember that the names listed in the CViTjs config file must match the BLAST node name. In the example above, the BLAST database node for the Cajanus cajan genome assembly is named "Cajanus cajan - genome"

Notes