interpro.rst 3.4 KB

1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950515253545556575859606162636465666768697071727374757677787980
  1. Adding InterProScan Results
  2. ===========================
  3. .. note::
  4. Remember you must set the ``$DRUPAL_HOME`` environment variable if you want to cut-and-paste the commands below. See :doc:`../../../prereqs/drupal_home`
  5. For this tutorial, these results were obtained by using a local installation of InterProScan installed on a computational cluster. However, you may choose to use Blast2GO or the online InterProScan utility. Results should be saved in ``XML`` format.
  6. What is InterProScan?
  7. ---------------------
  8. To learn more about InterProScan, please visit https://www.ebi.ac.uk/interpro/
  9. Create the Analysis Page
  10. -------------------------
  11. .. note::
  12. It is always recommended to create an analysis page anytime you import data. The purpose of the analysis page is to describe how the data being added was derived or collected.
  13. Tripal defines the **InterPro Results** Bundle, which is a specific type of Chado analysis. Create a new record by going to ``Content -> Tripal Content -> Add Tripal Content --> InterPro Results``.
  14. Fill out the fields as described in the table below.
  15. .. csv-table::
  16. :header: "Field", "Value"
  17. "Name", "InterPro Annotations of C. sinensis v1.0"
  18. "InterPro Program", "InterProScan"
  19. "InterPro Version", "4.8"
  20. "Date Performed", "Current Date"
  21. "Data Source Name", "C. sinensis v1.0 mRNA"
  22. "Data Source Version", "v1.0"
  23. "Data Source URI", "n/a"
  24. "Description", "Materials & Methods: C. sinensis mRNA sequences were mapped to IPR domains and GO terms using a local installation of InterProScan executed on a computational cluster. InterProScan date files used were MATCH_DATA_v32, DATA_v32.0 and PTHR_DATA v31.0."
  25. Press the **Save** button.
  26. Import the InterProScan XML results
  27. ------------------------------------
  28. Next, we will load InterProScan results for our citrus gene. To do this, navigate to **Tripal > Data Loaders > Chado InterProScan XML results loader**. The following page will be presented:
  29. .. image:: interpro1.png
  30. The top section of this page provides multiple methods for providing results file: via an upload interface, specifying a remote URL or a file path that is local to the server. Most likely, you will always upload or provide a remote URL. However, we downloaded the files earlier, and stored them here: ``$DRUPAL_HOME/sites/default/files``. So, in this case we can use the path on the local server. Provide the following value for this form:
  31. .. csv-table::
  32. :header: "Field", "Value"
  33. "Server path", "sites/default/files/Citrus_sinensis-orange1.1g015632m.g.iprscan.xml"
  34. "Analysis", "InterPro Annotations of C. sinensis v1.0"
  35. 'Load GO terms to the database', 'unchecked'
  36. "Query Name RE", ""
  37. "Use Unique Name", "unchecked"
  38. "Query Type", "mRNA"
  39. In order for GO terms to be imported, the Gene Ontology must be loaded on your site: for this tutorial, we leave the box unchecked.
  40. .. note::
  41. For the **Server path** we need not give the full path. Because we downloaded the files into the Drupal directory we can leave off any preceding path and Tripal will resolve the path. Otherwise we could provide the full path.
  42. Clicking the **Import InterProScan file** will add a job which we can manually execute with the following command:
  43. ::
  44. drush trp-run-jobs --username=administrator --root=$DRUPAL_HOME
  45. After the job is run, our InterPro field will be populated on the mRNA page with an annotation diagram:
  46. .. image:: interpro2.png