1 / 24

Parsing BLAST output

Parsing BLAST output. Output of a local BLAST search. “less” program. Full path to the BLAST output file. BLAST program used for the search. Reference . Information of the query sequence . Information of the database. One-line summary of the search results.

lei
Download Presentation

Parsing BLAST output

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Parsing BLAST output

  2. Output of a local BLAST search “less” program Full path to the BLAST output file

  3. BLAST program used for the search Reference Information of the query sequence Information of the database One-line summary of the search results Detailed information for the first 2 hsps of the first hit: Accession number, description, organism, score, E value, identities, positives, and alignment

  4. Sample BLAST output (continued) Hsp information from the first hit

  5. Press “q” to quit the “less” viewing mode

  6. The size of the BLAST output is limited only by the free disk space you have in your computer. It’s virtually impossible to open a large text file. Let alone going through the file line by line. The purpose of parsing BLAST output is to extract user-defined information from the BLAST output file for clear visualization and summarization.

  7. Search result parsing The Bio::SearchIOsystem was designed for parsing sequence database searches (BLAST, sim4, waba, FASTA, HMMER, exonerate, etc.)

  8. One-line summary of the search results Load Bio::SearchIO module Usage information It will appear if the program is invoked without arguments Define the class Print out the header information Process each result

  9. Process each hit Process each HSP Control for the number of hits to be extracted Indicator showing the work is done

  10. Change directory (cd) to where the perl script and the BLAST output file are stored Confirm that the perl script and the BLAST output are in place

  11. Oops… an error message It’s due to Windows and Unix compatibility.

  12. Find the file in Windows system and open it with Notepad++

  13. Select “convert to UNIX format” in the “Format” drop-down menu After the conversion, save the file and exit Notepad++

  14. Another error message This is because the perl interpreter has been installed in another location (/usr/bin/) while the script is looking for the perl interpreter in /usr/local/bin

  15. Now it’s working ! Solution: Create a symbolic link of /usr/bin/perl in /usr/local/bin Command: ln<space>-s<space>/usr/bin/perl<space>/usr/local/bin/perl

  16. This is the file you’ve just generated. Congratulation! You’ve just parsed a BLAST output!

  17. Let’s see how the file looks like, using “less”.

  18. Here is how it looks like. The parsed output is tab-delimited and can be imported into Excel for better visualization.

  19. Locate the file in Windows system

  20. Header row Query sequence Accession numbers of the top 3 hits E values of the top 3 hits Descriptions of the top 3 hits Information of each HSP of the top 3 hits

More Related