PERL and BioPERL are the programming languages that are very important for Biological researchers who are working in computational biology. The programs provide the basic knowledge in both theory and practical applications. The introduction provides the basics on these programming languages and the execution in windows.
Table of Contents
Chapter 1: PERL
Chapter 2: PERL installation and execution (windows)
Program1: Concatenation
Program 2: Reverse complement
Program 3: Game demonstrating artificial intelligence
Program 4: DNA Execution using package
Program 5: Array
Program 6: Reference variable
Program 7: String at specific location
Program 8: Arrow operator
Program 9: Hash
Program 10: Motif search using subroutines
Program 11: Subroutines using array
Program 12: 2D Matrix
Program 13: Matrix using for loop
Program 14: Transcription
Program 15: ORF Search
Program 16: Regular expression
Program 17: Pattern Search
Program 18: Mutation
Program 19: Mutation Percent
Program 20: Translation program using PERL
Program 21: Translation using Beginperlbioinfo.pm module
Program 22: Counting number of bases
Program 23: Calculating GC%
Program 24: Writing Geneticcode.pm
Program 25: Translation using Geneticcode.pm
Program 26: Calculating the length, total nucleotides and GC and AT counts
Chapter 3: CPAN
Chapter 4: BioPERL
Program 27: Creating a BioPERL module and program for translation
Program 28: Translation using BioPERL
Data 1: Blast annotated file
Program 29: Parsing BLAST
Program 30: Run query sequence from remote BLAST
Program 31: Conversion of sequence to FASTA format
Program 32: Minimum sequences in string
Program 33: Shorten IDs in FASTA format from databases
Program 34: Motif search
Program 35: Simple alignment of sequences
Program 36: Restriction enzymes
Objectives and Topics
This guide aims to provide a practical introduction to using PERL and BioPERL for bioinformatics tasks. It focuses on equipping researchers and students with the necessary programming skills to extract, manipulate, and analyze biological sequence data through a series of concrete, executable code examples.
- Fundamental PERL syntax, data types, and control structures for bioinformatics.
- Methods for processing and analyzing biological sequences (e.g., GC content, translation, reverse complement).
- Effective use of CPAN modules to extend functionality.
- Practical application of BioPERL for sequence analysis, parsing BLAST results, and alignment manipulation.
Excerpt from the Book
Program 1: Concatenation
Variables are denoted with symbol $. Concatenation is the process of joining two or more strings (in variables) into a common string. In the present program, the two strings (or set of characters) have been combined into single string through various processes. Joining of strings can be also done using dot (.) operator.
Program
$DNA1 = 'ATGTACACTAC';
$DNA2 = 'GATCATGT';
$DNA3 = "$DNA1$DNA2";
print "first string: $DNA1 \n";
print "second string: $DNA2 \n";
print "concatenation of two strings (version 1):\n\n";
print "$DNA3\n\n";
$DNA3 = $DNA1 . $DNA2;
print "concatenation of two strings (version 2):\n\n";
print "$DNA3\n\n";
print "concatenation of two strings (version 3):\n\n";
print $DNA1, $DNA2, "\n";
exit;
Summary of Chapters
Chapter 1: PERL: This chapter introduces the basic concepts, syntax, data types, and operators in PERL programming, laying the foundation for sequence analysis.
Chapter 2: PERL installation and execution (windows): This chapter provides a step-by-step guide to installing PERL on Windows and includes numerous programs demonstrating how to perform bioinformatics tasks like DNA manipulation, motif searching, and matrix operations.
Chapter 3: CPAN: This chapter explains the Comprehensive Perl Archive Network (CPAN), its purpose in managing and distributing Perl modules, and how it benefits programmers.
Chapter 4: BioPERL: This chapter covers the BioPERL toolkit, its object-oriented structure, and presents advanced programs for sequence alignment, parsing BLAST data, and managing restriction enzymes.
Keywords
PERL, BioPERL, Bioinformatics, Programming, DNA, RNA, Sequence Analysis, Scripting, CPAN, BLAST, Translation, Motif Search, Modules, Algorithms, Data Processing
Frequently Asked Questions
What is the fundamental purpose of this guide?
The guide is designed to help researchers and students learn basic PERL and BioPERL programming specifically for the purpose of biological sequence analysis, providing practical, step-by-step examples.
What are the primary thematic areas covered in the book?
The book covers fundamental PERL programming, sequence manipulation techniques (such as transcription and translation), the use of CPAN for module management, and the specific application of BioPERL for bioinformatics workflows.
What is the primary goal of the presented programming approach?
The primary goal is to provide executable code snippets that allow users to automate the extraction, analysis, and reporting of biological data found in text-based sequence files.
Which scientific programming methods are emphasized?
The guide emphasizes procedural programming in PERL, the usage of Regular Expressions for pattern matching, and object-oriented programming techniques within the BioPERL framework.
What is covered in the main section of the book?
The main section consists of numerous practical programs ranging from basic string manipulation (like concatenation and reverse complement) to more complex bioinformatics tasks such as ORF searching, parsing BLAST results, and simple sequence alignment.
How are the key concepts of this work best characterized?
The work is best characterized by its focus on "learning by doing," utilizing concrete biological datasets to demonstrate how PERL scripts can be used to solve specific research problems in bioinformatics.
How does the book guide the user in setting up their environment?
Chapter 2 specifically addresses the setup of the PERL environment on Windows, guiding the user from software download and path configuration to executing their first scripts.
How does the guide explain the use of BioPERL for sequence parsing?
The guide utilizes specific modules like Bio::SearchIO to parse BLAST output, demonstrating how to extract and filter specific biological information, such as hits meeting a certain length or percentage identity threshold.
- Citation du texte
- Kaladhar DSVGK (Auteur), 2014, Basics in PERL and BioPERL, Munich, GRIN Verlag, https://www.grin.com/document/267659