Week 1 — Extended Module

Week 1 — Extended Module

Week 1 — Extended Module

A hands-on, step-by-step journey from the fundamentals to applied analysis. Trainees learn how to retrieve and clean real sequence data, run core DNA/protein analyses, build high-quality alignments, and generate publication-ready outputs. Each week delivers ~15 training hours (internationally accredited, AInTP) with practical exercises and instructor feedback.

 

Week 1 — Extended Module

What you gain (brief): A single-sample end-to-end pipeline—data review, database records, quality control, simple assembly, and ORF finding.

  • Day 1: The World of Bioinformatics (5 hours)

      • Lecture (2 hours):
        • What is bioinformatics? Core concepts and applications.
        • Understanding DNA, RNA, and Protein: The central dogma of molecular biology.
        • Introduction to sequence analysis, structural bioinformatics, and genomics.
      • Practical (3 hours):
        • Becoming an expert with PubMed: Searching for literature, using filters, and managing references.
        • Introduction to major bioinformatics portals like NCBI and ExPASy.

     

    Day 2: Introduction to Biological Databases (5 hours)

      • Lecture (2 hours):
        • Types of Databases: Primary (GenBank, DDBJ, EMBL) vs. Secondary (UniProtKB/Swiss-Prot, RefSeq).
        • Understanding database entries: Reading GenBank and UniProtKB/Swiss-Prot records.
      • Practical (3 hours):
        • Retrieving DNA and protein sequences using accession numbers and keywords.
        • Exploring the cross-references and links within database entries.
        • Guided tour of NCBI and ExPASy websites.
        • Task: Find the human insulin protein sequence and its corresponding gene sequence.

    Day 3: Working with Individual Sequences (5 hours)

      • Lecture (2 hours):
        • Sequence quality control: Identifying and removing vector contamination.
        • DNA composition: G+C content, word counts, and finding repeats.
        • Finding protein-coding regions: Open Reading Frames (ORFs) and GeneMark.

     

      • Practical tasks (3 hours):
        • Using VecScreen to check for vector contamination.
        • Analyzing DNA composition with online tools.
        • Predicting ORFs using ORF Finder.

  • Molecular biology/proteomics researchers.

  • R&D teams and drug-discovery interns.

  • Anyone annotating or characterizing novel proteins.