Difference between revisions of "User:Boris/Temp/APB"

From "A B C"
Jump to navigation Jump to search
m
m
Line 2: Line 2:
  
  
;Misc. Principles
+
;Hardware
* [[Autonomous agent]]
+
* [[High Performance Computing]] (... at the bench: GPUs, FPGAs, Clusters)
* [[Bioinformatics foundations]]
 
* [[High Performance Computing]]
 
 
* [[Cloud computing]]
 
* [[Cloud computing]]
* [[Data integration]]
 
 
; Algorithms
 
* [[Dijkstras Algorithm]]
 
* [[Floyd Warshall Algorithm]]
 
* [[APB Functional annotation]]
 
  
 
; Tools
 
; Tools
Line 19: Line 11:
 
* [[Unix automation]] shellscripts, cron
 
* [[Unix automation]] shellscripts, cron
 
* [[Unix wget]]
 
* [[Unix wget]]
* [[Tools for the bioinformatics lab]] (APB, CLUSTAL, EMBOSS, PHYLIP, T-Coffee, HMMER)
+
* [[Network Configuration]]
 
* [[Apache]] (serving a local file)
 
* [[Apache]] (serving a local file)
 
* [[MySQL]] (introduction, installation, MariaDB)
 
* [[MySQL]] (introduction, installation, MariaDB)
Line 27: Line 19:
 
** [[Database example-PHP form interface]]
 
** [[Database example-PHP form interface]]
 
** [[DBI Mac OS X installation notes]]
 
** [[DBI Mac OS X installation notes]]
 +
* [[Tools for the bioinformatics lab]] (APB, CLUSTAL, EMBOSS, PHYLIP, T-Coffee, HMMER)
 
* [[GBrowse]] (installation notes, viewing annotations, LDAS installation notes, LDAS usage)
 
* [[GBrowse]] (installation notes, viewing annotations, LDAS installation notes, LDAS usage)
  
 
+
;Programming
;[[Programming]]
 
 
* [[IDE|IDE (Integrated Development Environment)]] (Komodo)
 
* [[IDE|IDE (Integrated Development Environment)]] (Komodo)
 
* [[Regular Expressions]]
 
* [[Regular Expressions]]
 
* [[Screenscraping]]
 
* [[Screenscraping]]
 
* [[Perl]] (installation notes, CPAN, principles, syntax, one liners)
 
* [[Perl]] (installation notes, CPAN, principles, syntax, one liners)
** [[Perl basic programing]]
+
** [[Perl basic programming]]
 
** [[Perl hash example]]
 
** [[Perl hash example]]
 
** [[Perl LWP example]]
 
** [[Perl LWP example]]
Line 57: Line 49:
 
** [[BioPerl exercise signal cleavage]]
 
** [[BioPerl exercise signal cleavage]]
 
* [[PHP]]
 
* [[PHP]]
 +
* [[BioPython]] (scope, highlights, installation, use, support)
 +
* [[Graphical output]] (PNG and SVG)
 +
* [[Autonomous agents]]
  
;Network, Web and Communication
+
; Algorithms
 +
:; Algorithms on Sequences
 +
* [[Dynamic Programming]] (Dynamic Programming and Optimal Pairwise Sequence Alignment, appropriate problems for DP, procedural and recursive formulation of solutions)
 +
* [[Multiple Sequence Alignment]]
 +
* [[Genome Assembly]] (long and short reads)
 +
:; Algorithms on Structures
 +
* [[Docking]]
 +
* [[Protein Structure Prediction]] (''ab initio'')
 +
:; Algorithms on Trees
 +
* [[Computing with trees]] (Bayesian approaches for phylogenetic trees, tree comparison)
 +
:; Algorithms on Networks
 +
* [[Network metrics]] (Degree distributions, Centrality metrics, other metrics on topology, small-world- vs. random-geometric controversy)
 +
** [[Dijkstras Algorithm]]
 +
** [[Floyd Warshall Algorithm]]
 +
 
 +
;Communication and collaboration
 
* [[MediaWiki]]
 
* [[MediaWiki]]
 
* [[Web Communication]]
 
* [[Web Communication]]
 
* [[HTML essentials]]
 
* [[HTML essentials]]
 +
* [[HTML 5]]
 +
* [[SADI|SADI Semantic Automated Discovery and Integration]]
 
* [[CGI]] (introduction, configuring apache, executing a script, query string input)
 
* [[CGI]] (introduction, configuring apache, executing a script, query string input)
* [[Network Analysis]]
 
* [[Network Configuration]]
 
 
  
 +
; Statistics
 +
* [[Pattern discovery]]
 +
* [[Correlation]] (Covariance matrices and their interpretation, application to large problems, collaborative filtering, MIC and MINE)
 +
* [[Clustering methods]] (Algorithms and choice (including: hierarchical, model-based and partition clustering, graphical methods (MCL), flow based methods (RRW) and spectral methods). Implementation in R if possible)
 +
* [[Cluster metrics]] (Cluster quality metrics (Akaike, BIC)–when and how)
 +
* [[Map equation|The Map Equation]] (A network flow approach to hierarchical partitioning of large datasets)
 +
* [[Machine learning]] (Classification problems: Neural Networks, HMMs, SVM..)
 +
* [[R]]
 +
** [[R plotting]]
 +
** [[R programming]] (Scope, passing parameters, scalars, matrices, lists, functions)
 +
** [[R EDA]]
 +
** [[R regression]] (linear and non-linear, Calculation, confidence limits and interpretation)
 +
** [[R PCA]]
 +
** [[R Clustering]]
 +
** [[R Classification]] (Phrasing inquiry as a classification problem, dealing with noisy data, machine learning approaches to classification, implementation in R)
 +
** [[R hypothesis testing]] (Major approaches and when to apply them (including simulation methods for arbitrary PDFs))
 +
** [[Bioconductor]] (Scope, contents, highlights use)
  
 +
;Applications
 +
* [[Data integration]] (BioMart: Biodata integration, and data-mining of complex, related, descriptive data)
 +
* [[Text mining]] (Use cases, tasks and metrics, taggers, vocabulary mapping, Practicals: R-support, Python/Perl support, others...)
 +
* [[High-throughput sequencing]]
 +
* [[Functional annotation]] (GFF)
 +
* [[Microarray analysis]] (... in R: differential expression and multiple testing; Loading and normalizing data, calculating differential expression, LOWESS, the question of significance, FWERs: Bonferroni and FDR; SAM and LIMMA)
  
 
</div>
 
</div>

Revision as of 02:35, 15 September 2012


Hardware
Tools
Programming
Algorithms
Algorithms on Sequences
Algorithms on Structures
Algorithms on Trees
Algorithms on Networks
Communication and collaboration
Statistics
  • Pattern discovery
  • Correlation (Covariance matrices and their interpretation, application to large problems, collaborative filtering, MIC and MINE)
  • Clustering methods (Algorithms and choice (including: hierarchical, model-based and partition clustering, graphical methods (MCL), flow based methods (RRW) and spectral methods). Implementation in R if possible)
  • Cluster metrics (Cluster quality metrics (Akaike, BIC)–when and how)
  • The Map Equation (A network flow approach to hierarchical partitioning of large datasets)
  • Machine learning (Classification problems: Neural Networks, HMMs, SVM..)
  • R
Applications
  • Data integration (BioMart: Biodata integration, and data-mining of complex, related, descriptive data)
  • Text mining (Use cases, tasks and metrics, taggers, vocabulary mapping, Practicals: R-support, Python/Perl support, others...)
  • High-throughput sequencing
  • Functional annotation (GFF)
  • Microarray analysis (... in R: differential expression and multiple testing; Loading and normalizing data, calculating differential expression, LOWESS, the question of significance, FWERs: Bonferroni and FDR; SAM and LIMMA)