Nick Crawford Evolution and more…

16Dec/090

40 Essential Tools and Resources to Visualize Data | FlowingData

This looks incredibly useful.  I really need to sit down and learn Flash and Processing.

40 Essential Tools and Resources to Visualize Data | FlowingData.

Filed under: Blog, Software No Comments
22Nov/090

Blue Collar Bioinformatics

Just wanted to recommend Blue Collar Bioinformatics a slick blog with lots of useful bioinformatics scripts.  Everything is written in python and the full working source is typically available on GIT.

18Nov/090

Academia vs. Business via XKCD

So true.

Filed under: Blog, Software No Comments
28Oct/092

F$@%ing R: Adventures with Tcltk in OSX

RlogoI've got a bunch of RNA-seq reads I need to analyze and for the the most part I've been writing my own code to do the analysis.  However, a recent paper in BioInformatics (Wang et al. 2009) describes a new R package for the identification of differentially expressed genes in RNA-seq datasets.  R is a pretty straightforward language with a built-in installation system so I should just have to type two lines of code...

source("http://bioconductor.org/biocLite.R")
biocLite("DEGseq")

Not so quick. When I ran this code R tells me it can't find the DEGseq library. A bit more poking around on the internets and I discover that there's an alternate download site:

source("http://bioinfo.au.tsinghua.edu.cn/software/degseq/DEGseqInstall.R")

But after installing some dependancies it also spits out a bunch of errors.  I compare the errors... Hmmm... Both installs appear to by dying on the tcl/tk install, but tcltk is a default R library.  I can see it right there in "/Library/Frameworks/R.framework/Resources/library".  Two hours later and after trying a bunch of crap I find this helpful website:

http://cran.r-project.org/bin/macosx/tools/

A quick and dirty install of the tcltk-8.5.5-x11.dmg and now "library(tcltk") works like a charm.  No errors.

I install DEGseq with the following set of commands:

source("http://bioconductor.org/biocLite.R")
biocLite("DEGseq")

Now, a day an a half later I can see if it's useful. Woo.

Citations:

L Wang, Z Feng, X Wang, X Wang, X Zhang. DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics (2009)

5Mar/090

Textmate Nexus Bundle

textmate-12-14-07I wrote a very simple textmate bundle for working with nexus files.

Version 1 Functionality: folds NEXUS blocks, highlighting bayes block mcmc line and typing command-B will automatically calculate burnin at 25%, more to come as I think of it.... probably contains bugs

Download nexus bundle version 1. (03/05/09)

Filed under: Blog, Software No Comments
16Feb/092

New measures of genetic differentiation

Lou Jost has pretty neat paper out describing new measures genetic diversity that accurately account for allelic differences.  I've written some PYTHON scripts to parse genepop files, measure allele frequencies, and generate [some of] these measures of diversity.  Enjoy.

http://www.ngcrawford.com/django/jost/

Jost L. 2008. GST and its relatives do not measure differentiation. Molecular Ecology,17(18), 4015 - 4026

Filed under: Blog, Software 2 Comments