Chris Miller

Bioinformatics Grad student at Baylor College of Medicine. My online home is at http://www.chrisamiller.com/
How to get a significant correlation value by moving just one point around. http://bl.ocks.org/4731053
Staff Scientist and Software Developer - The Genome Institute at Washington University Saint Louis - http://www.biostars.org/p...
We currently have several openings at TGI. One fis or a statistically-oriented staff scientist, and two for more traditional software developers (though bioinformatics experience is a plus!). Though the job descriptions are often boring, the work is anything but! We're doing kick-ass genomic research on cancer and other human disease and building software pipelines designed to make biological insights from petabyte-scale data. Apply through the links below, but feel free to message me with questions and I'll either answer them or put you in touch with someone who can. Staff Scientist https://jobs.wustl.edu/psp... Software Developers (2) https://jobs.wustl.edu/psp...... - Chris Miller
Listing of US Programs in Bioinformatics - http://www.biostars.org/p...
Mark Gerstein is putting together a list of US programs in bioinformatics here and is soliciting additions: http://blog.gerstein.info/2013... I figured that if anyone can help him expand that list, it's the Biostar community. - Chris Miller
A: Makefile-driven workflows and Bioconductor objects - http://www.biostars.org/p...
Someone pointed me to Drake the other day. I haven't used it, but it seems appropriate for a lot of bioinformatics tasks: http://blog.factual.com/introdu... - Chris Miller
C: C: C: C: C: C: C: C: C: C: C: A: Spam in RSS feeds - http://www.biostars.org/p...
Just be careful - don't want to discourage either new bioinformatics folks or ESL users - Chris Miller
C: C: C: A: Spam in RSS feeds - http://www.biostars.org/p...
FYI, a spam post still got through to the RSS last night. (id 62389). Just another data point, in case you hadn't noticed - Chris Miller
RT @tlwriter: Of 79 residential nbhoods in STL City, 31 are either 90+% white or 90+% Af-American. 15 more are 80%+ one or the other. (cont...) #stlmayor
RT @Massgenomics: Probably the most impressive academic lab web site I've seen is that of the Stam Lab at the University of Washington: http://www.stamlab.org/
C: C: C: A: Spam in RSS feeds - http://www.biostars.org/p...
I feel like the 4-hour delay is a little long. In the interest of timely answers, can we cut it to 2 hours? Do we have enough international coverage with our moderators to cover spam deletion that quickly when it crops up at odd hours? - Chris Miller
"Verbing weirds language"
C: C: A: Spam in RSS feeds - http://www.biostars.org/p...
Moderator approval of the first post seems reasonable to me, as long as it can trigger an email notification or something. I'm happy to approve some posts if it means cutting down on this nonsense. If Istvan's new countermeasures don't work out, we could give it a shot. - Chris Miller
"Boy Scouts close to ending ban on gays" Awesome news if true. Now how long will it take the BSA to accept atheists? http://usnews.nbcnews.com/_news...
Awesome news if it pans out. Now how long will it take the BSA to accept atheists? http://usnews.nbcnews.com/_news...
C: C: A: Length of Read Needed to Confidently Map Sequence - http://www.biostars.org/p...
Right. This calculation used single-end reads, so the numbers will be lower than what you can get from paired-end reads, using that extra information. - Chris Miller
A: how and where to retrieve the cnv information from the breakdancer results - http://www.biostars.org/p...
Breakdancer doesn't call copy number variants. It calls structural variants. If you see a large deletion in breakdancer, it may be the case that a copy number deletion has occurred. It's also possible that the deleted sequence got re-integrated somewhere else, in which case, there would be no effect on copy number. - Chris Miller
QOTD: "Twitter makes it so hard not to accidentally be an asshole"
WTF is wrong with these people? "New Mexico Bill Would Criminalize Abortions After Rape As 'Tampering With Evidence" http://www.huffingtonpost.com/2013...
RT @neiltyson: In 5-billion yrs the Sun will expand & engulf our orbit as the charred ember that was once Earth vaporizes. Have a nice day.
RT @MayorSlay: Soulard is a neighborhood that has earned its status as a premier residential and entertainment district. #fgs
A: How are sequencing error rates defined? - http://www.biostars.org/p...
It may depend on the source, but in my experience, it's defined as the percentage of bases that are incorrectly called. The 0.8% error rate that you describe would mean that of every 1000 bases coming off the sequencer, 8 of them will report the incorrect base. - Chris Miller
That's just tuition. I honestly cannot see how spending 250k+ on undergrad increases your earning power enough to make any sense.
"Undergraduate tuition at WUSTL will be $44,100 for the 2013-14 academic year — a $1,600 (3.8 percent) increase over 2012"
A: How to interpret the output generated by the calc-bmr Music by WashU - http://www.biostars.org/p...
I'm not sure I entirely understand your question, but there doesn't appear to be anything wrong with the output you posted. It indicates that you have 7 mutations in LATS1, all SNVs. If you had 2 indels and 2 SNVs, the total mutations would still be 7. According to the FDR values, this gene is mutated more frequently than would be expected, based on the background mutation rate in the samples. - Chris Miller
RT @CcSteff: Date night! (We're going to find a parking lot, have a quickie in the back seat and then sleep for an hour before going to pick up the kid.)
"I've had a pretty good success facing Stan (Musial) by throwing him my best pitch and backing up third base." --Carl Erskine RIP, The Man
C: What is an ideal feature (gene, exons or transcripts) to summarize RNASeq data ? - http://www.biostars.org/p...
There isn't any single answer to this question. It all depends on what kind of biological question you're trying to answer with the RNAseq data. If I'm looking for differential exon usage due to spliceosome mutations, gene-level data is useless to me. If I'm trying to work with a huge network of genes, I may need to simplify my inputs and use gene-level metrics to make the problem tractable. - Chris Miller
C: C: A: Structural breakpoint frequency - http://www.biostars.org/p...
No worries, man. If I had a dollar for every time something turned out to be way harder than I expected... - Chris Miller
A: Structural breakpoint frequency - http://www.biostars.org/p...
a) I don't know of any off-the shelf tools, b) this is generally difficult, because breakpoint-spanning reads may map to one side or the other, but also may not map at all. To get a high confidence call, you could create a short contig containing your breakpoint sequence +/- 200 bp, append it to the reference genome, then realign all of your reads against this. Compare the depth of breakpoint spanning reads on your contig to the depth at the breaks on the original reference sequence, and that ratio will give you a pretty good idea of the frequency. - Chris Miller
C: C: Bioinformaticians / Computational Biologists wanted - distance working - http://www.biostars.org/p...
Even if it was a missing zero typo, 3200 a month would be 38,400 a year, which is less than a postdoc salary. - Chris Miller
C: Cancer Research - http://www.biostars.org/p...
I'm assuming POA = "Plan of Attack". You're going to have to be a lot more specific about what you're asking in order to get any help here. I'm closing this post. If you have a specific question about some aspect of gene expression analysis, feel free to try again with a new question. - Chris Miller