December 2013 Galaxy Update
Welcome to the December 2013 Galaxy Update, a monthly summary of what is going on in the Galaxy community. Galaxy Updates complement the Galaxy Development News Briefs which accompany new Galaxy releases and focus on Galaxy code updates.
50+ Public Servers
Three new servers were added to the list of publicly accessible Galaxy servers last month. The number of servers has more than doubled in the past 12 months, with the new additions bringing the total to 51.
The server list has also been reorganized from a single alphabetical list into three distinct depending on server breadth of coverage. From generality to specificity the 3 lists are:
General Purpose Servers These servers support analysis across a wide range of biology. Domain Servers These servers are dedicated to a particular domain, either within the tree of life, or in a particular type of analysis. Tool Publishing Servers These servers' primary function is to enable researchers to use a specific set of locally developed tools, without having to install them locally and use command line interfaces.
The newly added servers are:
OSDDlinux LiveGalaxy
-
Links:
- OSDDlinux LiveGalaxy, part of the OSDD Portal.
- Customized Galaxy with Applications as Web Services and on the Grid for Open Source Drug Discovery, presentation by Anshu Bhardwaj and Chintalapati Janaki at GCC2011 (Slides, Video).
- Domain/Purpose: OSDDlinux LiveGalaxy exposes the tools that are included in OSDDlinux, "a customized linux operating system for drug discovery that integrates open source software, libraries, workflows and web services in linux for creating environment for drug discovery." OSDDLinux contains a Galaxy instance with applications for Drug Discovery.
- Comments: OSDDlinux LiveGalaxy integrates GPSR1.0, GPSR2.0 along with the webservers and the standalone software supported by OSDDlinux. See the complete list of tools and services included in OSDDlinux LiveGalaxy.
- User Support: Web form or [Email](mailto:osddlinux AT gmail DOT com)
- Quotas:
- Sponsor(s): The Open Source Drug Discovery Project and the OSDD Linux Project under the direction of Dr. GPS Raghava of the CSIR-Institute of Microbial Technology (IMTECH).
Dostie Lab Galaxy
- Link: Dostie Lab Galaxy
- Domain/Purpose: Chromosome Conformation Capture (3C) and 3C Carbon Copy (5C) analysis tools.
- Comments: "Recent studies indicate that genomes are organized into dynamic three-dimensional physical networks, which are required for correct gene expression. The functional (physical) connectivity between genes and regulatory DNA elements must be mapped to understand how correct gene expression is obtained and ultimately how the genome functions." These tools quantitatively measure the intensity of individual DNA contacts within the genome.
- User Support:
- Quotas: Must create a login to run tools. Anyone can create a login.
- Sponsor(s): Canadian Institute of Health Research, and the Canadian Cancer Society
PreSTIGE
- Link: PreSTIGE
- Domain/Purpose: "PreSTIGE (Predicting Specific Tissue Interactions of Genes and Enhancers) predicts cell-line specific gene-enhancer interactions for a user-provided cell line as well as 12 PreSTIGE cell lines."
-
Comments:
- Users can generate predictions for any cell type of interest for which H3K4me1 ChIP-seq and RNA-seq data are available.
- See "Combinatorial effects of multiple enhancer variants in linkage disequilibrium dictate levels of gene expression to confer susceptibility to common traits" by Corradin, et al., Genome Research (2013)
- User Support: PreSTIGE User Guide, [Email](mailto:olivia DOT corradin AT case DOT edu)
- Quotas: Available for academic use only
- Sponsor(s): Scacheri Lab at Case Western Reserve University, Department of Genetics and Genome Sciences
# New Papers
# | Tag | # | Tag | # | Tag | # | Tag | |||
---|---|---|---|---|---|---|---|---|---|---|
5 | Cloud | - | Project | 5 | Tools | - | UsePublic | |||
1 | HowTo | 3 | RefPublic | - | UseCloud | - | Visualization | |||
5 | IsGalaxy | 2 | Reproducibility | 4 | UseLocal | 18 | Workbench | |||
26 | Methods | - | Shared | 11 | UseMain |
61 papers (a new record) were added to the Galaxy CiteULike Group in November. Some papers that may be particularly interesting to the Galaxy community:
- "BS-Seeker2: a versatile aligning pipeline for bisulfite sequencing data" by Guo, et al., BMC Genomics, Vol. 14, No. 1. (10 November 2013), 774, doi:10.1186/1471-2164-14-774
- "Ten recommendations for creating usable bioinformatics command line software" by Torsten Seemann, GigaScience, Vol. 2, No. 1. (2013), 15, doi:10.1186/2047-217x-2-15
Who's Hiring
The Galaxy is expanding! Please help it grow.
- Stage Master2 "Annotation du transcriptome de l’esturgeon" Bordeaux, France
- Sysadmin/Computing Officer at Queen Mary University of London
- Informatics Software Support Specialist at the University of Minnesota, United States
- CDD Gif-sur-Yvette NGS et Variation Structurales UMR de Génétique Végétale, INRA-Université Paris Sud-CNRS
- CDD en bioinformatique : recherche de variants structuraux par reséquençage (NGS)
- Statistical Genomics Postdoc opening in the Makova lab at Penn State
- The Galaxy Project is hiring software engineers and post-docs, including positions at George Washington University.
Got a Galaxy-related opening? Send it to outreach@galaxyproject.org and we'll put it in the Galaxy News feed and include it in next month's update.
Events
GCC2014: June 30 - July 2, Baltimore
The 2014 Galaxy Community Conference (GCC2014) will be held June 30 through July 2, at the Homewood Campus of Johns Hopkins University, in Baltimore, Maryland, United States.
Training Day Nominations, December 2-20
GCC2014 will start with a Training Day on June 30. As for GCC2013, the specific topics offered will be determined by you, the Galaxy Community. Topic nomination will open December 2 and remain open through December 20. Nominated topics will be posted to the Training Day page as they come in. The organizers will then collate the nominations into a single list that the community (yes, that means you) will vote on the nominations in January. The final schedule of topics and instructors will be published no later than February 10, when registration opens.
Watch the Galaxy Mailing Lists for announcements and links to the nomination form.
Training Day is an opportunity to learn about all things Galaxy including using Galaxy, deploying and managing Galaxy, extending Galaxy, and Galaxy internals. There will be 4 parallel tracks, each with 3 sessions, with each of those sessions two and half hours long. That's 12 sessions and 30 hours of workshop material.
Sponsorships
We are pleased to announce several sponsorships for GCC2014. First, Ion Torrent will again be the exclusive Platinum Sponsor for the annual Galaxy meeting. This is the third year in a row that Ion Torrent has been the Platinum Sponsor of this event.
We are also pleased to announce that BioTeam will again be a Gold Sponsor for this annual event. This is the second year in a row that BioTeam has been a GCC Gold Sponsor.
There are still Gold, Silver and Bronze sponsorships still available. Please contact the Organizers if your organization would like to help sponsor this event.
Exhibitors
In 2014 we are also adding non-sponsor exhibit spaces in addition to the sponsor exhibits. This will significantly increase the size of the exhibit floor. Please contact the Organizers if your organization would like to have an exhibit space at GCC2014.
UC Davis Bioinformatics Boot Camps
There is still space in 3 of the 4 UC Davis Bioinformatics Bootcamps this month.
The next offering of UC Davis Bioinformatics Bootcamps will be held on the UC Davis campus December 10-13.
These focused one-day courses are for researchers looking to get up to speed quickly on the latest technologies and techniques in bioinformatics. Students will work on their own laptops and have continued access to software and example data used in the exercises through a public Amazon Web Services virtual machine. The first three bootcamps will use the Galaxy platform, and the final bootcamp will use both Galaxy and the command-line. The Alignment and Assembly bootcamps (Dec. 11th & 12th) require you to know Galaxy, so if you are unfamiliar with Galaxy, you should also take the Introduction bootcamp on Dec. 10th.
Tuesday, December 10:
Introduction to Next-Generation Sequence Analysis with Galaxy
Wednesday, December 11: Next-Generation Sequence Alignment and Variant Discovery
Thursday, December 12: Genome Assembly using Next-Generation Sequence Data
Friday, December 13: Introduction to the Amazon Cloud for Galaxy and the Command-Line
Enrollment for each bootcamp is capped at 24 students. Please enroll early to be assured of a seat, as these bootcamps usually fill up quickly! More information, including full descriptions of each bootcamp is available online. The cost for each bootcamp is $200 (academic/government) or $250 (non-academic/industry).
Other Events There is a lot going on in the next three months. Also see the [Galaxy Events Google Calendar](http://bit.ly/gxycal) for details on other events of interest to the community.
Galaxy Distributions
Nov 04, 2013 Galaxy Distribution
Highlights:
- A security vulnerability with filter tools trapped and fixed: Upgrade or Patch NOW
- Galaxy Tool Migration: 48 tools migrated to Tool Shed for a leaner distribution.
- Improved tools for administrators (email verification, reports).
- The framework for Tools, Visualizations, and the Core have been upgraded for performance and scalability.
- Come see! A whopping 26 pull requests incorporated!. Many thanks to our open source community!
- Numerous Tool Shed enhancements and upgrades: API,
READMEs
, Functional Tests, easier installs, and much much more. - Plus enhancements to Workflows, API, CloudLaunch, UI, and Bug Fixes.
http://bitbucket.org/galaxy/galaxy-dist
http://galaxy-dist.readthedocs.org
new: $ hg clone https://bitbucket.org/galaxy/galaxy-dist#stable
upgrade: $ hg pull
$ hg update release_2013.11.04
A new version of CloudMan was released in July.
ToolShed Contributions
New Public ToolSheds
This is the first time we've highlighted publicly accessible ToolSheds. We will continue to advertise this as we become aware of them.
GenOuest ToolShed
The GenOuest public ToolShed contains over 60 valid tools in 36 repositories in 9 categories.
Repositories in the Galaxy Project ToolShed
There were many...
- fastx_barcode_splitter_enhanced: allows barcodes in a separate index read, loads results into a history
- shear: Sample Heterogeneity Estimation and Assembly by Reference
- dimont_motif_discovery: A general approach for de-novo motif discovery from high-throughput data
- arts: Automated Randomization of multiple Traits for Study design
- samtools_idxstats: BAM mapping statistics (using samtools idxstats)
- vcftools_{annotate, compare, isec, merge, slice, subset} and all_vcftools
- filter_tools: Various Imaging Filters including Median, Emboss, Blur, Contour, Detail, ...
- image_tools: General Image Tools: Image Filmify, Image Tileify, Image Rotate, Image Resize
- and ...
deepTools
deepTools is a suite of tools that close the software gap between the very first steps of data processing of deeply sequenced data and biologically insightful downstream analyses. deepTools comprise efficient and reliable programs that enable users to perform quality controls and to generate bigWig files of normalised read coverages with different scaling approaches. Furthermore, the tools offer fast computation of coverages for user-specified genomic regions including their visualisation as meta-profiles and heatmaps. deepTools can be installed via the ToolShed and source is at GitHub.
Other News
- bioblend 0.4.1 CloudMan and Galaxy API library released
- New accounts on Main now have to activate using the link sent to email address or they can't run jobs (includes upload). Want to enable this on your own Galaxy instance? See the User Accounts Administration page.