October 2014 Galaxy Update
Welcome to the October 2014 Galaxy Update, a summary of what is going on in the Galaxy community. Galaxy Updates complement the Galaxy Development News Briefs which accompany new Galaxy releases and focus on Galaxy code updates.
Galaxy Needs Your Input!
The Galaxy Project is preparing for our next grant cycle and we are seeking your feedback and comments on all things Galaxy. There are two questionnaires, each with a different focus, based on how you interact with Galaxy.
Please take a few minutes and fill out whichever surveys apply to you. The questionnaires are structured so you can skip topics that don't apply to you, and every question is optional.
And, to thank you for your time and effort, the Galaxy Project will increase your storage quota on usegalaxy.org by 50GB, a 20% increase.
Let your voice be heard!
IRC Channel Policy Change
A change to the #galaxyproject IRC channel policy was proposed, discussed, and approved on Galaxy Biostar. Starting sometime in October, posts to this channel will be made available in a searchable archive on the web.
Thanks to everyone who participated in the decision, and those at GCC2014 who suggested this.
Events
Galaxy at ECCB'14
Galaxy had a strong presence at ECCB'14. Galaxy was featured in workshops, demos and posters. Couldn't make it? No worries, as most slides and posters are now available online.
Other Events
There are upcoming events in Switzerland, Germany, Australia, Norway, France, Italy, and the United States. See the Galaxy Events Google Calendar for details on other events of interest to the community.
New Papers
The Galaxy CiteULike library of publications reached 1800 papers this month. 71 papers (a new record) were added to the Galaxy CiteULike Group in September, including:
- Executing SADI services in Galaxy, by Aranguren, et al. Journal of Biomedical Semantics, Vol. 5, No. 1. (2014), 42, doi:10.1186/2041-1480-5-42
- A Survey of Cloud-Based Service Computing Solutions for Mammalian Genomics, by Church & Goscinski, IEEE Transactions on Services Computing, DOI: 10.1109/TSC.2014.2353645
- An automated infrastructure to support high-throughput bioinformatics, by Cuccuru, et al. High Performance Computing & Simulation (HPCS), 2014 International Conference on (July 2014), pp. 600-607, doi:10.1109/hpcsim.2014.6903742
- Experiences building Globus Genomics: a next-generation sequencing analysis service using Galaxy, Globus, and Amazon Web Services, by Madduri, et al. Concurrency and Computation: Practice and Experience, Special issue on XSEDE13, Volume 26, Issue 13, pages 2266–2279, 10 September 2014
- MIRPIPE – quantification of microRNAs in niche model organisms, by Kuenne, et al. Bioinformatics (2014) doi: 10.1093/bioinformatics/btu573
- ballaxy: web services for structural bioinformatics, by Hildebrandt, et al. Bioinformatics (2014) doi: 10.1093/bioinformatics/btu574
The new papers were tagged in many different areas:
| # | Tag | # | Tag | # | Tag | # | Tag |
|---|---|---|---|---|---|---|---|
| 4 | Cloud | - | Project | 4 | Tools | 5 | UsePublic |
| - | HowTo | 2 | RefPublic | - | UseCloud | - | Visualization |
| 5 | IsGalaxy | 2 | Reproducibility | 7 | UseLocal | 22 | Workbench |
| 31 | Methods | - | Shared | 5 | UseMain | | |
Who's Hiring
The Galaxy is expanding! Please help it grow.
- CDD Ingénieur NGS - Institut Curie, Paris, France
- Emploi CDD Ingénieur Bioinformatique - ChIP-seq, Marseille, France
- Research Specialist, Michigan State University, United States
- Bioinformatics and Computational Biology, US Army Engineer Research and Development Center’s Environmental Laboratory, Vicksburg, MS, United States
- Computational Science Developer I, Cold Spring Harbor Laboratory (CSHL), New York, United States
- Statistical Genomics Postdoc opening in the Makova lab at Penn State
- The Galaxy Project is hiring software engineers and post-docs
Got a Galaxy-related opening? Send it to outreach@galaxyproject.org and we'll put it in the Galaxy News feed and include it in next month's update.
New Public Servers
Two new public Galaxy servers were added to the published list in September:
GalaxEast
GalaxEast aims at providing a large range of bioinformatics tools for the analysis of various types of Omics data. It supports reproducible computational research by providing an environment for performing and recording bioinformatics analyses.
The GalaxEast project has the following main objectives:
- Provide the academic scientific community with an open and powerful Galaxy instance with a guaranteed availability. The platform offers access to cutting-edge and up-to-date tools for Omics data analysis with help and support.
- Propose innovative developments and new helpful tools packaged for Galaxy (available in the GalaxEast toolshed)
- Promote the packaging of new developments for Galaxy (through wrappers and/or toolshed packages).
GalaxEast is supported by IGBMC, CNRS, Inserm, and Université Strasbourg.
See "GalaxEast: an open and powerful Galaxy instance for integrative Omics data analysis," a poster presented at ECCB'14 by Stephanie Le Gras, et al., for more.
MIRPIPE
MIRPIPE focuses on quantification of microRNA based on smallRNA sequencing reads. From the home page: In opposition to present algorithms that generally rely on genomic data to identify miRNAs, MIRPIPE focuses on niche model organisms that lack such information. Among the MIRPIPE features are automatic trimming and adapter removal of raw RNA-Seq reads originating from various sequencing instruments, clustering of isomiRs, and quantification of detected miRNAs by homology search versus public or user uploaded reference databases.
See "MIRPIPE – quantification of microRNAs in niche model organisms," C. Kuenne, et al. for more. Email support and a MIRPIPE Manual are provided. MIRPIPE is sponsored by the Max Planck Institute for Heart and Lung Research.
Galaxy Community Hubs
The deployment details for the GalaxEast public server were posted in September. The Overview of Galaxy on Bio-Linux 8 page, by Tracey Timms-Wilson of the NERC Environmental 'Omics Synthesis Centre, was also added to the Community Log Board.
The Deployment Catalog and Community Log Board Galaxy community hubs were launched in 2013. If you have a Galaxy deployment or experience you want to share, please publish it this month.
New Releases
New versions of Galaxy, CloudMan, BioBlend, and blend4j were all released in August, as was Galaxy IPython.
Look for a new Galaxy distribution in October.
ToolShed Contributions
Galaxy Project ToolShed Repos
Here are new contributions for the past two months.
In no particular order:
Tools
- From saket-choudhary
  - sift_web: PROVEAN and SIFT predictions for a list of human genome variants.
- From gbcs-embl-heidelberg
  - jemultiplexer: debarcoding/demultiplexing tool for FASTQ files accommodating all complex multiplexing protocols (iCLIP, molecule barcoding, ...).
- From mcharles
  - rapsodyn: tools and workflow used for the Rapsodyn Project
  - rapsosnp: workflow to detect and select SNP for the rapsodyn project
- From ayllon
  - tcoffee: T-Coffee multiple alignment suite.
- From anton
  - bamtools: A collection of tools for manipulation of bamfiles
  - vcfflatten: Removes multi-allelic sites by picking the most common alternate
- From crs4
  - kggseq_variant_selection: Variant selection with KGGSeq
- From tyty
  - structurefold: StructureFold predicts RNA secondary structures from high throughput RNA structure profiling data
- From big-tiandm
  - sirna_plant: plant siRNA analysis toolkits. siRNA prediction, siRNA annotation, siRNA quantify
- From geert-vandeweyer
  - dc_genotyper: genotyper aimed at finding SNPs in high-ploidy (or pooled) samples sequenced at very high depth in a targeted region.
- From galaxyp
  - fasta_merge_files_and_filter_unique_sequences: Merge FASTA files, keeping only unique sequences
  - filter_by_fasta_ids: Extract sequences from a FASTA file based on a list of IDs
  - myrimatch: protein identification via database search using Bumbershoot MyriMatch
  - ltq_iquant_cli: iQuant performs tag based isobaric quantification
  - idpqonvert: Bumbershoot idpQonvert, a part of Bumbershoot IDPicker.
  - directag_and_tagrecon: protein identification via Directag and TagRecon.
Suites
- From biomonika
- From anton
  - suite_vcflib_tools_3_0: 23 tools for manipulation of VCF datasets
Packages / Tool Dependency Definitions
- From fubar
- From iuc
  - package_graphicsmagick_1_3_20:
  - package_bioperl_1_6: downloads and compiles version 1.6 of Bioperl.
  - package_onto_perl_1_41: downloads and compiles version 1.41 of the Ontology toolkit written in perl.
- From saket-choudhary
  - package_requests_2_2_1: Tool dependency definition of python-requests
  - package_beautifulsoup4_4_1_0: Tool dependency definition for python-bs4
- From qfab
  - opal2_4_1: Opal Package - GVL
- From galaxyp
  - package_peptides_to_gff_0_1: Installs the peptides_to_gff python script
  - package_myrimatch:
  - package_directag:
  - package_idpqonvert:
  - package_ltq_iquant_cli:
  - package_mgf_formatter:
  - package_tagrecon:
- From anton
  - package_vcflib_8a5602bf07: Compiled vcflib binaries for x86_64
- From geert-vandeweyer
  - package_igvtools_2_3_32: igvtools binaries, to be used as dependency in other tools.
- From lparsons
  - package_rseqc_2_4: downloads and compiles version 2.4 of RSeQC.
Updates
- From fubar
  - toolfactory: Citations added (thanks John!) and a few more output formats for Alistair Chilcott
- From nilesh
  - rseqc: Upgraded RSeQC to version 2.4
Other News
- Why the three biggest positive contributions to reproducible research are the iPython Notebook, knitr, and Galaxy on the Simply Statistics blog
- Updated wiki page about dynamically discovering output datasets at runtime.
- The Ansible playbook used to update usegalaxy.org is available in GitHub.
- New GVL Galaxy Release: Metagenomics Tutorial tools, MACS2, BLAST, MEME, hg38, rn6, and Trinity.
- Galaxy Community UK launches a Twitter channel: @GalaxyUKFriends
- BOSC 2015 will be in Dublin with ISMB/ECCB 2015. We should have voted more often!
- Supporting Enhanced Reproducibility for Platforms like Galaxy, discussion on GitHub.