新闻 | 论坛 | 生物信息学专题 | 新思路 | 软件下载 | 相关数据库 | 免费主页

网站首页 BioSino Databese BioSino Lab BioSino Navigator 关于本站

 
站内搜索:  

专家问答:questions about linux clusters

 

[编者的话]

专家问答是Bio-IT的一个栏目,近期他们邀请了Linux NetworX 公司的技术官员Joshua Harr来回答有关于Linux Clusters的一些问题,这里节选了其中的一些问题和解答。

 

QUESTION:

I am interested in setting up a Linux cluster for our microarray core facilities. We can get a few old PCs. I wonder if any one has a good step-by-step guide for this purpose.

ANSWER :

Assembling spare PC equipment into a Beowulf cluster has a long successful history. Microarray analysis is a common application for these clusters. You can find benchmarking information for microarray analysis on a Beowulf cluster at: http://cmag.cit.nih.gov/Microarray.htm.

There are a number of good resources for building a cluster at: http://www.linuxhpc.org/pages.php?page=Documentation.

You can also find an active mailing list and wealth of other useful information at www.Beowulf.org. The mailing list is an excellent source of answers for beginners and experts alike.

Best of luck in your Beowulf adventure!

QUESTION:

Has anyone done a total-cost-of-ownership study on Linux clusters (vs. SGI, etc.)?

ANSWER :

To my knowledge, no one has performed a proper study of the total cost of ownership (TCO) of a Linux cluster versus alternatives. Some of the issues relating to total cost of ownership of a Linux cluster are covered in my response to the previous question. Some more specific work relating to reducing the TCO for a Linux cluster is covered in this article at the Cambridge Healthtech Institute: http://www.chiresource.com/newsarticles/issue12_2.asp

QUESTION:

We have a four-processor Sun and currently are serving approximately 40 users for high-throughput sequence analysis/annotation using GCG and in-house developed tools (mostly using BLAST and processing BLAST results).

In comparison to Unix (Sun Solaris), what are the hidden costs of operating a Linux cluster (apart from obvious issues about increased administration costs)? CPU-to-CPU, does the Linux solution require more floor space, cooling power, electricity, replacement parts, etc.?

If all you want to do is improve performance of BLAST, etc., is it more economical to invest in accelerator hardware such as Decypher?

ANSWER :

Overall, the total cost of ownership for Linux clusters relative to productive work performed is lower than any alternative I am aware of. The cost breakdown does differ from your current Sun system, however. Systems such as your Sun server tend to have very high initial up-front costs and high yearly maintenance costs with lower administration costs. Linux clusters generally have lower initial costs, lower priced yearly maintenance contracts, but higher administration costs (as you recognize in your question).

Overall, the largest cost comes from increased administration costs. There is a great amount of development work, by Linux NetworX and others, that aims to reduce this cost.

Linux clusters generally provide a significant reduction in the amount of required floor space compared to competitive systems. Several vendors offer systems that require 1U or less of rack space for the highest-performance servers (i.e., cluster nodes). The CPUs that are commonly used tend to draw a little more power and require more HVAC than your Sun system. The costs associated with power and cooling are negligible for most customers unless a very large system is deployed.

The largest factor in the reliability (i.e., maintenance costs) of Linux clusters is initial quality. The hardware used in Linux clusters changes rapidly and requires a dedicated testing effort to ensure that components work properly and work together in the system properly. A cluster from a vendor with good quality assurance procedures might be expected to have one to two RMA issues each year per fifteen nodes or so. The cost of replacement parts is generally covered by the vendor's support agreement, which is typically a fraction of the cost of a similar support agreement from a traditional high-end server vendor.

The greater administration costs and slightly greater maintainability are more than offset by the improvement in productive work that can be gained. It is very common to realize a 10-50x improvement in computational throughput per dollar spent compared to traditional high-performance computing equipment. A good vendor can help you benchmark your particular code on different platforms to help you determine the real-life improvement you will see. You can use this information to help you determine whether a Linux cluster is a good investment for you relative to other solutions, such as a BLAST accelerator. Also note that several groups are working on FPGA-based BLAST acceleration methodologies similar to DeCypher that can be deployed in a Linux cluster, taking advantage of the best of both solutions.

Answer Provided by: Joshua Harr

Joshua Harr is chief technology officer for Linux NetworX. Harr leads his company's effort to make clusters easier to manage, more powerful, and more reliable.

 


1999-2005 中国科学院上海生命科学研究院生物信息中心  
版权所有 All rights reserved.