Research Computing Users Group & Workshop

For those of you who might be unaware, RC has been holding weekly user group and workshop meetings to answer questions and discuss issues in person.

The weekly Research Computing Workshop Sessions will be moving to the new Advanced Visualization Center beginning this week. The Visualization Center is in room PHY 147, which is across [...]

Issue with /work, 04/12/2012

Today it was discovered that roughly 40 compute nodes within our cluster had dropped their /work mounts.

As a result, those nodes needed to be rebooted. Once they resumed operation, /work was available again without any issue.

Because of the reboots, running user jobs would have been negatively affected.  These jobs will need to be re-submitted given [...]

InfiniBand Fabric Issue, 04/10/2012

During work to add several more compute nodes to the wh.2012.01.q hardware pool, the power cable to one of the InfiniBand switches was briefly disconnected by accident. The problem has been resolved; if any of your jobs failed with the error below, please resubmit them. We apologize for any inconvenience.


WARNING: There is at least one [...]

May 2012 Upgrades to /work Filesystem

During the first week of May, we will be upgrading the current /work filesystem to provide significant performance enhancements and, most importantly, long-awaited stability fixes.

/work will be migrated from a home-grown GlusterFS volume of roughly 100TB to a vendor-supported Lustre filesystem of equivalent capacity, capable of 4GB/s of read/write throughput [...]

New Compute Resources: Sandy Bridge and Fermi GPUs

Despite the significant system expansion that occurred back in January, adding over 1500 CPU cores and several terabytes of memory, we can always use more resources. We are finalizing some small details to add at least 60 new systems to the cluster based on the E5-2630 Intel Sandy Bridge CPU. This will add an additional [...]