Cluster Information

From Earlham Cluster Department

(Difference between revisions)
Jump to: navigation, search
(Current To-Do)
(Current To-Do)
Line 2: Line 2:
Date represents last meeting where we discussed the item
Date represents last meeting where we discussed the item
-
* Writeup for Jeff Krause -- see email -- '''Aaron, Sam, Fitz, Gus?, Charlie?'''
+
* Writeups for Jeff Krause (3/Nov/09)
-
* Brad's Graphing Tool  (19/Oct/09) -- '''Brad'''
+
* Brad's Graphing Tool  (19/Oct/09)
   * 2 y-axis scales (runtime and problem size)
   * 2 y-axis scales (runtime and problem size)
   * error bars for left and right y-axes with checkboxes for each
   * error bars for left and right y-axes with checkboxes for each
   * eps file for the poster
   * eps file for the poster
-
* TeraGrid Runs  (19/Oct/09) -- '''Aaron, Sam, Gus'''
+
* TeraGrid Runs  (3/Nov/09)
   * Update configure.ac based on Fitz's notes from Kraken
   * Update configure.ac based on Fitz's notes from Kraken
-
* New LittleFe Boards  (19/Oct/09) -- '''Gus'''
+
  * Status on pople?
-
  * Order the boards -- '''Charlie'''
+
  * Remember Big Red has a debug queue
-
  '''Board Criteria:'''
+
* New LittleFe Boards  (19/Oct/09)
-
  * mini itx form factor
+
* BCCD Testing  (3/Nov/09)
-
  * at least 2 core (probably max 2 core)
+
   * Liberation testing
-
  * 2 GB ram
+
   * pxe booting
-
  * Cuda Enabled (with chip on board); OpenCL O.K. too
+
   * Test all boot options
-
* BCCD Testing  (19/Oct/09) -- '''Gus, Sam, and Aaron'''
+
  * Test CUDA boot options (waiting for message from Leandro)
-
  * How many node boot up as dhcp servers?  (answer:  just the first one, confirmed 10/13)
+
   * Send /etc/bccd-revision with each email
-
   * Liberation testing -- '''Charlie, Gus, Sam, Aaron''' -- moved to Tuesday, 10/20 @ 2:30p
+
  * Send output of netstat -rn and /sbin/ifconfig -a with each email
-
   * Verify no I/O errors in dmesg
+
-
   * Test all boot options, including linux 5
+
-
   * Send /etc/bccd-revision to Skylar w/ each email
+
   * Test MPICH2 on Life, paramspace, GalaxSee, etc.
   * Test MPICH2 on Life, paramspace, GalaxSee, etc.
     * module unload openmpi && module load mpich2 && make clean && make
     * module unload openmpi && module load mpich2 && make clean && make
     * for MPICH2, machines file must be in current directory
     * for MPICH2, machines file must be in current directory
     * do -np 2, 4, etc.
     * do -np 2, 4, etc.
-
  * Send the results of running netstat -rn and /sbin/ifconfig -a on our test station to Skylar
+
* SCED Poster Sessions (3/Nov/09)
-
 
+
   * Send current copy to Jeff by next Monday
-
* Travel Plans  (12/Oct/09)
+
-
   * Send flight info to Jeff Krause -- '''Charlie'''
+
-
  * Talk to Travel-On -- '''Brad and Charlie'''
+
-
* Code Sanity Check  (12/Oct/09) -- '''Aaron and Sam'''
+
-
  * Take exact same set of runs on Sooner
+
-
* EC/SCED Poster Sessions  (19/Oct/09) '''Aaron, Sam, Fitz, and Brad'''
+
-
  * Describe the environment we've created -- how we obtain results is just as interesting as results themselves
+
-
  * Pick good graphs
+
-
  * Add text of TeraGrid poster into CVS -- '''Sam'''
+
-
  * Update Fitz's abstract -- '''Fitz'''
+
-
  * EC -- Wednesday (10/21) @ 6 pm
+
-
  * SCEd -- Combine TeraGrid and Fitz posters
+
== Summer of Fun (2009) ==
== Summer of Fun (2009) ==

Revision as of 17:23, 3 November 2009

Contents

Current To-Do

Date represents last meeting where we discussed the item

 * 2 y-axis scales (runtime and problem size)
 * error bars for left and right y-axes with checkboxes for each
 * eps file for the poster
 * Update configure.ac based on Fitz's notes from Kraken
 * Status on pople?
 * Remember Big Red has a debug queue
 * Liberation testing
 * pxe booting
 * Test all boot options
 * Test CUDA boot options (waiting for message from Leandro)
 * Send /etc/bccd-revision with each email
 * Send output of netstat -rn and /sbin/ifconfig -a with each email
 * Test MPICH2 on Life, paramspace, GalaxSee, etc.
   * module unload openmpi && module load mpich2 && make clean && make
   * for MPICH2, machines file must be in current directory
   * do -np 2, 4, etc.
 * Send current copy to Jeff by next Monday

Summer of Fun (2009)

An external doc for GalaxSee
Documentation for OpenSim GalaxSee

What's in the database?

GalaxSee (MPI) area-under-curve (MPI, openmpi) area-under-curve (Hybrid, openmpi)
acl0-5 bs0-5 GigE bs0-5 IB acl0-5 bs0-5 GigE bs0-5 IB acl0-5 bs0-5 GigE bs0-5 IB
np X-XX 2-20 2-48 2-48 2-12 2-48 2-48 2-20 2-48 2-48

What works so far? B = builds, R = runs, W = works

B-builds, R-runs area under curve GalaxSee (standalone)
Serial MPI OpenMP Hybrid Serial MPI OpenMP Hybrid
acls BRW BRW BRW BRW BRW
bobsced0 BRW BRW BRW BRW BRW
c13 BRW
pople
Charlie's laptop BRW

To Do

Implementations of area under the curve

GalaxSee Goals

GalaxSee - scale to petascale with MPI and OpenMP hybrid.

LittleFe

Notes from May 21, 2009 Review

BobSCEd Upgrade

Build a new image for BobSCEd:

  1. One of the Suse versions supported for Gaussian09 on EM64T [v11.1] - Red Hat Enterprise Linux 5.3; SuSE Linux 9.3, 10.3, 11.1; or SuSE Linux Enterprise 10 (see G09 platform list) <-- CentOS 5.3 runs Gaussian binaries for RHEL ok
  2. Firmware update?
  3. C3 tools and configuration [v4.0.1]
  4. Ganglia and configuration [v3.1.2]
  5. PBS and configuration [v2.3.16]
  6. /cluster/bobsced local to bs0
  7. /cluster/... passed-through to compute nodes
  8. Large local scratch space on each node
  9. Gaussian09
  10. WebMO and configuration [v9.1] - Gamess, Gaussian, Mopac, Tinker
  11. Infiniband and configuration
  12. GNU toolchain with OpenMPI and MPICH [GCC v4.4.0], [OpenMPI v1.3.2] [MPICH v1.2.7p1]
  13. Intel toolchain with OpenMPI and native libraries
  14. Sage with do-dads (see Charlie)
  15. Systemimager for the client nodes?

Installed:

Fix the broken nodes.

(Old) To Do

BCCD Liberation

Curriculum Modules

LittleFe

Infrastructure

SC Education

Current Projects

Past Projects

General Stuff

Items Particular to a Specific Cluster

Curriculum Modules

Possible Future Projects

Archive

Personal tools
Namespaces
Variants
Actions
websites
wiki
this semester
Toolbox