高性能计算集群的搭建

合集下载
  1. 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
  2. 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
  3. 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。
分节点:eth1 内网 ip=192.168.0.101~192.168.0.121,localhost=hpc01~hpc21
2. 安装主节点
在 hp ProLiant DL385 上外接 usb dvd 光驱,bios 设置 usb 光驱第一顺序启动, 安装 Redhat 5.4 Enterprise Server。
高性能计算集群的搭建
PC-Cluster 手记
(Version: 0.91rc) 黄灿
Baidu Nhomakorabeacanhuang@mail.ustc.edu.cn
中国科学技术大学 地球和空间科学学院 2010 年 9 月 13 日
目录
1. 硬件平台和网络..........................................................................................................................1 2. 安装主节点..................................................................................................................................1 3. 配置主节点的 dhcp、nfs 和 tftp 服务......................................................................................1
1. 硬件平台和网络
1 个主节点:hp ProLiant DL385 21 个分节点:hp ProLiant DL145
Building a cluster system for HPC
Version: 0.91rc
主节点:eth0 外网 ip=222.195.74.94 eth1 内网 ip=192.168.0.1,localhost=hpc
在主节点根目录下新建 exports 目录,为以后网络共享使用。在 exports 目录 下新建 x64 目录,将安装光盘内的所有文件拷入,为网络安装备用。
3. 配置主节点的 dhcp、nfs 和 tftp 服务
3.1 dhcp 服务 在主节点配置 dhcp 服务的目的是为了在分节点网卡 pxe 启动时能够找到主
节点分配到 ip 以便以后访问。 编辑 dhcp 服务的配置文件:
# vi /etc/dhcpd.conf 键入以下内容: # # DHCP Server Configuration file. # see /usr/share/doc/dhcp*/dhcpd.conf.sample # option domain-name "ustc.edu.cn"; ddns-update-style none; default-lease-time 6000; max-lease-time 11400; server-name "bootserver"; use-host-decl-names on; option option-128 code 128=string; option option-129 code 129=string;
3.1 dhcp 服务 ................................................................1 3.2 nfs 服务 .................................................................5 3.3 tftp 服务 ................................................................5 4. 网络安装分节点..........................................................................................................................7 5. 设置主节点无密码 ssh 访问 ......................................................................................................7 6. 分节点配置 nfs 服务 ..................................................................................................................8 7. 主节点配置 nis 服务...................................................................................................................8 8. 分节点配置 nis 服务...................................................................................................................9 9. 安装 openmpi............................................................................................................................10 10. 安装 torque 和 ifort................................................................................................................ 11 10.1 主节点 .................................................................11 10.2 分节点 .................................................................12 10.3 ifort 的安装配置 .......................................................13 附录 I 管理员须知........................................................................................................................14 11.1 新建用户 ............................................................... 14 11.2 删除用户 ............................................................... 14 11.3 设置运行作业的机器数 ................................................... 14 附录 II 用户须知 ..........................................................................................................................15 12.1 串行作业 ............................................................... 15 12.2 并行作业 ............................................................... 15
subnet 192.168.0.0 netmask 255.255.255.0 { option routers 192.168.0.1; deny unknown-clients; group{ next-server 192.168.0.1; filename "pxelinux.0";
1
host hpc01 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:09:F1:14; fixed-address 192.168.0.101; } host hpc02 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:09:F1:3E; fixed-address 192.168.0.102; } host hpc03 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:09:F0:0E; fixed-address 192.168.0.103; } host hpc04 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:5F:86:D5; fixed-address 192.168.0.104; } host hpc05 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:09:F0:4E; fixed-address 192.168.0.105; } host hpc06 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:5F:86:AF; fixed-address 192.168.0.106; } host hpc07 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:5F:86:43; fixed-address 192.168.0.107; } host hpc08 {
2
Building a cluster system for HPC
Version: 0.91rc
filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:5F:94:58; fixed-address 192.168.0.108; } host hpc09 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:09:F0:66; fixed-address 192.168.0.109; } host hpc10 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:09:F1:34; fixed-address 192.168.0.110; } host hpc11 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:5F:86:F3; fixed-address 192.168.0.111; } host hpc12 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:5F:93:82; fixed-address 192.168.0.112; } host hpc13 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:5F:94:40; fixed-address 192.168.0.113; } host hpc14 { filename "pxelinux.0"; server-name "bootserver"; hardware ethernet 00:15:60:09:93:A0; fixed-address 192.168.0.114; } host hpc15 { filename "pxelinux.0"; server-name "bootserver";
相关文档
最新文档