I am out of the office until 08/12/2012.
hi All,
I will take vacation next week from 2012/08/06(Mon) to 2012/08/10(Fri).
Following is my schedule:
Defects on OS deployment: wang hua zhong
Darpa demo monitoring : I have finished the videos, will be reviewed after
I come back. During this
hi,
all the xcat log entries are placed in local4 facility with respective
levels, so with rsyslog, you can simply append a line
local4.*/var/log/messages
to rsyslog rule configuration file /etc/rsyslog.conf, then restart rsyslog
service to apply the rule.
thanks
best regards
China System Technology Laboratory
Tel: 86-10-82452903
Email: yang...@cn.ibm.com
Address: Building 28, ZhongGuanCun Software Park,
No.8, Dong Bei Wang West Road, Haidian District Beijing 100193,
PRC
北京市海淀区东北旺西路8号中关村软件园28号楼
邮编: 100193
From: Yuan Y Bai/China/IBM
To: Song BJ Yang
hi,
I think some of the CentOS/RH/Fedora kickstart templates are out of date.
I have opened a defect to track this problem(
https://sourceforge.net/p/xcat/bugs/3742/), hope can be fixed in xCAT
2.8.3.
thanks
best regards
any
more
Hi,
After updating to xCAT-2.8.4-snap201405120509.x86_64 the /install
directory of the xcatmaster's httpd is not available any more because
the file /etc/httpd/conf.d/xcat.conf is missing. Is this intended?
Best regards,
Markus Hillenbrand
[attachment smime.p7s deleted by Song BJ
hi,
for xCAT commands such as "genimage","packimage","nodeset", please use
osimage name as the argument, the previous "-p","-o","-a" options are
deprecated.
thanks
--
YANG Song (杨嵩)
IBM China System Technology
that they have used with xCAT, and/or advise how my kickstart file
should be modified.
Best regards,
David.
[attachment compute7.tmpl deleted by Song BJ Yang/China/IBM
cn.ibm.com
Address: Building 28, ZhongGuanCun Software Park,
No.8, Dong Bei Wang West Road, Haidian District Beijing 100193,
PRC
北京市海淀区东北旺西路8号中关村软件园28号楼
邮编: 100193
From: Russell Jones <russell-l...@jonesmail.me>
To: Song BJ Yang/China/IBM@IBMCN, Xiao Peng Wang/China/IBM@I
hi,
For the error messages you listed, it is a xcat bug
https://github.com/xcat2/xcat-core/issues/573, it has been fixed in 2.10:
https://github.com/xcat2/xcat-core/pull/738
would you please patch "/opt/xcat/sbin/xcatd" with the patch provided in
the issue and try again?
thanks
__
> > xCAT-user mailing list
> > xCAT-user@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/xcat-user
> >
>
> --
>
>
> ---
hi Anderw,
I have encountered the similar issue with NextScale NX360M5, xCAT provides
some steps on this:
https://sourceforge.net/p/xcat/wiki/XCAT_iDataPlex_Cluster_Quick_Start/#begin-installation
would you please the steps mentioned in the doc? hope this helps.
10-82452903Email: yang...@cn.ibm.comAddress: Building 28, ZhongGuanCun Software Park,No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC北京市海淀区东北旺西路8号中关村软件园28号楼邮编: 100193
- Original message -From: Er Tao Zhao/China/IBMTo: Song BJ Yang/China/IBM@IBMCNCc:Subject: Fw: [xcat
: yang...@cn.ibm.comAddress: Building 28, ZhongGuanCun Software Park,No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC北京市海淀区东北旺西路8号中关村软件园28号楼邮编: 100193
- Original message -From: Xiao Peng Wang/China/IBMTo: Song BJ Yang/China/IBM@IBMCNCc:Subject: Fw: Re: [xcat-user] SL7.2
hi Rich, Victor,
when os provision is finished successfully, the node will still try to boot from network, but since xCAT will leave a "#boot exit" in the boot loader configuration to tell the node to exit and try the next boot option configured in firmware.
# cat
message -From: peter CZ1 Peng <peng...@lenovo.com>To: Song BJ Yang/China/IBM@IBMCNCc: "xcat-user@lists.sourceforge.net" <xcat-user@lists.sourceforge.net>Subject: RE: RE: 回复:[xcat-user] xcat 2.13.0 with bad template for the centos 7.3Date: Tue, Dec 13, 2016 10:45 AM
软件园28号楼邮编: 100193
- Original message -From: Xiao Peng Wang/China/IBMTo: Song BJ Yang/China/IBM@IBMCN, Er Tao Zhao/China/IBM@IBMCNCc:Subject: Fw: [xcat-user] Power8 Minksy eval node - RHEL 7.3 installation troublesDate: Wed, Dec 14, 2016 8:12 AM
It's a Minsky customer, please take
GuanCun Software Park,No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC北京市海淀区东北旺西路8号中关村软件园28号楼邮编: 100193
----- Original message -From: Xiao Peng Wang/China/IBMTo: Song BJ Yang/China/IBM@IBMCNCc:Subject: Fw: Re: [xcat-user] /usr/bin/ping on diskless lost capabilities rh
ngGuanCun Software Park,No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC北京市海淀区东北旺西路8号中关村软件园28号楼邮编: 100193
----- Original message -From: Wei Hua WH Hu/China/IBMTo: Song BJ Yang/China/IBM@IBMCNCc:Subject: Fw: Re: [xcat-user] RHEL-7.3 pro
ANG Song (杨嵩)IBM China System Technology LaboratoryTel: 86-10-82452903Email: yang...@cn.ibm.comAddress: Building 28, ZhongGuanCun Software Park,No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC北京市海淀区东北旺西路8号中关村软件园28号楼邮编: 100193
- Original message -From: Er Tao Zhao/China/
hi techie,
Thanks for your interest on xCAT.
"don't want Xcat to flag the current running systems to provision.", so what features of xCAT you plan to leverage in your cluster management? is there an existing cluster management tool in your cluster?
This is an interesting question for us,
hi David Johnson,
In the scenario you described, I think xCAT installation will affect the stuff under `/install/postscripts`
1) files with the same name with the files under `/install/postscripts` shipped by xCAT will be overwritten
2) the credentials under "/install/postscripts/hostkeys",
hi Daniel,
which discovery method are you using? mtms-based or switch-based? The invalid ipmi commands in dodiscovery will break the mtms-based discovery and the task like `bmcsetup` defined in the chain attribute.
For switch-based discovery, the mandatory information is the mac address, which
hi Pharthiphan Asokan,
the error message "SSLEAY_RAND_BYTES:PRNG not seeded" is thrown out by openssl due to lack of random source and seed file on the node. Please check the openssl version on your management node, and check the existence of random devices by `ls -l /dev/*random` and seed file
hi Rogie Pamintuan,
xCAT leverage `logrotate` , the configuration can be found in `/etc/logrotate.conf` and `/etc/logrotate.d/xcat`, you can customize them according to your requirement
--YANG Song (杨嵩)IBM China
Hi Daniel,
Currently, "rmimage" does not support delete diskful osimages, this is because other osimages might rely on the `pkgdir` of the diskful osimage, such as the diskless/statelite osimages with the same `pkgdr`.
Back to your question, the command combination of `rmdef -t osimage -o `
hi Jeff Berry,
when did you see the error message? during `genimage`? or during node boot up?
"
code killed, status 6/ABRT
on restart ‘/run/log/journal//system.journal corrupted or uncleanly shut down.
"
can you see the login prompt on the console?
directory in the image, but that didn’t help.Where do I turn up the debugging?Thanks!― ddj
hi peng cheng zhu,
the packimage will update the "/etc/shadow" and hence the /etc/shadow in the provisioned diskless node according to your passwd table , you can rerun packimage and check whether the file is updated
hi Sam,
is the screenshot captured during the rootimg boot up? or during initrd boot up before rootimage tarball is download? please check the status of the node by `lsdef -i status`
--YANG Song (杨嵩)IBM China System
nology LaboratoryTel: 86-10-82452903Email: yang...@cn.ibm.comAddress: Building 28, ZhongGuanCun Software Park,No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC北京市海淀区东北旺西路8号中关村软件园28号楼邮编: 100193
- Original message -From: Brian Joiner To: Song BJ Yang Cc:Subject: R
fication, please make sure the modification is ok in your env.
--YANG Song (杨嵩)IBM China System Technology LaboratoryTel: 86-10-82452903Email: yang...@cn.ibm.comAddress: Building 28, ZhongGuanCun Software Park,No.8, Dong
ring the upgrade.
Thanks,
Brian Joiner
On Thu, Aug 9, 2018 at 10:10 PM, Song BJ Yang <yang...@cn.ibm.com> wrote:
Hi Brian Joiner,
is there any packages specified in `otherpkglist` and `otherpkgdir`? which which will be installed by `otherpkgs` during the post-installation reboot
wo
Hi Brian Joiner,
is there any packages specified in `otherpkglist` and `otherpkgdir`? which which will be installed by `otherpkgs` during the post-installation reboot
would you please provide the osimage definition and node definition? thanks
`updatenode` will download all the postscripts from "/install/postscripts/" on MN to "/xcatpost/" on CN.
so please make sure whether the missing files exist in "/install/postscripts/" on MN.
If yes, then check whether there is some issue on the httpd/apache server side during downloading, you
Hi,
Please ssh the booted up compute node with password and "cd /xcatpost"
then make sure the scripts allowcred.awk and getcredentials.awk are there and you can run them.
the shebang of these scripts is "/usr/bin/awk", so please make sure the awk path is ok
Hi Jeff,
did you enabled kdump? the dump core file might help to find out the problem
--YANG Song (杨嵩)IBM China System Technology LaboratoryTel: 86-10-82452903Email: yang...@cn.ibm.comAddress: Building 28, ZhongGuanCun
> I made abstraction of BMC configuration and I didn't use xcat with KVM, the idea is to simulate physical deployment.
not quite understand this, what is the "mgt" attribute of the node?
> The problem : the PXE boot works fine until the download of the vmlinuz image and it hangs, and nothing
Hi Keith,
> The references I see to creating repos (base, or updates) on the xCAT management node all indicate that only local repos are supported (e.g. "baseurl=file:///..").
for redhat diskless osimage, we do support online repo in pkgdir, for example,
`chdef -t osimage -o myosimage
hi Jonathan,
"getcredentials.awk" expect 2 environment variables:
USEOPENSSLFORXCAT=1
XCATSERVER=:3001
and 1 argument, i.e, the name of the credential, such as "ssh_rsa_hostkey"
so you can export the 2 environment variables first, run allowcred.awk in background with "./allowcred.awk
I have tried removing the cons setting, to no avail. The console settings are “115200”.Group entry from the nodehm table: “"ipmi","ipmi","ipmi","0","115200","hard",,”Thank you all,
Sam
From: Song BJ Yang Sent: Tuesday,
pc64 < 1:2.13.10--> Finished Dependency ResolutionError: xCAT-genesis-base-ppc64 conflicts with 1:xCAT-genesis-scripts-ppc64-2.12.5-1.noarchError: xCAT-genesis-base-x86_64 conflicts with 1:xCAT-genesis-scripts-x86_64-2.12.5-1.noarch You could try using --skip-broken to work around the problem
2.30.101.3 - - [06/Jul/2018:10:36:50 -0400] "GET /tftpboot/xcat/osimage/KSU-rhels7.3-netboot-compute/initrd-stateless.gz HTTP/1.1" 200 34271424 "-" "iPXE/1.0.3-131028 (d603e)"
172.30.101.3 - - [06/Jul/2018:10:53:06 -0400] "GET //install/netboot/rhels7.3/x86
Hi Javier,
seems this is not a fresh install, it's an update process. What is your current xCAT version? There might be problem when upgrading xCAT across 2+ releases.
or you can try to remove current xCAT packages first, then install xCAT 2.12
best regards
.
Regards,
Jeff
From: Song BJ Yang [mailto:yang...@cn.ibm.com]Sent: 20 June 2018 08:19To: xcat-user@lists.sourceforge.netCc: xcat-user@lists.sourceforge.netSubject: Re: [xcat-user] SciLinux 7.4 statelite problems
hi Jeff Berry,
when did you see the error message? during `genimage`? or during
BTW, refer to Doc to see whether some steps are missed
https://xcat-docs.readthedocs.io/en/latest/advanced/hierarchy/databases/mysql_configure.html?highlight=mysqlsetup
--YANG Song (杨嵩)IBM China System Technology
Hi,
first of all, "Unable to read private ECDSA key from /etc/xcat/hostkeys" is thrown out by `remoteshell`, which is caused by version difference of openssh-server on MN and CN, ECDSA is not supported on some old openssh-server version, it is not a fatal error and won't block anything.
what
if you can login in the provisioned diskless os on compute node, please check /var/log/xcat/xcat.log on it, this log includes the logs during postscripts and postbootscritps running
--YANG Song (杨嵩)IBM China System
Hi Brian,
Cong!
Please tell us the os version and arch of your service node, or better the osimage definition of the service node, so that we can confirm whether `perl-DBD-MySQL` is missing somewhere, thanks
--YANG
hi langton nkiwane,
it is not recommended to keep the "ethX" naming, since the NIC name like "ethX" is not consistent across system reboot, even not consistent in "initrd" phase and the installed system. This means you cannot predict which NIC is "ethX" when you specify it, so it make no sense
Hi,
I do not think "issue an yum with installroot argument and fire up an upgrade" is a good idea, the basic idea is that you can always regenerate the desired osimage via `genimage`+`packimage`, so all the modifications/customizations upon the diskless osimage should be tracked with the
.These messages are an attempt to steal your username and password. Please do not reply to, click the links within, or open the attachments of these messages. Delete them!
On Tue, Sep 4, 2018 at 8:10 PM Song BJ Yang <yang...@cn.ibm.com> wrote:
Hi Keith,
> The references I see to creati
what is the error message?
can you find anything wrong in `/var/log/httpd/access_log` and `/var/log/httpd/error_log`?
--YANG Song (杨嵩)IBM China System Technology LaboratoryTel: 86-10-82452903Email:
, or open the attachments of these messages. Delete them!
On Tue, Sep 4, 2018 at 8:10 PM Song BJ Yang <yang...@cn.ibm.com> wrote:
Hi Keith,
> The references I see to creating repos (base, or updates) on the xCAT management node all indicate that only local repos are supported (e.g. &quo
hi,
currently, `localdisk` only support "mbr" partitions. But I think this is a reasonable feature for us, you can open a feature request in https://github.com/xcat2/xcat-core, we can plan this in the following development sprints.
Now, you can modify
Hi Hannum,
did you observe any error in apache/httpd log,"access_log" and "error_log" under "/var/log/httpd/", on the server "172.20.0.11"?
did it work if you restart apache/httpd?
--YANG Song (杨嵩)IBM China System
"primarynic" has been deprecated
#tabdump -d noderes
> installnic: The network adapter on the node that will be used for OS deployment, the installnic can be set to the network adapter name or the mac address or the keyword "mac" which means that the network interface specified by the mac
what is the backend database? run `lsxcatd -d` to get the backend db
--YANG Song (杨嵩)IBM China System Technology LaboratoryTel: 86-10-82452903Email: yang...@cn.ibm.comAddress: Building 28, ZhongGuanCun Software
, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC北京市海淀区东北旺西路8号中关村软件园28号楼邮编: 100193
- Original message -From: Javier Ron To: "xcat-user@lists.sourceforge.net" Cc:Subject: Re: [xcat-user] restoredb errorsDate: Fri, Dec 21, 2018 6:18 PM
Hi,
It's
dbengine=SQLite
the 2 shared nfs entries will be added on xCAT installation or upgrading or `xcatconfig -i/-f`
these 2 nfs shared directories are only used in 2 scenarios:
1) NFS based statelite
2) hierarchy cluster when site.sharedtftp and site.sharedinstall
any missing scenario?
We got some complains on
Hi Kevin,
Interesting to hear that you are managing Dell servers with xCAT, may I have 2 additional questions on this?
1. is there any gap or manual workaround in your practice to manage Dell R230 servers with xCAT? Since from our perspective, the official verification on managing
28, ZhongGuanCun Software Park,No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC北京市海淀区东北旺西路8号中关村软件园28号楼邮编: 100193
- Original message -From: Rich Sudlow To: xCAT Users Mailing list , Song BJ Yang Cc:Subject: Re: [xcat-user] How to re-discover a node?Date: Wed, Nov 21
the attachments of these messages. Delete them!
On Tue, Nov 20, 2018 at 7:21 PM Song BJ Yang <yang...@cn.ibm.com> wrote:
Hi Kevin,
Interesting to hear that you are managing Dell servers with xCAT, may I have 2 additional questions on this?
1. is there any gap or manual workaround i
PLEASE CHECK whether these directories are included in "exlist" of the osimage, which will excluded from the compressed rootimg during `packimg`
--YANG Song (杨嵩)IBM China System Technology LaboratoryTel:
hi Kevin,
We have just finished the support of site.httpport support,
you can apply the feature with the following 2 steps:
1. modify the apache/httpd configuration to switch the server port
2. change site.httpport to the number you expect
3. `makedhcp -n`, `nodeset/rinstall`
The
> if for some reason installnic is not available, does xNBA fallsback to whatever works (in BIOS order) ?[installnic concerns xNBA, doesn't it] ?
the behavior is determined by the "boot order" in bios settings, if the selected network boot device("installnic" you mentioned) is not available, the
ets/10.40.0.0_16
Do you have any ideas why this might have started happening? Because this was not the case previously…
The xCAT version is: 2.14.5
Thanks,
Sandra
From: Song BJ Yang Sent: Saturday, 16 March 2019 1:23 AMTo: xcat-user@lists.sourceforge.netCc: xcat-user@lists.source
Hi,
`sleep 36500d` means there are fatal error in the post install phase, and xcat make the installer stop there for further field debug.
You can search `sleep 36500d` in `/install/autoinst/` on MN for the possible fatal errors and relevant log messages that can make the post-installation
Hi David,
xCAT does provide the support for cumulus switch provision and configuration, please find the Doc on https://xcat-docs.readthedocs.io/en/latest/advanced/networks/onie_switches/index.html
you mentioned
> I'd like to add a small dynamic DHCP range on that 10.4.y.z network while nodes
ure use...
Thanks,
Brian Joiner
On Wed, Mar 6, 2019 at 3:52 PM Brian Joiner <martinitime1...@gmail.com> wrote:
Song,
So configure the postinstall script with "systemctl start gmond"?
On Tue, Mar 5, 2019 at 8:15 AM Song BJ Yang <yang...@cn.ibm.com> wrote:
the postscrip
the postscripts and postbootscripts are invoked on the [start] of a system service named "xcatpostinit1.service", since systemd start the services in parallel with consideration on dependency. There might be some kind of deadlock while trying to restart a service in another service. Someone has
Hi,
We encountered a similar issue https://github.com/xcat2/xcat-core/issues/274 , but in this case the console uncovered the root cause.
However, your console output does not show why the boot up process hang. I suggest you add more verbose output during boot up, this is a reference
38, Angelo Cavalcanti <angelo.cavalca...@gmail.com> escreveu:
1. The status is "powering-on"
2. Yes, the issue happens in the same node
3. Ok. I will send the xCAT-probe output session
Angelo Cavalcantibr.linkedin.com/in/angelocr
Em sex, 15 de mar de 2019 às 07:37, Song BJ Y
interesting feature. I would like to take a look at your scripts.
several questions:
1. is this automatic console behavior only availabe when `console=` not specified in the kernel options?
2. what is "DCD"?
--YANG
Hi Daniel,
Great work!
I will look into the repo and forward this mail to the team to see whether we can build the automation of discovery and hardware control like this, currently all the such kind of test are triggered manually on bare metal servers.
one comment for your repo
hi, what is your node definition and os definition?
ps, according to https://github.com/xcat2/xcat-core/wiki/XCAT_2.13.2_Release_Notes, the last xCAT release verified on rh6.8 is xCAT 2.13.2
--YANG Song (杨嵩)IBM China
ina System Technology LaboratoryTel: 86-10-82452903Email: yang...@cn.ibm.comAddress: Building 28, ZhongGuanCun Software Park,No.8, Dong Bei Wang West Road, Haidian District Beijing 100193, PRC北京市海淀区东北旺西路8号中关村软件园28号楼邮编: 100193
- Original message -From: Angelo Cavalcanti To: Song BJ Yan
message -From: Song BJ Yang/China/IBMTo: angelo.cavalca...@gmail.comCc: xcat-user@lists.sourceforge.netSubject: Re: [xcat-user] [External] Netboot process stuckDate: Fri, Mar 15, 2019 6:35 PM
Hi,
If the console output covers the whole process, seems the the initrd boot up process did not reach
What is the osimage definition? if you are provisioning rh7/centos7 and using customized partition file with disk specified with /dev/sdx, the sdx disk name is not persistent across reboots, as well as installer and the booted up system
another hint is to enable xcatdebug mode by changing
uld I check?
Can I get more debug info from the genesis kernel?
Can I get more debug info from the xcat master?
On Thu, May 23, 2019 at 4:51 AM Song BJ Yang <yang...@cn.ibm.com> wrote:
Hi,
You can watch the output of `journalctl -u xcatd -f` in another session during nodediscovery
r.
I have ensured that iptables is open for those ports.
I have ensure that there is a process listening on port 3001.
What else should I check?
Can I get more debug info from the genesis kernel?
Can I get more debug info from the xcat master?
On Thu, May 23, 2019 at 4:51 AM Song BJ Yang <ya
Hi,
You can watch the output of `journalctl -u xcatd -f` in another session during nodediscovery
there are some similar issues reported before, see https://sourceforge.net/p/xcat/mailman/search/?q=Unrecognized+directive+ , you can simply go through them for any hint
These 2 lines are not fatal errors which caused otherpkgs return non-zero
./otherpkgs: line 821: /usr/bin/logger: Argument list too long
./otherpkgs: line 931: /usr/bin/logger: Argument list too long
Please provide the full output of `updatenode -P otherpkgs -V` to find out the real point of
picious as to why "getdestiny" returns data when using the "rinstall" method, but doesn't return data when doing nodediscovery.
What should the "nodediscoverstart" command do?
How can I check that nodediscoverstart did (or did not do) the right thing on the master no
see my reply in https://github.com/xcat2/xcat-core/issues/6285
--YANG Song (杨嵩)IBM China System Technology LaboratoryTel: 86-10-82452903Email: yang...@cn.ibm.comAddress: Building 28, ZhongGuanCun Software Park,No.8, Dong
84 matches
Mail list logo