Oracle 11g RAC: wrong ownership on OCR automatic backup files causes disk usage to keep growing
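During today's routine inspection I found that on one machine the /u01 filesystem holding the GI (Grid Infrastructure) home had grown to 60 GB, while the other machines were normal.
Drilling down with du -sk showed that the OCR automatic backups under $GRID_HOME/cdata (by default this directory holds the automatic and manual backups of the OLR and OCR) were abnormal: there were many number{n}.ocr files, that is, files whose names consist only of digits: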
-rw------- 1 grid:oinstall 6766592 Nov 10 22:17 week.ocr
-rw------- 1 grid:oinstall 6766592 Nov 10 22:17 day.ocr
-rw------- 1 grid:oinstall 6766592 Nov 11 02:17 day_.ocr
-rw------- 1 grid:oinstall 6766592 Nov 11 02:17 backup02.ocr
-rw------- 1 grid:oinstall 6766592 Nov 11 06:17 backup01.ocr
-rw------- 1 grid:oinstall 6766592 Nov 11 10:17 backup00.ocr
-rw------- 1 root system 7094272 Nov 11 18:21 91530747.ocr
-rw------- 1 root system 7426048 Nov 11 22:21 20587917.ocr
-rw------- 1 root system 7426048 Nov 12 02:21 29546896.ocr
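The listing shows that the new files produced by the OCR automatic backup are named number{n}.ocr, so the automatic backup is misbehaving. Is it a bug? The backups listed by ocrconfig -showbackup are still the normally named files. Comparing file state and attributes with a healthy system showed that the owner and group differ. Could something have gone wrong during installation, namely when rootcrs.pl (root.sh) changed the file permissions? A related document describes the following: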
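To see how many of these digit-only backups have piled up, the directory can be scanned with a small shell function. This is a minimal sketch: `list_numeric_ocr` and its `backup_dir` argument are hypothetical names introduced here for illustration, and the function only prints file names without changing anything.

```shell
#!/bin/sh
# Print every *.ocr file in a directory whose base name is made of digits only
# (for example 91530747.ocr). Normally named backups such as backup00.ocr,
# day_.ocr or week.ocr are skipped.
# backup_dir is a hypothetical argument: pass the real cdata/<cluster> directory.
list_numeric_ocr() {
    backup_dir=$1
    for f in "$backup_dir"/*.ocr; do
        [ -f "$f" ] || continue          # skip if the glob matched nothing
        name=${f##*/}                    # strip the directory part
        stem=${name%.ocr}                # strip the .ocr suffix
        case $stem in
            ''|*[!0-9]*) ;;              # empty or contains a non-digit: not a stray backup
            *) printf '%s\n' "$f" ;;
        esac
    done
}
```

A call such as `list_numeric_ocr "$GRID_HOME/cdata/<cluster-name>" | wc -l` then gives the number of stray files; multiplying by the roughly 7 MB per file seen in the listing gives a rough idea of the wasted space.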
Due to bug 9446443, automatic OCR backups are incorrectly owned which is preventing CRSD from overwriting them.
Expected ownership and permission on Linux - all 7 of them:
-rw------- 1 root root 11640832 Aug 30 08:46 backup00.ocr
-rw------- 1 root root 11640832 Aug 30 04:46 backup01.ocr
-rw------- 1 root root 11640832 Aug 30 00:46 backup02.ocr
-rw------- 1 root root 11640832 Aug 30 00:46 day_.ocr
-rw------- 1 root root 11640832 Aug 29 00:46 day.ocr
-rw------- 1 root root 11640832 Aug 26 00:45 week_.ocr
-rw------- 1 root root 11640832 Aug 19 00:44 week.ocr
This is a known bug: bug 9446443 is fixed in 11.2.0.2 and 12.1.
It is recommended to apply the patch to fix the issue, but if the patch is unavailable, the workaround is to change the ownership and permissions of all 7 automatic backup files manually. The OCR backups should be owned by root, but depending on the platform the group may or may not be root. You can check any randomly named backup file to identify what ownership and permissions they should have, as in the example below:
-rw------- 1 root root 7143424 Aug 30 09:40 38455890.ocr
With this, please change all 7 automatic backup files to be owned by root:root with permission "-rw-------"
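The manual workaround can be sketched as a shell function over the seven automatic backup names shown above. The function name `fix_ocr_backup_owner`, its arguments and the `DRY_RUN` switch are assumptions made for this sketch, not part of any Oracle tooling. By default it only prints the commands so they can be reviewed before being run as root.

```shell
#!/bin/sh
# Apply (or, by default, just print) the ownership/permission workaround for the
# seven automatic OCR backup files. Randomly named backups are left untouched.
#   $1  backup directory (hypothetical argument, e.g. the cdata/<cluster> directory)
#   $2  owner:group to set, e.g. root:root on Linux; check a randomly named
#       backup file first to confirm the right value for the platform
# Set DRY_RUN=0 to really run chown/chmod (requires root).
fix_ocr_backup_owner() {
    backup_dir=$1
    owner_group=$2
    for name in backup00.ocr backup01.ocr backup02.ocr \
                day.ocr day_.ocr week.ocr week_.ocr; do
        f=$backup_dir/$name
        [ -f "$f" ] || continue          # not every file exists on a young cluster
        if [ "${DRY_RUN:-1}" = 1 ]; then
            printf 'chown %s %s && chmod 600 %s\n' "$owner_group" "$f" "$f"
        else
            # mode 600 corresponds to "-rw-------"
            chown "$owner_group" "$f" && chmod 600 "$f"
        fi
    done
}
```

Running `fix_ocr_backup_owner /path/to/cdata/cluster root:root` prints one line per existing backup file; once the output looks right, the same call with `DRY_RUN=0` performs the change.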
Following the document, and taking the situation in my own environment into account, I checked the corresponding CRS log:
2016-03-16 06:24:59.079: [UiServer][12081]{1:19564:21073} Done for ctx=11191c2f0
2016-03-16 06:25:54.968: [ OCRRAW][3599]th_delete_backupfile: Failed to delete the backup file [
/grid/product/11.2.0/gridhome_1/cdata/c4bidb-cluster/backup02.ocr] Retval:[-2]
2016-03-16 06:25:54.968: [ OCRSRV][3599]th_delete_backupfile: Failed to delete the backup file:[backup02.ocr] Location:[
/grid/product/11.2.0/gridhome_1/cdata/c4bidb-cluster]
2016-03-16 06:25:55.026: [ OCRRAW][3599]proprbkp_rename: Failed to rename the backup file [
/grid/product/11.2.0/gridhome_1/cdata/c4bidb-cluster/backup01.ocr] Retval:[1]
2016-03-16 06:25:55.026: [ OCRSRV][3599]th_rename_backupfile: Failed to rename the backup file:[backup01.ocr] Location:[
/grid/product/11.2.0/gridhome_1/cdata/c4bidb-cluster]. Retval:[49]
2016-03-16 06:25:55.030: [ OCRRAW][3599]proprbkp_rename: Failed to rename the backup file [
/grid/product/11.2.0/gridhome_1/cdata/c4bidb-cluster/backup00.ocr] Retval:[1]
2016-03-16 06:25:55.030: [ OCRSRV][3599]th_rename_backupfile: Failed to rename the backup file:[backup00.ocr] Location:[
/grid/product/11.2.0/gridhome_1/cdata/c4bidb-cluster]. Retval:[49]
2016-03-16 06:25:55.033: [ OCRRAW][3599]proprbkp_rename: Failed to rename the backup file [
/grid/product/11.2.0/gridhome_1/cdata/c4bidb-cluster/16654495.ocr] Retval:[1]
2016-03-16 06:25:55.033: [ OCRSRV][3599]th_rename_backupfile: Failed to rename the backup file:[16654495.ocr] Location:[
/grid/product/11.2.0/gridhome_1/cdata/c4bidb-cluster]. Retval:[49]
2016-03-16 06:25:55.036: [ OCRSRV][3599]th_manipulate_backups: Failed to rename the temporary backup file [16654495.ocr].
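According to the log, the OCR automatic backup has to delete the old file and create the new one, but the CRS operations failed, so the backup was left under a newly generated temporary file name instead.
From the output above, the problem is clearly caused by file ownership. It is not the bug mentioned in this post, but purely a permission problem.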
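The failed delete and rename messages in the log excerpt can be counted with a short grep wrapper, which makes it easy to tell whether a host is still affected after a fix. This is a sketch only: `count_backup_failures` is a hypothetical helper, and the log location is an assumption (on 11.2 the CRSD log is usually under `$GRID_HOME/log/<hostname>/crsd/`, but verify this on your system).

```shell
#!/bin/sh
# Count log lines that report a failed delete or rename of an OCR backup file.
# The pattern is taken from the messages in the excerpt above:
#   "Failed to delete the backup file", "Failed to rename the backup file",
#   "Failed to rename the temporary backup file"
#   $1  path to a CRSD log file (hypothetical argument)
count_backup_failures() {
    log_file=$1
    grep -c -E 'Failed to (delete|rename) the (temporary )?backup file' "$log_file"
}
```

Note that each failure is logged by both the OCRRAW and OCRSRV components, so the count is roughly twice the number of failed file operations. A count that stops increasing across 4-hour backup cycles suggests the rotation is working again.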
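The fix was to change the ownership of the default-named backup files to root:system and to manually delete the number{n}.ocr files. After that, the backup that runs every 4 hours completed normally and the cluster state was normal.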
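After changing the ownership, it is worth confirming that no backup file still has the wrong owner or group. The sketch below parses `ls -ld` output for that purpose; `report_owner_mismatch` and its arguments are hypothetical names, and root:system is simply the value that applied to this (non-Linux) host.

```shell
#!/bin/sh
# Print "owner:group path" for every *.ocr file whose owner or group differs
# from the expected values. Prints nothing when everything matches.
#   $1  backup directory (hypothetical argument)
#   $2  expected owner, e.g. root
#   $3  expected group, e.g. system on this host, root on Linux
report_owner_mismatch() {
    backup_dir=$1
    want_owner=$2
    want_group=$3
    for f in "$backup_dir"/*.ocr; do
        [ -f "$f" ] || continue
        # ls -l columns: mode, link count, owner, group, size, ...
        owner_group=$(ls -ld "$f" | awk '{print $3 ":" $4}')
        if [ "$owner_group" != "$want_owner:$want_group" ]; then
            printf '%s %s\n' "$owner_group" "$f"
        fi
    done
}
```

An empty result from `report_owner_mismatch /path/to/cdata/cluster root system` means every backup file has the expected owner and group.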
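The root cause of this problem was an operator error: the installation was supposed to be done on a new machine, but the commands were run in a session connected to a host that was already in service,
for example chown -R grid:oinstall /u01/app and chmod 755 /u01/app.
After that, CRS ran into problems. Some remediation got CRS working again, but a number of other directories were never corrected, which left a latent risk.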

