We recently received a letter from one of our long standing customers, Jill Salo, Oracle DBA at Westpeak Global Advisors about her experience using Dbvisit due to a real life database failure.
Hi Arjen,
As I mentioned a few weeks ago, we lost our production database and had to failover to our standby using Dbvisit.
I thought you would appreciate some technical details. Feel free to share our experience if you feel it would be helpful to others.
OS version: Linux OEL5
Oracle version: 10.2.0.4 Standard Edition.
Storage: Lefthand Networks (now HP) iscsi SAN, mounted as ext3
Cause of crash: I suspected our mounted volume had a corruption as we were seeing a significant degradation in I/O performance. After many database checks, we decided to run Iometer to check if the disk speeds were below par.
I ran Iometer on the volume that held the database files.
We think Iometer may have hit some corruption because this caused us to lose the whole file partition. Even after rebooting, the system could not find that partition to mount. It had simply disappeared. The SAN partition was still there and available.
Using Dbvisit, we were able to switch over to the standby database in a few minutes. The switch was simple and with no errors. The only problems we encountered after that were network issues on our side.
1. We had tested with one subnet (subnet2) but as an oversight, computers on subnet3 could not see the database. This was rectified by some network configuration changes (by the network admin).
2. The firewall had the default timeout setting of 2 hours. We have some long running jobs that process mostly in Java with occasional Oracle calls. These jobs would fail with the “Connection reset by peer” error message. We set this timeout to the max of 12 hours which seemed to fix the problem.
When we first installed Dbvisit, it did not include the option to create a standby using the Dbvisit software. In a crisis situation, it was a very helpful option and once again it worked without errors. Using the menu I was able to recreate the standby by navigating a few menus. The final switch back to our production server worked seamlessly as well. It did take a couple of hours due to our large and numerous redo logs (write intensive database) but there were no problems.
Thanks again for a superb product. It keeps getting better and has once again saved the day!
Sincerely,
Jill Salo
Oracle DBA
Westpeak Global Advisors




 


Dbvisit