Tuesday, 16 October 2012

Can restoring a SQL database bring down the entire virtual infrastructure?



Can restoring a SQL database bring down the entire virtual infrastructure?

Issue
Few days ago, we ran a simple database restore as we were in the process of migrating a system. The restore should have completed relatively quickly but we were waiting for an hour and we knew something was not right. Meanwhile we started receiving calls from here, there and everywhere asking if there is anything wrong with the servers.

Diagnosis
We started having a look at our Vsphere virtual infrastructure as 90% of our estate is virtualised and straight away noticed that there are read write errors on some of our virtual servers. Anyone who has worked on VMware for few years would think ‘storage’ almost instantly. We use HP EVA 6400 and all our storage is presented from a single disk group; we have about 140 disks in the group, all FCAL 600GB.

Action
We have had couple of issues with our Storage performance in the past but nothing major but still believe there are some issues with the storage. We decided to create a new disk group and present the storage to the SQL Clustered instance through this disk group and ran the restore to the LUN presented from this disk group. This time restore completed in less than 10 minutes without any issues at all.

Unfortunately the inbuilt monitoring capabilities of the SAN are not great, so we decided to send the logs to the SAN Vendor so that we could find out, what went wrong? And if we have too much running on the 140 disks. Unfortunately the performance counters had to be reset before capturing the logs, which wouldn’t have been very useful to us, but we sent the logs anyway to see if there are any other issues. The results were clear and the engineers could not find a lot wrong with the SAN.

Lessons learnt

  • Don’t put all your eggs in one basket - Plan and Test

  • Importance of having monitoring systems so that you can backtrack!

  • If the inbuilt monitoring capabilities of the product are not great, there is a possibility of integrating third party monitoring tools (there are a lot of them)

  •  If you have the luxury of setting up a test environment and do some testing – DO IT!!

  • Sometimes even the experts don’t have the right answer so do your research (I work as a contractor and see this again and again), ask your connections, and ask questions on the relevant forums.


Monday, 15 October 2012

ICT Hangout First Blog



OK ! Here is my first blog then.

And as it is my first blog, please excuse all the formatting, grammar, etc. etc.

Just a bit of background:  I have been working in IT since 2005. I work as a freelance Senior IT contractor and have worked for SME’s, Pubic Sector, Private Sector, a University, a Global Automotive company. I get a kick out of if someone can do better in their life or work, like, either a help-desk technician making a jump to the next level with my advice and doubling their salary from 20K to 40K or an IT Director being able to save a million pound and make the right decision with the information and facts that I provide him with individually or as a part of the team.

I come across various issues and shortcomings in the way whole IT is run, like, the way junior IT staff is trained or well not trained, limited information that organizations have to make decisions to implement their key systems, some global software companies not being able to help implementing their own products, conflict between the server and development team etc etc. and the list goes on.

What I aim to do at this blog is (which I have decided to call ICT hangout), I aim this to be a useful resource for students who are studying IT, so that they can choose the right career path. A good resource for people already in the IT industry so that they can make the right decisions, a good resource for IT directors so that they can make better decisions for the team, organization and the individuals. As I am a contractor and move from one to company to the other quite frequency (well after a year or two) I come across a lot of people who really know their stuff very well and I would certainly be convincing some of them to write blogs on here for the betterment of the IT community as a whole !

I anyone with the right experience and ideas would like to write on this blog please get in touch through the blog with me.

See you all with the next blog soon!!