July | 2011 | SQL Blog

Get-WmiObject -ComputerName -Class Win32_Volume -Filter ‘DriveType = 3’ | select name,capacity,freespace

Results from one of my SQL Servers

name capacity freespace
—- ——– ———
Q:\ 508248064 464125952
T:\ 508248064 472387584
N:\Default\SnapInfo\ 28858826752 26194558976
N:\Default\SysDB\ 21340921856 18748485632
M:\ 491794432 460210176
M:\Default\MsSqlData\ 53559341056 35197067264
M:\Default\MsSqlLog\ 32617779200 27183591424
M:\Default\Sysdb\ 21340921856 18669568000
M:\Default\SnapInfo\ 28858826752 27821281280
N:\ 286162944 262578176
N:\Default\MSSqlData1\ 107253968896 75615485952
N:\Default\MSSqlData2\ 107253968896 77887352832
N:\Default\MSSqlData3\ 107253968896 75426820096
N:\Default\MSSqlLog1\ 16078700544 12808699904
N:\Default\MSSqlLog2\ 21340921856 8436752384
N:\Default\MSSqlLog3\ 16078700544 11557416960
C:\ 104751689728 88810995712
D:\ 194580049920 84245086208

Cisco C210 M2 problems with Hardware Interrupts

We were seeing the above SQL Server alerts with increasing frequency… a week ago we’d get 1 or 2 a day, and yesterday we were seeing spikes of up to 10 an hour.
Error text is
ERROR: 18056, Severity: 20, State: 29… OR “The client was unable to reuse a session with SPID 70, which had been reset for connection pooling. The failure ID is 29. This error may have been caused by an earlier operation failing. Check the error logs for failed operations immediately before this error message.”
CPU status in PerfMon showed nothing out of the ordinary. CPU stats in Resource Monitor showed an interesting cascade type pattern where all the CPU’s (24 on this machine) would spike to 80% then slowly fade back to 0. CPU0 on Node 1 however showed a more steady spike to 80% then back to 0% pattern.
In the interests of finding out what was on CPU0 on Node 1, I downloaded PE (Process Explorer). PE showed me nothing that could be tied to a single core, but I did notice that the Process Interrupts CPU Time was about 2/3rds greater than the System Idle Process.
I couldn’t glean any additional information from PE, so I downloaded XPERF, a Microsoft tool to dig into kernel performance. There I could see that CPU 12 (CPU 0 on Node 1) displayed a regular spike to 100% and back to 0%.
Xperf –on Diag –stackwalk PROFILE
Let that run for a while to capture the needed data
Xperf –d foo.etl to stop and merge the trace results into file foo.etl
Xperf foo.etl to see the results.
A stack walk showed that NDIS.sys was the leading cause of the interrupts. That told me that I should check for new network drivers. I installed the latest Cisco supplied Intel Drivers…
Below is a before and after of the DPC CPU Usage. The differences are night and day.
BEFORE

AFTER

Now, let’s see if we get any more nasty ERROR: 18056, Severity: 20, State: 29 errors logged in SQL Server. Followed in the logs with “The client was unable to reuse a session with SPID 70, which had been reset for connection pooling. The failure ID is 29. This error may have been caused by an earlier operation failing. Check the error logs for failed operations immediately before this error message.”

XPERF can be installed as part of the Windows SDK packaging. Make sure you select the Developer Tools, and then you can find the msi’s for the xperf under the installation directory.

SQL Blog

Adventures in being a SQL Server DBA

Monthly Archives: July 2011

Powershell to show disk space including mountpoints

XPERF is a great Windows diagnostics tool!