Quantcast
Channel: THWACK: Popular Discussions - Server & Application Monitor
Viewing all 3454 articles
Browse latest View live

Win32_PerfRawData_PerfProc_Process - Invalid Class

$
0
0

For much longer than I care to admit I was struggling to monitor any Windows services using WMI on one of my Windows 2008 Servers. Instead I received the dreaded "Invalid Class" error in the APM info window. This was later confirmed using WBEMTEST. When using WMI Explorer it was clear to see that the "Win32_PerfRawData_PerfProc_Process" class did not exist. 

I opened a case with Solarwinds support and was directed to their WMI support document and related online documentation. Unfortunately none of the information provided proved useful in this circumstance.

I spent a several hours googling around and was able to find several others with my issue, but no one had found a solution. With nowhere else to go, I spent the $250.00 and called Microsoft Support. It took a few days, and at least three Microsoft support engineers later that we finally got to the root of the problem. I knew from the very beginning that this was going to be something stupid, but I never realized just how obscure the problem would be, so I decided to save others my pain and post my resolution in hopes that others might find it useful in the future. 

We went into registry “HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\services\PerfProc\Performance” In the right hand pane we found that “Disable Performance Counters” was set to 1, which means it was disabled. We used “exctrlst.exe” tool, enabled ‘perfproc’ and rebooted. After Rebooting we were able to run the WMI query (SELECT * from Win32_PerfRawData_PerfProc_Process where NAME <> '_Total') using the WBEMTEST tool built into Windows. After verifying with WBEMTEST I confirmed that APM was properly polling my monitored services properly. 

I hope someone else finds this helpful.


Powershell Exit Code 1 = Get Output Failed

$
0
0

I having a tough time getting my script to correctly report the exit code to SAM 6.1.1. I have read The Basics of PowerShell (part 3) and I am using the required Message, Statistic and Exit codes. My script Telnets to a server (using a built telnet function, grabs a payload and verifies that its valid. I want 3 exit codes, 0 - Payload is confirmed (Up), 3 - Payload is invalid (Critical), 1 - Any error & failed to connect (Down). Exit codes 0 and 3 report correctly but 1 will not.

 

$error.clear()

$ErrorActionPreference= 'silentlycontinue'

#This builds a custom telnet function from http://community.spiceworks.com/scripts/show/1887-get-telnet-telnet-to-a-device-and-issue-commands

Function Get-Telnet

{  Param (

        [Parameter(ValueFromPipeline=$true)]

        [String[]]$Commands = @(""),

        [string]$RemoteHost = "",

        [string]$Port = "",

        [int]$WaitTime = 1000,

        [string]$OutputPath = ""

    )

    #Attach to the remote device, setup streaming requirements

    $Socket = New-Object System.Net.Sockets.TcpClient($RemoteHost, $Port)

    If ($Socket)

    {  $Stream = $Socket.GetStream()

        $Writer = New-Object System.IO.StreamWriter($Stream)

        $Buffer = New-Object System.Byte[] 1024

        $Encoding = New-Object System.Text.AsciiEncoding

 

        #Now start issuing the commands

        ForEach ($Command in $Commands)

        {  $Writer.WriteLine($Command)

            $Writer.Flush()

            Start-Sleep -Milliseconds $WaitTime

        }

        #All commands issued, but since the last command is usually going to be

        #the longest let's wait a little longer for it to finish

        Start-Sleep -Milliseconds ($WaitTime * 4)

        $Result = ""

        #Save all the results

        While($Stream.DataAvailable)

        {  $Read = $Stream.Read($Buffer, 0, 1024)

            $Result += ($Encoding.GetString($Buffer, 0, $Read))

        }

    }

    Else 

    {  $Result = "Unable to connect to host: $($RemoteHost):$Port"

    }

    #Done, now save the results to a file

    $Result | Out-File $OutputPath

}

#This clears the content of the output file so its not mistakenly read from previous telnet test

Clear-Content "F:\scripts\EDITelnetOutput.txt"

 

#This will telnet to the remote server and issue the command to check the payload. The output must be written to a file because get-telnet cmdlet sucks.

Get-Telnet -RemoteHost ${IP} -Port "3575" -Commands "Blah" -OutputPath "F:\scripts\TelnetOutput.txt"

$TelnetOutput = get-content "F:\scripts\TelnetOutput.txt"

 

#This looks for "Keyword" in the payload. If found then status is 1 and application is considered functional.

IF ($error)

{

Write-Host "Statistic: 1"

Write-Host "Message: $($error[0])"

    Exit 1

  }

IF ($TelnetOutput -match "Keyword") {

        Write-Host "Statistic: 0"

        Write-Host "Message: Valid Payload"

      Exit 0

    }

    ELSE {

        Write-Host "Statistic: 3"

        Write-Host "Message: Invalid Payload"

        Exit 3

          }


Tests for Exit 0 and 3 are successful with proper output. Test for exit 1 is done by turning off the application. The port is closed and the telnet connection will fail. Below is the error that I get for Exit 1 "Get Output Failed". It should result in Exit 1 - Component is Down.

GetOutPut.JPG

 

I found that if I changed "Exit 1" any other number value it will run successfully. Something is wrong with Exit 1.

net-snmp on linux target is working, but fails snmp poling test

$
0
0

I'm trying to get my feet wet using SAM and a 30-day trial here, so please excuse the newbie question.

 

In trying to monitor a Centos 6.4 target machine using a SAM 5.5.0 server, I first installed net-snmp 5.5 on the target and used snmpconf to get me going.  I didn't do much config for performance, just mainly open up access.  IPTables are turned off.

 

I can successfully do "snmpwalk -v2c -cpublic <target ip address> system" from another (Solaris) machine.

 

But when I change the poling method for the target machine from "ICMP " to "SNMP and ICMP" and use the "test" button on that page, it seems to time-out and say "Test Failed".

 

Any log I can look through to get a bit better idea of what's failing?  And BTW, all 3 sysems are in the same network.

 

Thanks for any pointers....Lyle

 

A Progress Update: 

I found the c:\"Program Files (x86)"\SolarWinds\Orion\SnmpWalk.exe utility on the SAM server.

Maybe there's an easy way to run that utility from within SAM, but I ran it by hand, and it generated the output of the snmpwalk just fine.

So there's some other reason for the "Test Failed" message.

HP vs HPE - Warranty Status Incorrect

$
0
0

We are finding that our warranty status information on our HP server hardware is no longer correct.  When you click the link to view the details, you are directed to the HP site which has incorrect information:  Check your warranty status - HP Support Center  - HP

 

However if you use the new link for the HP Enterprise site the information is correct:  Check your warranty status - HP Support Center  - HPE

 

Is there a way we can edit the URL for the API call so that it will point to HPE?  Is this already a known issue that is being corrected?

Check All Services Which Set To Automatic Mode Are Running

$
0
0

Hi All,

 

This is my first post   OK, straight to the point:

 

Attached template is based on VBScript and will check status of all services on the Windows based machines. If those services which set to automatic are not running script will increase "Statistic" counter. I have set threshold to CRITICAL = "1", as there is no real reason why AUTOMATIC service should not be running (there are some exceptions of course). The template will also report back names (Service Display Name) of those services which are set to automatic and not running. This is handled by "Massage" variable and will be visible in SAM, so, you can see in SolarWinds SAM all you need straight away.

 

You can also define exceptions. Well, this can be improved to make it bit more easier to manage. At the moment you will need to modify VBScript itself. Just open it and you will see straight away the line of code which you need to copy-pase. Add additional exception if you need to.

 

Thank you very much for using it,

 

Best of luck,

Alex

VCSA and PSC CPU/Memory monitoring

$
0
0

We have set up the nodes for our VCSA (vCenter Server Appliance) and PSC (vCenter Platform Services Controller) in SolarWinds for monitoring.  However, the average CPU and memory statistics for both show 100%.  If you look at the resource consumption in vSphere neither the CPU or memory statistics are anywhere close to that.  According to vSphere performance monitoring our VCSA CPU average is 19.3% and the memory is 11% and the PSC CPU average is 2.8% and the memory is 13.7%.

We have rebooted both guests, removed and added back the resource from the node, and unmanaged/remanaged each node.  In SolarWinds the monitors for both guests and their resources are showing 100%.

Has anyone experienced this before?  If so, what did you do to get it fixed?

 

Current environment

VCSA and PSC are VMware Linux OS

ESX hosts are at 6.0.0, build 4600944

Orion Platform 2016.1.5300, SAM 6.2.4

WMI on Windows Server 2012

$
0
0

We are trying to add a Windows 2012 Hyper-V server to Orion but are having trouble getting WMI configured so that Orion can poll it.  Has anybody else had problems with this or have any suggestions on how to get it working properly?

 

Thanks in advance for any suggestions!

Custom Properties

$
0
0

I have a request from a client that is asking how they can manage Nodes during Maintenance Windows and eliminate Alerts for the duration period.

 

Would I create a Custom Property to group the batch of Nodes together and then assign the Custom Property Group to the Trigger Condition of an Alert with An exclusion set against the Time & Day?

 

What is the best practise of doing this as I'm guessing this is fairly common in view of Maintenance Upgrades/WSUS, and such like.

 

My example scenario: I have Customer 1 who would like to run Windows Updates against all of their Server 2008 Nodes.

 

 

Another question; I've been tasked to review the customers entire environment and their grouping doesn't seem to be logical enough for assigning the correct Alerts easily. I would like to better group their environment and have started at the Network Layer grouping together ALL Network Devices (Switches/Routers/Network Mgmt) via Custom Properties, and will then create Interface or other related Alerts and adding the Custom Property (Network Device Group) to the 'Trigger Condition' of the Alert. Is the a logical approach, or not?

 

With the above in mind I have sub divided their environment up into Network, Physical Device, Virtual Hosts, Core Services, and Applications Layers. I have then placed all of the devices, servers, services, systems, and key applications into these (Custom Property Groups), and will assing these Properties to the required groups.

 

In advance thank you for your advice and help.


PowerShell Remoting by FQDN instead of IP

$
0
0

I'm trying to use the  Windows PowerShell Monitor component (actually as part of the "SolarWinds Web Performance Monitor (WPM) Player" template) in the Remote Host Execution Mode.  The component attempts to connect using an IP instead of an FQDN, so this error is generated:

 

PowerShell script error. Connecting to remote server 172.10.10.31 failed with the following error message : The WinRM client cannot process the request. Default authentication may be used with an IP address under the following conditions: the transport is HTTPS or the destination is in the TrustedHosts list, and explicit credentials are provided. Use winrm.cmd to configure TrustedHosts. Note that computers in the TrustedHosts list might not be authenticated. For more information on how to set TrustedHosts run the following command: winrm help config. For more information, see the about_Remote_Troubleshooting Help topic.

 

If I specify HTTPS instead, then this error is returned:

 

PowerShell script error. Connecting to remote server 172.10.10.31 failed with the following error message : The server certificate on the destination computer (172.10.10.31:5986) has the following errors: The SSL certificate contains a common name (CN) that does not match the hostname.

 

Is there some tricky way to tell the component monitor to access via some Fully Qualified Domain Name instead, or is this a feature request that needs to be made?  I realize that could go about modifying the TrustedHosts setting on all of my different pollers, but we'd prefer the ability to not to and to be able to use HTTPS as it's available for all of our connections.

Nagios Migration

$
0
0

I have been asked to build with native SAM capabilities all the alerts Nagios currently runs for a few dozen servers. Has anyone else successfully pulled this off? We are apparently getting rid of Nagios entirely, so I'm told the Nagios Script Monitor is not an option. There seems to be a wide array of things Nagios is doing, some that seem capable of cloning in Solarwinds, though I haven't figured out how, and others I can't seem to find a native capability for. For instance, I think I can figure out latency and host flapping, maybe, but can't seem to find anything on socket timeout, whether an SSL connection can be made, or if a connection to a specific port is possible. There is more, I'm sure.

 

This is what I'm working with:

Orion Platform 2017.1.3 SP3

IPAM 4.5.1

NCM 7.6

CloudMonitoring 1.0.0

NPM 12.1

DPAIM 11.0.0

QoE 2.3

NTA 4.2.2

VIM 7.1.0

SAM 6.4.0

NetPath 1.1.0

 

If anyone has advice, I'd greatly appreciate it.

net-snmp on linux target is working, but fails snmp poling test

$
0
0

I'm trying to get my feet wet using SAM and a 30-day trial here, so please excuse the newbie question.

 

In trying to monitor a Centos 6.4 target machine using a SAM 5.5.0 server, I first installed net-snmp 5.5 on the target and used snmpconf to get me going.  I didn't do much config for performance, just mainly open up access.  IPTables are turned off.

 

I can successfully do "snmpwalk -v2c -cpublic <target ip address> system" from another (Solaris) machine.

 

But when I change the poling method for the target machine from "ICMP " to "SNMP and ICMP" and use the "test" button on that page, it seems to time-out and say "Test Failed".

 

Any log I can look through to get a bit better idea of what's failing?  And BTW, all 3 sysems are in the same network.

 

Thanks for any pointers....Lyle

 

A Progress Update: 

I found the c:\"Program Files (x86)"\SolarWinds\Orion\SnmpWalk.exe utility on the SAM server.

Maybe there's an easy way to run that utility from within SAM, but I ran it by hand, and it generated the output of the snmpwalk just fine.

So there's some other reason for the "Test Failed" message.

Agent Deployment Firewall Ports help needed.

$
0
0

Hello!

 

I am trying to deploy an agent to a server across our WAN.

 

Ports that I have opened are 135 and 443 to the server and 17778 back as per the documentation.

 

This keeps failing with "Credentials test failed. Path not found" 

 

Analysing the network we find that it is trying to connect with ports, 445, 139 and 137

 

Are ports ports, 445, 139 and 137 the ones I should be opening?

Are there any other ports I should be using?

VCSA and PSC CPU/Memory monitoring

$
0
0

We have set up the nodes for our VCSA (vCenter Server Appliance) and PSC (vCenter Platform Services Controller) in SolarWinds for monitoring.  However, the average CPU and memory statistics for both show 100%.  If you look at the resource consumption in vSphere neither the CPU or memory statistics are anywhere close to that.  According to vSphere performance monitoring our VCSA CPU average is 19.3% and the memory is 11% and the PSC CPU average is 2.8% and the memory is 13.7%.

We have rebooted both guests, removed and added back the resource from the node, and unmanaged/remanaged each node.  In SolarWinds the monitors for both guests and their resources are showing 100%.

Has anyone experienced this before?  If so, what did you do to get it fixed?

 

Current environment

VCSA and PSC are VMware Linux OS

ESX hosts are at 6.0.0, build 4600944

Orion Platform 2016.1.5300, SAM 6.2.4

How to monitor and alert on event id on windows server

$
0
0

I am new to the Solarwinds world but have several alerts setup in the Advanced Alert Manager in Orion. I am now trying to monitor an old fax server we have. I want to monitor the event logs for an event id and alert if less than two matches per hour are sent. However I do not know what option to select under "Type of Property to Monitor" under the Edit Alert section.

 

See attached.

 

Hope I am making sense and asking in the correct section. The manuals are rather large.

PowerShell Remoting by FQDN instead of IP

$
0
0

I'm trying to use the  Windows PowerShell Monitor component (actually as part of the "SolarWinds Web Performance Monitor (WPM) Player" template) in the Remote Host Execution Mode.  The component attempts to connect using an IP instead of an FQDN, so this error is generated:

 

PowerShell script error. Connecting to remote server 172.10.10.31 failed with the following error message : The WinRM client cannot process the request. Default authentication may be used with an IP address under the following conditions: the transport is HTTPS or the destination is in the TrustedHosts list, and explicit credentials are provided. Use winrm.cmd to configure TrustedHosts. Note that computers in the TrustedHosts list might not be authenticated. For more information on how to set TrustedHosts run the following command: winrm help config. For more information, see the about_Remote_Troubleshooting Help topic.

 

If I specify HTTPS instead, then this error is returned:

 

PowerShell script error. Connecting to remote server 172.10.10.31 failed with the following error message : The server certificate on the destination computer (172.10.10.31:5986) has the following errors: The SSL certificate contains a common name (CN) that does not match the hostname.

 

Is there some tricky way to tell the component monitor to access via some Fully Qualified Domain Name instead, or is this a feature request that needs to be made?  I realize that could go about modifying the TrustedHosts setting on all of my different pollers, but we'd prefer the ability to not to and to be able to use HTTPS as it's available for all of our connections.


Hardware Health - Disk & Array missing

$
0
0

New member here with an issue with Hardware Health items missing. I recently updated several hosts to ESXi 5.0 & 5.1 and all them are no longer showing Disk or Array in hardware health. This is one of the major items we are needing to monitor as the alerts to our helpdesk about degraded/failed drives are originated by Solarwinds. Below is a breakdown on the environment.

 

Dell PowerEdge 2950

ESXi 5.0 & 5.1

SUU 7.1.1 updates installed

Dell OpenManage 7.1 installed (on ESXi 5.0)

SNMP enabled (obviously working b/c other hardware health is showing)

 

As always, any tips or advice is greatly appreciated!x

Quick way to look through all alert trigger definitions for a specific custom property?

$
0
0

I am planning to go through and remove some old custom properties that are no longer in use.  Is there an easy way (SQL Query, etc.) that can be done to look through all of my alerts for the use of any given custom property in the trigger conditions?  This would save me from having to go manually look at every alert I have configured.

 

Thanks in advance for any suggestions!

Agent Deployment Firewall Ports help needed.

$
0
0

Hello!

 

I am trying to deploy an agent to a server across our WAN.

 

Ports that I have opened are 135 and 443 to the server and 17778 back as per the documentation.

 

This keeps failing with "Credentials test failed. Path not found" 

 

Analysing the network we find that it is trying to connect with ports, 445, 139 and 137

 

Are ports ports, 445, 139 and 137 the ones I should be opening?

Are there any other ports I should be using?

newbie question on alert resources

$
0
0

My company has been using SolarWinds for years but we've only ever concentrated on the generation of alerts, and the occasional one-off report to troubleshoot an issue.  I've been asked to start building out dashboards simple enough for even an executive to understand.  To that end, I'm trying to build views that focus on specific sets of devices.

 

The first thing I did was create a custom property for nodes called Environment.  It's a dropdown with values like Dev, UAT and Prod.  I then imported my Environment values from a .csv I'd put together. 

 

I'm now beginning to work on the view.  I started with a test view containing two resources:  All Dev Nodes (wsql:  Environment = 'Dev') and All Prod Nodes. 

 

Now I'd like to add a resource called All Active Prod Alerts, but the Alerts resource doesn't allow me to add WSQL.  If I click Edit the only things I can change are Title, Subtitle and Show Acknowledged.  I'm clearly missing something obvious here - could someone point me in the right direction to filter Active Alerts by the parent node's Environment field?

 

Many thanks!

Node Showing more applications assigned than really are.

$
0
0

Orion Platform 2016.2.100, NPM 12.0.1, SAM 6.3.0, DPA 10.2.0, VNQM 4.2.4, IVIM 2.1.2, NetPath 1.0.1, QoE 2.2.0, SRM 6.3.0, WPM 2.2.1, NTA 4.2.1

 

I have a node called "scipitmdb1".  In the "Node Details" page is shows it has 67 applications assigned to it:

 

 

This isn't correct. It's showing it has applications that are assigned to other nodes assigned this node.

I then went into the SAM settings and found the node in the "Assigned Application Monitors" page and it's showing correctly where it should only have 7 applications assigned.

 

 

Any ideas why the "Node Details" page all of sudden is showing the node as having all of these random applications?

Thank you.

Viewing all 3454 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>