Quantcast
Channel: THWACK: Popular Discussions - Server & Application Monitor
Viewing all 3454 articles
Browse latest View live

URL/Http Monitoring

$
0
0

Hi There,

Below is what I intend to achieve....can you let me whether this can be done.

Please execute one HTTP check that hits the URL http://xx.xxxx.xxxxservlet/StatusServlet  (give it a timeout of 30 seconds, with a 2nd knock if possible just to be sure). This check should can occur every 15 minutes.

If the above ping check succeeds, but If this URL is unreachable send an email to XXX@xxx.com which notifies me


- If the url is reachable either of the following 2 strings of text in the response indicates things are OK and no notification is necessary
<overall_status>ok
<overall_status>warn

- If the url is reachable, the following string indicates a failure condition that I need to be notified about
<overall_status>fail
    

We are using Orion Network Performance Monitor 9.1 SP5. I want to do this check without assigning a node......however this is the only way it seems to be done through HTTP Monitor template.


WMI on Windows Server 2012

$
0
0

We are trying to add a Windows 2012 Hyper-V server to Orion but are having trouble getting WMI configured so that Orion can poll it.  Has anybody else had problems with this or have any suggestions on how to get it working properly?

 

Thanks in advance for any suggestions!

Monitoring a linux process via SNMP or script

$
0
0

I'm trying to monitor the CPU and Memory utilization of a process on a linux server. When I use the Process Monitor -SNMP template, I can browse the list of processes in a very generic fashion, but it doesn't give me the granularity I want. For instance, on one of my boxes I have 3 'java' processes running, but I"m only interested in the performance of one of them.  I can use the template to add the 'java' process, but when I view the application details I have to know the PID of the individual processes to know which one is associated with the app of interest.

 

So, I thought I might look into writing a script that finds the PID of the java process I'm interested in and prints out the cpu and memory information.  If I do that though, how does APM get that information and store it? Or is that even possible?  When I looked over the section on linux script monitoring in the Administrators Guide, it appears to function very similar to Nagios, where your script is expected to return a small range of status codes, followed by a message, but performance data isn't stored. Is my understanding correct on that?

Thanks in advance,

 

--Brandon

URL/Http Monitoring

$
0
0

Hi There,

Below is what I intend to achieve....can you let me whether this can be done.

Please execute one HTTP check that hits the URL http://xx.xxxx.xxxxservlet/StatusServlet  (give it a timeout of 30 seconds, with a 2nd knock if possible just to be sure). This check should can occur every 15 minutes.

If the above ping check succeeds, but If this URL is unreachable send an email to XXX@xxx.com which notifies me


- If the url is reachable either of the following 2 strings of text in the response indicates things are OK and no notification is necessary
<overall_status>ok
<overall_status>warn

- If the url is reachable, the following string indicates a failure condition that I need to be notified about
<overall_status>fail
    

We are using Orion Network Performance Monitor 9.1 SP5. I want to do this check without assigning a node......however this is the only way it seems to be done through HTTP Monitor template.

HP vs HPE - Warranty Status Incorrect

$
0
0

We are finding that our warranty status information on our HP server hardware is no longer correct.  When you click the link to view the details, you are directed to the HP site which has incorrect information:  Check your warranty status - HP Support Center  - HP

 

However if you use the new link for the HP Enterprise site the information is correct:  Check your warranty status - HP Support Center  - HPE

 

Is there a way we can edit the URL for the API call so that it will point to HPE?  Is this already a known issue that is being corrected?

Errors in Event Log from servers with > 32 logical CPUs

$
0
0

I have been noticing a lot of EventID 2006 entries in the Application Event Logs of our 40 core Dell R910 servers.  These machines run Windows 2008 R2 Enterprise SP1.

 

"Unable to read Server Queue performance data from the Server service. The first four bytes (DWORD) of the Data section contains the status code, the second four bytes contains the IOSB.Status and the next four bytes contains the IOSB.Information."

 

I believe these are coming from the Orion polling engine, as it seems to be a 32-bit process.  This KB article from Microsoft seems to explain what I'm dealing with: 32-bit application cannot query performance "Server Work Queues" counters on Windows Server 2008 R2-based computer that has more than 32 processors

 

My guess would be that Orion encounters this issue when doing the individual CPU core polling for the machine view in Managed Nodes.  For these systems, SAM only finds data for the first 32 cores.  I thought at first that it was some limitation of the chart type and forgot about it a long time ago, but now I'm not so sure.

 

Is this a known issue with Orion SAM and many-core machines? 

Has Anybody Ever Gotten WMI to Work without Admin credentials

$
0
0

Just a status check...  Last time I went down this road it seemed to be the consensus that you could not get remote WMI queries to work without admin privileges on the remote server.   We have everything working and do not need to "fix" anything, I am just curious if anyone has ever gotten WMI to work without admin rights?

URL/Http Monitoring

$
0
0

Hi There,

Below is what I intend to achieve....can you let me whether this can be done.

Please execute one HTTP check that hits the URL http://xx.xxxx.xxxxservlet/StatusServlet  (give it a timeout of 30 seconds, with a 2nd knock if possible just to be sure). This check should can occur every 15 minutes.

If the above ping check succeeds, but If this URL is unreachable send an email to XXX@xxx.com which notifies me


- If the url is reachable either of the following 2 strings of text in the response indicates things are OK and no notification is necessary
<overall_status>ok
<overall_status>warn

- If the url is reachable, the following string indicates a failure condition that I need to be notified about
<overall_status>fail
    

We are using Orion Network Performance Monitor 9.1 SP5. I want to do this check without assigning a node......however this is the only way it seems to be done through HTTP Monitor template.


VCSA and PSC CPU/Memory monitoring

$
0
0

We have set up the nodes for our VCSA (vCenter Server Appliance) and PSC (vCenter Platform Services Controller) in SolarWinds for monitoring.  However, the average CPU and memory statistics for both show 100%.  If you look at the resource consumption in vSphere neither the CPU or memory statistics are anywhere close to that.  According to vSphere performance monitoring our VCSA CPU average is 19.3% and the memory is 11% and the PSC CPU average is 2.8% and the memory is 13.7%.

We have rebooted both guests, removed and added back the resource from the node, and unmanaged/remanaged each node.  In SolarWinds the monitors for both guests and their resources are showing 100%.

Has anyone experienced this before?  If so, what did you do to get it fixed?

 

Current environment

VCSA and PSC are VMware Linux OS

ESX hosts are at 6.0.0, build 4600944

Orion Platform 2016.1.5300, SAM 6.2.4

Monitoring non default event logs

$
0
0

Hi,

 

How I can monitoring event logs like:

Custom Log to Monitor: Microsoft-Windows-TerminalServices-SessionBroker/Admin

Log Source: TerminalServices-SessionBroker

802

 

and

 

Custom Log to Monitor: Microsoft-Windows-TerminalServices-SessionBroker/Operational

Log Source: TerminalServices-SessionBroker

2055

 

 

Events are available, but SAM not found them.

% Processor Time vs CPU Utilization

$
0
0

What is the difference?  I have a Windows 2008 R2 x64 virtual machine (VMware).  Orion NPM says that the CPU has not gone over 25% in the past day.  Orion APM, Windows 2003-2008 Template, says that % Processor Time has been riding about 75-80 % constant in the past day.  There are no alarms in NPM, but APM is in a near constant alarm state.

The virtual server is a vSphere 5, vCenter 5 server - 4GB RAM and 2 x CPU.

Any help is appreciated.

- Dave Claussen

Changing Polling Engine for Agent Node

$
0
0

When I try and change the polling engine of an agent node by editing the node settings and pointing it at a new polling engine it doesn't seem to work.  Orion seems to acknowledge that the node is on the new polling engine; however, the node then shows as down.  I had to go edit the agent settings on the node itself and in there it still showed it pointed at the old polling engine, I had to update this to reflect the new polling engine also before the change was working properly.

 

It seems like I should be able to do this without having to log into each node, is this not the case?

 

Thanks in advance for any suggestions!

SolarWinds SAM - Asset Inventory option missing - Red Hat 7.4

$
0
0

Noticed when I install the SolarWinds Agent running RHEL 7.4, the Asset Inventory option is missing when I go to select resources from the List Resources, hence the Asset Inventory tab is empty.  Has anybody else observed anything similar?  From what I can see:

  • The agent checks in and can see basic info (RAM, # of CPUs, etc.)
  • I see all the partitions on the List Resources screen.
  • I see all the NICs on that screen.

 

I also suspect I may have forgotten to set something on the SAM sever, but I know I installed the agent on a few other boxes that were not 7.4 and everything checked in fine.  Are there any hotfixes available that anybody is aware of?

PowerShell Remoting by FQDN instead of IP

$
0
0

I'm trying to use the  Windows PowerShell Monitor component (actually as part of the "SolarWinds Web Performance Monitor (WPM) Player" template) in the Remote Host Execution Mode.  The component attempts to connect using an IP instead of an FQDN, so this error is generated:

 

PowerShell script error. Connecting to remote server 172.10.10.31 failed with the following error message : The WinRM client cannot process the request. Default authentication may be used with an IP address under the following conditions: the transport is HTTPS or the destination is in the TrustedHosts list, and explicit credentials are provided. Use winrm.cmd to configure TrustedHosts. Note that computers in the TrustedHosts list might not be authenticated. For more information on how to set TrustedHosts run the following command: winrm help config. For more information, see the about_Remote_Troubleshooting Help topic.

 

If I specify HTTPS instead, then this error is returned:

 

PowerShell script error. Connecting to remote server 172.10.10.31 failed with the following error message : The server certificate on the destination computer (172.10.10.31:5986) has the following errors: The SSL certificate contains a common name (CN) that does not match the hostname.

 

Is there some tricky way to tell the component monitor to access via some Fully Qualified Domain Name instead, or is this a feature request that needs to be made?  I realize that could go about modifying the TrustedHosts setting on all of my different pollers, but we'd prefer the ability to not to and to be able to use HTTPS as it's available for all of our connections.

SQL Server User Experience Monitor (SUEM) - Custom SQL Query

$
0
0

Hello Thwack!

I'm trying to run a custom SQL query against a database and get back a set of information to be displayed.  I've been attempting to use the SQL SUEM, but I'm running into an issue.

When I run the query, I only seem to get one cell of information back instead of the information from the entire scripts.

 

Here is the expected outcome:

SQL Script Expected Outcome

 

After inputting the (adjusted) code into SUEM, here is the result I get:

SQL SUEM Test

 

As you can see, I'm only getting the first data cell instead of the information from the entire scripts.

I've read through a couple other posts about this and I've added a "SELECT 0" to my script to get it to even run, but I'm trying to figure out how to get the entire script to run/output properly.

Here is the adjusted and sanitized scripts below:

 

use msdb

SELECT 0, j.name JobName,h.step_name StepName,

     CONVERT(CHAR(10), CAST(STR(h.run_date,8, 0) AS dateTIME), 111) RunDate,

     STUFF(STUFF(RIGHT('000000' + CAST ( h.run_time AS VARCHAR(6 ) ) ,6),5,0,':'),3,0,':') RunTime,

     STUFF(STUFF(REPLACE(STR(run_duration, 6, 0), ' ', '0'), 3, 0, ':'), 6, 0, ':') AS run_duration,

     case h.run_status when 0 then 'failed'

     when 1 then 'Succeded'

     when 2 then 'Retry'

     when 3 then 'Cancelled'

     when 4 then 'In Progress'

     else 'unknown'

     end as ExecutionStatus

     FROM [SERVERNAME].msdb.dbo.sysjobhistory h left join msdb.dbo.sysjobs j

     ON j.job_id = h.job_id

     where name in ('KNX Processor: F6P430_F6P_ALL_XR_BACKUP'

     ,'KNX Processor: F6P430_F6P_PREMRP_MD'

     ,'KNX Processor: F6P430_F6P_PREMRP_TD1_INIT'

     ,'KNX Processor: F6P430_F6P_PREMRP_TD2_INIT'

     ,'KNX Processor: F6P430_F6P_PREMRP_TD3_INIT'

     ,'KNX Processor: F6P430_F6P_2ND_RUN_NC'

     ,'KNX Processor: F6P430_F6_XR_OPT_BACKUP')

     and CAST(STR(h.run_date, 8, 0) AS DATETIME) + CAST(STUFF(STUFF(RIGHT('000000' + CAST (h.run_time AS VARCHAR(6)), 6), 5, 0, ':'), 3, 0, ':') AS DATETIME) > getdate()-10

     and h.step_name = '(Job outcome)'

     and h.run_status <> 4

 

 

Any help would be greatly appreciated!


HP vs HPE - Warranty Status Incorrect

$
0
0

We are finding that our warranty status information on our HP server hardware is no longer correct.  When you click the link to view the details, you are directed to the HP site which has incorrect information:  Check your warranty status - HP Support Center  - HP

 

However if you use the new link for the HP Enterprise site the information is correct:  Check your warranty status - HP Support Center  - HPE

 

Is there a way we can edit the URL for the API call so that it will point to HPE?  Is this already a known issue that is being corrected?

WMI on Windows Server 2012

$
0
0

We are trying to add a Windows 2012 Hyper-V server to Orion but are having trouble getting WMI configured so that Orion can poll it.  Has anybody else had problems with this or have any suggestions on how to get it working properly?

 

Thanks in advance for any suggestions!

VCSA and PSC CPU/Memory monitoring

$
0
0

We have set up the nodes for our VCSA (vCenter Server Appliance) and PSC (vCenter Platform Services Controller) in SolarWinds for monitoring.  However, the average CPU and memory statistics for both show 100%.  If you look at the resource consumption in vSphere neither the CPU or memory statistics are anywhere close to that.  According to vSphere performance monitoring our VCSA CPU average is 19.3% and the memory is 11% and the PSC CPU average is 2.8% and the memory is 13.7%.

We have rebooted both guests, removed and added back the resource from the node, and unmanaged/remanaged each node.  In SolarWinds the monitors for both guests and their resources are showing 100%.

Has anyone experienced this before?  If so, what did you do to get it fixed?

 

Current environment

VCSA and PSC are VMware Linux OS

ESX hosts are at 6.0.0, build 4600944

Orion Platform 2016.1.5300, SAM 6.2.4

% Processor Time vs CPU Utilization

$
0
0

What is the difference?  I have a Windows 2008 R2 x64 virtual machine (VMware).  Orion NPM says that the CPU has not gone over 25% in the past day.  Orion APM, Windows 2003-2008 Template, says that % Processor Time has been riding about 75-80 % constant in the past day.  There are no alarms in NPM, but APM is in a near constant alarm state.

The virtual server is a vSphere 5, vCenter 5 server - 4GB RAM and 2 x CPU.

Any help is appreciated.

- Dave Claussen

PowerShell Remoting by FQDN instead of IP

$
0
0

I'm trying to use the  Windows PowerShell Monitor component (actually as part of the "SolarWinds Web Performance Monitor (WPM) Player" template) in the Remote Host Execution Mode.  The component attempts to connect using an IP instead of an FQDN, so this error is generated:

 

PowerShell script error. Connecting to remote server 172.10.10.31 failed with the following error message : The WinRM client cannot process the request. Default authentication may be used with an IP address under the following conditions: the transport is HTTPS or the destination is in the TrustedHosts list, and explicit credentials are provided. Use winrm.cmd to configure TrustedHosts. Note that computers in the TrustedHosts list might not be authenticated. For more information on how to set TrustedHosts run the following command: winrm help config. For more information, see the about_Remote_Troubleshooting Help topic.

 

If I specify HTTPS instead, then this error is returned:

 

PowerShell script error. Connecting to remote server 172.10.10.31 failed with the following error message : The server certificate on the destination computer (172.10.10.31:5986) has the following errors: The SSL certificate contains a common name (CN) that does not match the hostname.

 

Is there some tricky way to tell the component monitor to access via some Fully Qualified Domain Name instead, or is this a feature request that needs to be made?  I realize that could go about modifying the TrustedHosts setting on all of my different pollers, but we'd prefer the ability to not to and to be able to use HTTPS as it's available for all of our connections.

Viewing all 3454 articles
Browse latest View live


Latest Images

<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>