Has anyone had any recent issues with the new SolarWinds? I have done a migration from what was a relatively old but stable environment that just worked, to a new and shiny environment that is not behaving the way I expected.
Though, now that I think about it, there could be all kinds of reasons why things aren't working. Some of these could include:
- Using more features.
- Retrofitting it into an existing real-life environment.
- Other external effects such as OS, network speed, storage devices, etc.
So, let me explain from the beginning, do a compare and contrast, and then an analysis.
Introduction.
We had an old, stable environment that was running SolarWinds 10.5, and then I did an upgrade to 11.5. It worked for its purpose (kind of), but it wasn't how I would have liked it when I adopted it. I still don't like the state I adopted it in, but that's another story.
It operated as follows:
- Nodes were added to SolarWinds and then an alert was created for that node (or nodes), e.g. an alert for customer X.
- Nodes were added to SAM on a per-node basis.
- Pictures were used... I hate pictures. -.-
- Add-ons such as NCM and WPM were forgotten and left on separate servers with their own databases.
- Network data/information was left as is, to rot.
- Reporting was out of date.
- Alerts were ignored.
- Nodes were left or never added.
- Clusters and their resources were forgotten.
Since building the new environment, I have done, or want to do, the following:
- I now run a regular weekly scan on our environment to ensure things are not ignored/forgotten
- I have included all of our failover environments, their resources and their respective IP addresses.
- Reporting now has a process: rather than filling up on reports we don't need, I have a 30-day policy that deletes old reports.
- I wanted to start a process of having everything in a group and then working from that group, e.g. groups would be used for application monitoring, for dependencies, for network mapping, and to build up our entire environment.
- From there I would build the alerting based on logic and grouping of resources/nodes, making it easier to manage.
- I have a daily report to show me things that have broken over the last week/day.
- I am forcing people to acknowledge alerts and act on them.
- NCM, IPAM and other networking applications, plus WPM, are now integrated into SolarWinds, making it easier to manage.
- I have no pictures because I hate pictures. (I know this is something people might enjoy, which is fine, but I have had no input on what people within my organisation want. The way I see it, if you do not give me any requirements or come to any of the meetings I have arranged, then as far as you are concerned everything is fine.)
- I am encouraging people to notify me when work is being carried out so I can do my admin in SolarWinds.
- I would like it to be integrated into ServiceNow, if possible, for things such as audit tracking of nodes.
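For anyone curious what the 30-day report policy amounts to, here is a rough sketch of the idea. It assumes the reports are exported as plain files into a folder; the function name, the folder and the `retention_days` parameter are all made up for illustration, and none of this is a SolarWinds API. It only shows the logic: anything whose last-modified time is older than the cutoff gets deleted.

```python
import time
from pathlib import Path

RETENTION_DAYS = 30  # the 30-day policy; adjust to taste


def purge_old_reports(report_dir, retention_days=RETENTION_DAYS, now=None):
    """Delete files in report_dir older than retention_days.

    Hypothetical helper: assumes reports are ordinary files in one folder.
    Returns the names of the files that were deleted.
    """
    now = time.time() if now is None else now
    cutoff = now - retention_days * 86400  # 86400 seconds per day
    deleted = []
    for path in sorted(Path(report_dir).iterdir()):
        # Only touch regular files; leave subfolders alone.
        if path.is_file() and path.stat().st_mtime < cutoff:
            path.unlink()
            deleted.append(path.name)
    return deleted
```

Something like this could be run from a scheduled task once a day, pointed at wherever the reports actually land. I would test it against a copy of the folder first.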
So, that is my goal in the end. But I have run into some weird things. These are annoying me somewhat, and SolarWinds has still not come back to me with a fix/hot patch/update/other. Nor have I found a real solution on THWACK:
- SAM does not add groups to templates. It sometimes kind of works after restarting a service (I think) on the SolarWinds box and then messing around with it (I haven't done it for a few months, so I would need to refresh my memory).
- Custom charts are breaking the SolarWinds box. You create them and then they suddenly just take down the server(s). I did a schema cache fix, and so far that has proven unsuccessful. I am still working on a fix, though this issue has been known about for ages.
- Sometimes things work and sometimes they don't. I know that everyone's environment is different and there are lots of variables: although there are some universals across environments, there are also some huge differences. That's fine. But why do some hot fixes work for some people and not for others? Again, I know why; it's just a niggle that I have had recently.
So that's my beef.