Ad Widget

**PeterZielony** · 12-06-2024, 15:57

I'd say this is bad design - you don't want to run 50 scripts by agent at the same time even if they are very tiny

I'm working with tons of custom scripts but I always keep in mind to run as little as possible on boxes.

there is a limit 30sec for each script - if you run 50 at the same time they will go out beyond 30sec easily as each need to initialize and then run.

What are those scripts doing if you don't mind asking - some operation or reading something - if reading then what exactly? I might help but I would need the whole picture

and yeah .. 4x version probably doesn't help here either.

**Markku** · 12-06-2024, 17:20

As a side story, I once created a Python app that was using PowerShell commands to interface with PowerShell-only APIs. But very quickly I had to abandon the design because starting the PowerShell interpreter for each API call separately was so slow (in the order of seconds) that the app was unusable and requests couldn't be served. I had to redesign the whole app to execute as native long-running PowerShell script that read the requests in a loop from an external queue, instead of starting the script separately for each request.

cmd.exe starts probably much faster, if that could be usable in your case. And, maybe some WMI items could be used instead of PowerShell scripts?

Finally, another way to feed data to Zabbix is to send it as Zabbix trapper items: have the PowerShell script running all the time (from task scheduler or so), and make it loop and send the relevant metrics every X seconds with zabbix_sender. It is then almost like active agent, just controlling the frequency within the script instead of Zabbix item interval configuration.

Markku

**zabbattical** · 13-06-2024, 14:35

PeterZielony
- Checking MSMQ on Servers if replication is running fine
- checking files on header information to see if it was built and transmitted correctly
- checking smb directories if files get processed
- failover cluster monitoring
- lots and lots of SQL Query checks (we run this in powershell because the agent user runs with a serviceaccount that has access and so we don't need to keep the user credentials in zabbix)
- checking backup data for consistency
- kafka prometheus monitoring (response >64kb, to large for zabbix web request)
- and lots of other stuff i'm currently not thinking off

the 30 second limit is a per script limit i think
but since the powershell queue as a whole can only run a defined number of parallel processes, the runtime of the single scripts cumulates and thats why my 1min items gehts delayed because the sum of items that run every 5 minutes takes so long that the 1min items already get queued again and won't run until the 5min items are worked through.

So for example zabbix sends all 1min items into queue they only take 30 seconds to work through in total.
Then after 5 minutes alle the 5 minute items get shoved into the Powershell queue for zabbix to work through
But since thats a so huge number the Queue takes ~4 minutes to work through all the 5 minute items.
Would be fine for the 5 minute Items.
But in the meantime the 1 minute items already get queued again.
When they then finally run they are already 4 minutes delayed.

This would also describe the behaviour that i'm seeing currently - but i really dunno how the queuing works thats why i'm here

Markku
zabbix sender was also my plan to go, but somewhere in the manual i read that you shouldn't use zabbix sender if there is the possibility to run it through an agent

**cyber** · 13-06-2024, 15:53

Originally posted by zabbattical

zabbix sender was also my plan to go, but somewhere in the manual i read that you shouldn't use zabbix sender if there is the possibility to run it through an agent

That's total BS, pardon my french...

It is clearly seen here, that your chosen way does not work very well (and it really has nothing to do with Zabbix itself, but the way windows starts things and manages them)... You should try better approach and sender is more efficient here.
I dare you to find that quote from manual again..

I'd really like to read it..

**PeterZielony** · 13-06-2024, 17:03

Originally posted by zabbattical

PeterZielony
- Checking MSMQ on Servers if replication is running fine
- checking files on header information to see if it was built and transmitted correctly
- checking smb directories if files get processed
- failover cluster monitoring
- lots and lots of SQL Query checks (we run this in powershell because the agent user runs with a serviceaccount that has access and so we don't need to keep the user credentials in zabbix)
- checking backup data for consistency
- kafka prometheus monitoring (response >64kb, to large for zabbix web request)
- and lots of other stuff i'm currently not thinking off

the 30 second limit is a per script limit i think
but since the powershell queue as a whole can only run a defined number of parallel processes, the runtime of the single scripts cumulates and thats why my 1min items gehts delayed because the sum of items that run every 5 minutes takes so long that the 1min items already get queued again and won't run until the 5min items are worked through.

So for example zabbix sends all 1min items into queue they only take 30 seconds to work through in total.
Then after 5 minutes alle the 5 minute items get shoved into the Powershell queue for zabbix to work through
But since thats a so huge number the Queue takes ~4 minutes to work through all the 5 minute items.
Would be fine for the 5 minute Items.
But in the meantime the 1 minute items already get queued again.
When they then finally run they are already 4 minutes delayed.

This would also describe the behaviour that i'm seeing currently - but i really dunno how the queuing works thats why i'm here

Markku
zabbix sender was also my plan to go, but somewhere in the manual i read that you shouldn't use zabbix sender if there is the possibility to run it through an agent

hm .. this is a lot and will require re-design - this is for sure. I don't have much experience with MSMQ - but I'm sure since this is Microsoft it has some form of accessing data other than powershell. I have to admit - i would love to be in your position to investigate everything and write out solutions for everything - without being there it would be hard for me to pint point to a solution in a single message since a lot is going on your severs.

- MSMQ is an MS product that surely exposes WMI metrics that can be natively collected via agent. (https://wutils.com/wmi/root/cimv2/default.html)
- headers? Do you mean from a file or some API call "in-fly"?
- "checking smb directories if files get processed" from where do you get file info that is supposed to expose this information (how often file name changes or there are multiple ones, from where you get it, what are the paths etc)
- failover cluster - this is when potentially can be observed via PowerShell
- SQL query checks - again PowerShell only if cannot use ODBC - what info do you get from SQL to confirm "failed/Success" and based on what query - is this MSMQ DB?
- backups - meaning which backups? SQL?
-- etc -- etc
---- There is a lot of questions that need to be asked and without access to script and environment, barely impossible to suggest a "one fit all" resolution.

The best approach for you is to investigate every single script and separate them by documenting each task with objectives:
- what is purpose of this check
- from where you can get info (snmp, powershell, WMI, SQL, log files etc) and explore every option if its possible
- how often does it need checking
- is there a way to get a list of things for specific information as long data is somewhat static to create discovery rules and items etc

Then you need to group things "per service" - for example all tasks related to "MSMQ" etc.

Test individual metrics needed separately but avoid using scripts on host at all cost - if you cannot then you will have to write powershell service that will collect data and either expose it to log files (each service/task - separate log file, ideally rotating) -- or use Zabbix sender so you can send data to Zabbix for further processing.

I get this is a very vague answer but really all processes you have there require very specific technical documentation to see what is available, which will help connect everything together later on.

If you need help - sure, we can help but this seems we will have to take it step by step. Each script has 30sec timeout if triggered by agent, but if there are 50 at same time - it simply won't work and each of tasks you described is a challenge in it self if you want reliable observability, Zabbix will help ofc but this have to be redesigned from ground up

unless you are familiar with go and C like Tim suggested - then you can write your own build-in functions without needing PS to check things.

**zabbattical** · 14-06-2024, 13:00

Thanks for your help.

Theres a lot of work for me to do...
Will split up scripts to the host where they belong instead one zentralised scripts server.
Also will change some items from powershell to wmi where it's possible.
User trappers, odbc, etc..

All in All a complete overwork

**cyber** · 17-06-2024, 13:51

PeterZielony commented
14-06-2024, 15:36
also consider agent ver 1 which allows running multiple instances (each will have its own listening port tho) of agents whereas agent 2 just 1 but it is more "packed"

You can do that with agent2 also...

Ad Widget

Huge Performance Issues - System.run - Powershell Scripts

Huge Performance Issues - System.run - Powershell Scripts

Comment

Comment

Comment

Comment

Comment

Comment

Comment