1. FAQ About Alarm
What should I do when the alarm never goes off even though I have configured policies?
- Check the log of Sender, Alarm, Judge, Hbs, Agent and Transfer to see if there is any error
- Visit the http webpage of Alarm through your browser to check if there isare any alarms needs to be reset,.If there is, then the alarmthey has been already generated. The reason why you didn't receive alarms is probably an error occurs at mail or SMS sending port. Then you need to check the API configuration in Sender.
- Open Debug in Agent to see if it is still pushing data properly.
- Check the configuration of Agent to see if the address of Heartbeat(HBS) and Transfer is correctly configured and enabled.
- Check the configuration of Transfer to see if the address of Judge is correctly configured.
- Judge provides an http port for debugging. We can use it to check if certain data has been correctly pushed. For example, if we want to check the data of "cpu.idle" of the machine "qd-open-falcon-judge01.hd", then run
"127.0.0.1:6081" above means the http port of Judge.curl 127.0.0.1:6081/history/qd-open-falcon-judge01.hd/cpu.idle
- Check if the time of server is synchronized through ntp or chrony.
- Check if the HBS address of Judge is correctly configured.
- Check if database address of HBS is correctly configured.
- Check if alarm recipients are correctly configured in the policy template of Portal.
- Check if the policy template in Portal configuration is bound to a HostGroup and the target machine happens to be in this HostGroup.
- Check UIC if you are added in the alarm recipient group
- Check UIC to see if your contact information is correct.
An error occurs when I add a machine in HostGroup after creating a HostGroup in Protal page.
- Check if the address of Heartbeat in correctly is configured and enable in Agent.
- Check the log HBS.
- Check if database address of HBS is correctly configured.
- Check if the configuration of hosts is sync in HBS. HBS will only write host list when it is blank and you can only add a machine when there are data in host list.