Skip to content

Add monitoring and alerting mechnism#910

Open
ML217 wants to merge 82 commits into
monitoring_alertingfrom
main
Open

Add monitoring and alerting mechnism#910
ML217 wants to merge 82 commits into
monitoring_alertingfrom
main

Conversation

@ML217
Copy link
Copy Markdown
Collaborator

@ML217 ML217 commented May 17, 2026

Added health check verification and add service update, response time and error rate monitoring

ohwenyi and others added 30 commits March 15, 2026 22:40
Engine fixes:
- Engine.Dockerfile: deploy echo_engine_iot.py as echo_engine.py so
  the IoT engine actually runs in Docker
- Engine/requirements.txt: add scikit-learn (imported but missing)
- echo_engine.json: add API_URL field to replace hardcoded endpoint
- echo_engine_iot.py: read API URL from config, remove duplicate import,
  add _handle_edge_prediction() for edge devices, route on payload type
- test_iot_integration.py: fix import so unit tests run locally

Light engine MQTT fix:
- light_echo_engine.json: correct MQTT_CLIENT_URL from mqtt-broker to
  ts-mqtt-server-cont, fixing DNS resolution error on startup
- torch_impl/requirements.txt: add paho-mqtt==1.6.1, pymongo, geopy,
  google-cloud-storage (all imported but missing)

Edge inference (new):
- IoT/edge_inference/iot_edge_client.py: RPi script that records audio,
  runs EfficientNetV2 TFLite on-device, publishes only prediction result
  over MQTT (no audio transmitted)
- IoT/edge_inference/requirements.txt: RPi deps
- IoT/edge_inference/README.md: deployment guide
Signed-off-by: mraula <96328895+mraula@users.noreply.github.com>
Added error_analysis folder with analysis notebook and resulting output that covers core essentials of production model strengths and weaknesses.
KishanFW and others added 30 commits May 17, 2026 09:55
Signed-off-by: Kishan Fernando <116418723+KishanFW@users.noreply.github.com>
…g-task

Completed onboarding task - added bird descriptions
…lity-guide

Add engine maintainability guide and handover workbook
…diness

Sprint 2 Task 4 Dataset Readiness Workflow
…diness

Ee/ds/dataset readinessSprint 2 Task 4 Dataset Readiness Workflow
…apping

Added ALA label mapping validation workflow
…nfidence-eval

Add Sprint 2 Task 2 deliverable: calibration evaluation report
Created final Error Analysis Report from data gathered from the error_analysis Notebook.
…standardisation

Ee/ka/calibrated confidence standardisation
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.