python
3 days, 11 hours ago
——————————————————————————————————————————
Fixing Autodoc excel readability issue and moving to production usage.
Created new implementation for excel sheet to open and load all formulas automatically and then run autodoc PPT generation.
Discussed with stakeholders and made the scripts to work out the best way to move it production runs.
Created packaging configurations so that it will be uploaded to nexus and anyone can use it by just install using pip install autodoc-pptx
Onboarding snowflake datasets to Anamolo
Understood the complete flow from snowflake to Anamolo and alerting of the anomalies/data checks.
Migrated snowflake data tables to the anamolo and configured the tables to run checks on daily basis so that everything will be tracked over time.
Worked on anamolo APIs to automate the table configuration process with python, worked on core apis and its usage to understand automation.
Implemented unit tests to cover over 90% of the code coverage and fixed all sonar checks.
Conduct KT and sessions for team and interns on Anamolo setup and configurations.
I’ve conducted lot of KT sessions and presentations to the team to educate how anamolo will work and how to configure everything.
I also gone through the lot of DQ checks which are written in python and documented how to configure the same in anamolo as well.
——————————————————————————————————————————
Question
What knowledge or skills did you develop this year that helped you achieve positive results?
——————————————————————————————————————————
In the past year, I have focused on learning and automating processes related to Anomalo—a powerful data quality monitoring tool. Anomalo has become essential for detecting and diagnosing data quality issues in large datasets. To enhance the efficiency and reliability of my tasks, I combined this knowledge with Python automation. Here’s how this skill has contributed to positive results:
Understanding Anomalo's Usage
Anomalo excels at identifying data quality issues by analyzing anomalies, missing data, schema changes, and trends. By deeply learning its functionalities, I have been able to:
* Set up automated anomaly detection to monitor data health continuously, without manual intervention.
* Understand various data quality metrics and alerts Anomalo offers, allowing me to interpret results more accurately and act on them quickly.
* Customize monitoring configurations based on specific business needs, which enables a proactive response to potential data issues before they impact downstream processes.
Developing Python Automation for Anomalo
I used Python to automate many of Anomalo's manual processes, enhancing productivity and reliability. This automation skillset has allowed me to:
* Automate report generation: By scripting automated reports in Python, I saved hours of manual work and provided stakeholders with up-to-date data quality insights.
* Integrate anomaly alerts into workflow systems (like Slack or email), ensuring quick responses to any issues.
* Create scripts to handle repetitive tasks, such as re-running checks, setting up new monitoring rules, or updating alert parameters, which increased operational efficiency.
* Analyze and visualize data trends from Anomalo’s output, helping to identify long-term patterns and root causes of recurrent data issues.
——————————————————————————————————————————
An overall rating is required. When assigning an overall rating, consider the following:
* Contributions to Risk, Compliance, and Issue Management
* Results Delivered
* Demonstration of the Discover Behaviors
Note: The Overall Rating cannot be more than one rating higher than the Required Risk Goal rating.
——————————————————————————————————————————
1. Enhanced Autodoc Excel Functionality for Production: Resolved readability issues with Autodoc Excel, created a seamless process for formula loading, and transitioned the Autodoc PPT generation tool to production. Configured packaging for easy access through Nexus and simplified usage with pip installation.
2. Automated Snowflake Dataset Integration with Anamolo: Developed a robust workflow for onboarding Snowflake data into Anamolo, enabling continuous data monitoring. Automated table configuration processes using Anamolo’s APIs, ensuring daily checks and seamless anomaly tracking.
3. Improved Code Quality and Coverage: Worked on Anamolo’s core APIs to automate data processes with Python, achieving over 90% code coverage through comprehensive unit tests and resolving all SonarQube checks for quality assurance.
4. Knowledge Transfer and Training: Conducted multiple knowledge transfer sessions for team members and interns, covering Anamolo setup, configuration, and Python-based data quality checks. Documented the configuration process for standardized use across the team.
5. Stakeholder Collaboration: Engaged with stakeholders to refine automation scripts, ensuring optimal performance in production. Collaborated to align implementation with user needs, enhancing tool reliability and user satisfaction.
0 Comments
Please Login to Comment Here