Role: Data Catalog Engineer
Location: NYC, NY (Remote)
Contract
- Data Catalog Engineer
- We are seeking a Data Catalog Engineer to lead the implementation and management of our data catalog solutions. You will set up AWS DataZone and enhance its metamodel to ensure seamless data discovery and governance.
- Technical Skills Required:
- Overall 10+ Years of IT Experience with
- Proficiency in AWS services, including DataZone, Glue Catalog, Crawlers, and Lake Formation.
- Strong experience in metadata management and data classification techniques.
- Responsibilities:
- AWS DataZone Setup: Configure and optimize AWS DataZone for effective data cataloging.
- Metadata Management: Harvest metadata from diverse sources like S3, Redshift, and Aurora PostgreSQL, ensuring comprehensive asset documentation.
- Glue Catalog Management: Create and maintain the AWS Glue Data Catalog for accurate metadata representation.
- Crawlers Configuration: Set up and manage AWS Glue Crawlers for automated data discovery and catalog updates.
- Lake Formation Integration: Leverage Lake Formation for secure data lake setup and access management.
- Data Classification: Implement data classification strategies using AWS Macie and Glue to enhance data governance.
- Access Request Processes: Establish processes for requesting data access and managing identity and access permissions.
- Data Inventory Maintenance: Maintain an up-to-date inventory of data assets to facilitate data discovery for consumers.
- User Interface Customization: Extend the DataZone UI to improve user experience and functionality.
- Collaboration: Work with cross-functional teams to ensure data assets meet organizational needs and compliance standards
From:
Sivabalan,
CAGUS
sivabalan.c@cagus.com
Reply to: sivabalan.c@cagus.com