SponsorUnited is the leading global sports and entertainment intelligence platform, delivering real-time trends and on-demand research that provides invaluable insights.
With tens of millions of marketing partnerships, brands, rights-holders and creative tracks, our SaaS database enables organizations to partner more effectively and make decisions at speed and scale. By connecting the entire sponsorship ecosystem through the most comprehensive data available anywhere, SponsorUnited is fueling smarter partnerships.
Job Description:
SponsorUnited is seeking a Data Solutions Engineer to spearhead the development, implementation, and management of our data pipelines and infrastructure. This role is central to our mission, ensuring the robustness, scalability, and efficiency of our data operations. As the steward of our data quality and data architecture, you will play a critical role in leveraging AWS technologies to optimize our data warehousing and data lakes, setting the stage for innovative uses of ML and AI across our platform.
Key Responsibilities:
- Data Pipeline Ownership : Architect, deploy, and manage scalable data pipelines capable of handling vast volumes of data from a diverse array of data sources, including websites, images, audio feeds, and video feeds.
- Data Extraction and Intelligence: Implement pipelines to extract relevant information with precision, ensuring high-quality data is readily available for analytics and machine learning applications.
- Data Infrastructure Management : Oversee our comprehensive data infrastructure, ensuring optimal performance, reliability, and security across our AWS-based data warehousing and data lake solutions.
- Data Technology Expertise : Leverage extensive knowledge of AWS data services (including but not limited to Redshift, S3, Glue, Athena, and Kinesis) to build and maintain state-of-the-art data solutions that support our analytical and operational needs.
- Cross-Functional Leadership : Collaborate closely with product managers, data scientists, and engineering teams to understand and fulfill data requirements, facilitating the seamless integration of ML/AI technologies to enhance our offerings.
- Performance Optimization : Apply advanced techniques for tuning and optimizing data throughput and storage efficiency, ensuring swift and reliable access to critical data insights.
- Innovation and Best Practices : Stay at the forefront of data management and AWS cloud technologies, advocating for and implementing cutting-edge practices that contribute to our legacy of technical innovation and excellence.
Qualifications:
- 10+ years of experience in data engineering or a similar role, with a strong emphasis on building scalable solutions, demonstrating the ability to design and deploy scalable data solutions.
- Bachelor’s degree in Computer Science, Data Science, or a related field (or equivalent experience).
- Expert knowledge of data lake and data warehouse architecture, data ingestion, transformation, and data modeling best practices.
- Strong, hands-on experience with cloud-based databases (e.g., AWS RDS, Azure SQL Database or Google cloud) AWS RDS and data lake technologies, like AWS S3, Azure Data Lake Storage, or Google Cloud Storage, for managing and storing unstructured and semi-structured data.
- Certification in relevant AWS architectures and database technologies such as AWS Certified Solutions Architect Associate/Professional and AWS Certified Data Analytics Specialty is a strong plus.
- Strong programming skills in languages to develop applications and scripts for data automation and knowledge of languages like Python, Bash, R, Java, for automation and maintenance tasks.
- Expertise in AWS data warehousing platforms, such as Amazon Redshift, as well as exposure to other platforms like Google BigQuery.
- Strong understanding of data security and compliance regulations, like SOC2, NIST, PCI, etc.
- Excellent problem-solving and communication skills.
Azure SQL Database amazon-s3 Artificial intelligence (AI) Python Amazon Web Services (AWS) Amazon Redshift Amazon Athena Amazon Kinesis SaaS pci Java Machine Learning Bash amazon-rds SOC2 aws-glue google-cloud-storage