Company Overview
Dadosfera is transforming the data landscape with a platform that puts advanced Data, AI, and Analytics capabilities, previously available only to tech giants like Meta, Amazon, Alphabet, and Microsoft, into the hands of small and medium-sized businesses. By democratizing access to these technologies, we empower our clients to accelerate business opportunities through AI-powered Data Apps.
Our platform minimizes the time spent on cloud management and streamlines the entire data lifecycle—from collection and exploration to processing and interaction—allowing clients to focus on their business goals. Built on a high-growth SaaS model, Dadosfera offers tailored solutions that meet real business needs.
What sets Dadosfera apart is our focus on speed, scalability, and flexibility, combining big data storage with advanced analytics to support strategic decision-making. Drawing on years of experience implementing data platforms on foreign public clouds, we now deliver a cost-effective, locally adapted solution for both Brazilian and global markets. By blending deep expertise with localized infrastructure, Dadosfera provides a powerful, efficient tool that optimizes corporate data management and accelerates data-driven transformation.
About the Job
Dadosfera, a leader in Data & AI Platform Solutions, is seeking exceptional talent in Data Engineering. We’re looking for someone who is passionate about data, thrives in an autonomous, high-learning environment, and enjoys working with cutting-edge technologies. As a Data Engineer, you will manage large volumes of data, creating solutions and generating insights for our clients.
Responsibilities
- Design, develop, and maintain scalable, efficient, and robust data pipelines.
- Implement data ingestion, transformation, and quality processes, ensuring reliability and governance across deliveries.
- Build and optimize data architectures using cloud services, Kubernetes, and orchestration tools.
- Develop and maintain APIs, scripts, and integrations to ingest data from multiple sources (APIs, files, crawlers, connectors).
- Develop web scrapers and crawlers, ensuring the performance and integrity of collected data.
- Collaborate with product, analytics, data science, and engineering teams to enable end-to-end data solutions.
- Monitor performance of pipelines and implemented solutions, identifying bottlenecks and proposing continuous improvements.
- Support data architecture initiatives, technical standardization, and engineering best practices.
- Contribute to the development of data applications (data apps), including prototyping and optimization using Streamlit.
- Participate in strategic projects involving automations, integrations, and—when applicable—AI-enabled features.
Minimum Qualifications
- 3+ years of experience in Data Engineering or a related field.
- Bachelor's degree in a STEM field or equivalent industry experience.
- Proficiency in building, maintaining, and optimizing data pipelines.
- Experience with SQL and Python development.
- Hands-on experience with Linux environments and cloud services.
- Familiarity with CDC ingestion tools (e.g., Debezium, DMS) and data orchestration tools (e.g., Airflow, NiFi).
- Practical experience with Kubernetes and Apache Spark.
- Ability to build, maintain, and consume APIs, as well as develop scripts and automations for data manipulation and integration from various sources (APIs, files, custom connectors).
- Experience with web scraping, crawlers, and web data extraction tools (e.g., Scrapy, Beautiful Soup).
- Strong organizational and prioritization skills in environments with multiple competing demands.
- Experience developing data applications (data apps), especially with Streamlit.
Preferred Qualifications
- Experience with LLM-based projects, including integration and consumption of AI models within data solutions.
- Experience in data platform or environment migration projects.
- Knowledge of MLOps and DataOps practices to support AI-driven applications.
- Experience with infrastructure-as-code tools such as Terraform or CloudFormation.
- Knowledge of relational and non-relational databases.
- Experience in tech startups or with BI tools (Power BI, Tableau, Metabase).
- Intermediate or advanced English proficiency.
At Dadosfera, we celebrate diversity in all its forms and are committed to fostering an inclusive environment every day. We have zero tolerance for any form of discrimination. Do you share our values? Then don’t wait—apply today!