Description:
What You'll Be Doing (aka Fun with Data!):
Designing, developing, and implementing data pipelines and ETL processes (think of it as building the Large Hadron Collider of data!)
Collaborating with clients to understand their business requirements (or as I call it, decoding their quantum superpositions of needs)
Working closely with cross-functional teams, ensuring that everyone's code plays nice together
Collecting and integrating data from various sources like databases, APIs, external providers, and even mysterious streaming sources (cue X-Files theme)
Aggregating unstructured data into a structured format for data warehousing, because chaos is cool in physics, not in data storage
Optimizing data schemas and ensuring data quality and integrity, because Schrödinger's cat should be alive OR dead, not both in your dataset
Processing and analyzing massive datasets (which is just a fancy way of saying, "You'll be a data wizard!")
Implementing and understanding Data Architecture, including data in motion, data at rest, and the deep philosophical question: Why do we store so much data?
Keeping up with the latest trends in data engineering, because being outdated is so dial-up
What You Need to Succeed (aka Your Superpowers):
Strong proficiency in SQL (obviously)
Proficiency in at least one programming language: Python, Scala, or Java. (Bonus points if you can argue why one is superior.)
Solid experience working with relational databases (or at least knowing why they still matter)
Hands-on experience with cloud-based data platforms like AWS, Azure, or Google Cloud (because everything is in the cloud now, even your grandma's recipes)
Expertise in Data Modelling and Database Design
Experience in designing and implementing efficient ETL pipelines, because data teleportation isn't a thing yet
Bonus Skills (Not Required, But Will Impress Me):
Experience with Snowflake and Matillion (because who doesn't love cool-sounding tools?)
Knowledge of NoSQL databases like MongoDB or Cassandra (because sometimes SQL just doesn't cut it)
Experience with distributed systems like Hadoop and Spark (because Big Data is like the universe: ever-expanding)
Familiarity with Apache NiFi (it's not sci-fi, but it's close)
Exposure to MapReduce, Hive, Pig, or HBase (if you know these, you're already cooler than most people I know)
Understanding of operating systems like UNIX, Linux, and Windows (because knowing one OS is so basic)
Qualifications I'm Looking For (Yes, There's a Bit of a Nerd Filter):
Essential:
BSc in Computer Science or Information Technology.
Preferred:
BSc Honours in Computer Science or Information Technology.
BEng / BSc Engineering.
Who I'm Looking For (aka "Are You My Data Jedi?")
06 Mar 2025;
from:
gumtree.co.za