SEMINARS AND WORKSHOPS
Workshop on “Self-Organizing Networks (SONs) in 5G using AI”
On 16 September 2025, the Department of Computer Science and Engineering (Data Science) hosted a workshop for 7th-semester students on the role of a Data Engineer and the use of ETL (Extract, Transform, Load) tools. The session highlighted the importance of building and maintaining data pipelines that move and process data for analytics and decision-making.
The speaker, Mr. Saubhagya Ranjan Sahoo, Data Engineer at Pentland Brands, emphasized that a Data Engineer is responsible for designing scalable pipelines, integrating data from multiple sources, ensuring data quality and reliability, and enabling analysts and data scientists to derive insights.
The session introduced key tools such as:
- AWS S3 – a cloud-based central data lake offering durability, scalability, and cost-effectiveness.
- Snowflake – a cloud data warehouse that separates compute and storage while supporting scalable analytics through SQL queries.
- An ETL/ELT tool – providing a visual interface to efficiently extract, transform, and load data into systems like Snowflake.
The session also covered how these tools fit together in a workflow: data is first ingested and stored in AWS S3, then processed to derive business insights.
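The extract, transform, load flow described above can be illustrated with a small, self-contained sketch. It is not the speaker's pipeline and it does not call AWS S3 or Snowflake: the inline CSV text stands in for a raw file landed in a data lake, and an in-memory SQLite database stands in for the warehouse. The `orders` table, its columns, and all function names are hypothetical and chosen only for illustration.

```python
import csv
import io
import sqlite3

# Hypothetical raw export, standing in for a file landed in a data lake
# such as an S3 bucket. The columns are invented for this example.
RAW_CSV = """order_id,region,amount
1,north,120.50
2,south,
3,north,79.50
4,east,200.00
"""


def extract(raw_text):
    """Extract: read raw CSV text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(raw_text)))


def transform(rows):
    """Transform: drop rows with a missing amount and cast field types."""
    cleaned = []
    for row in rows:
        if not row["amount"]:
            continue  # basic data-quality rule: skip incomplete records
        cleaned.append(
            (int(row["order_id"]), row["region"], float(row["amount"]))
        )
    return cleaned


def load(records, conn):
    """Load: write cleaned records into the (stand-in) warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", records)
    conn.commit()


def revenue_by_region(conn):
    """Analytics step: a SQL aggregate over the loaded data."""
    cursor = conn.execute(
        "SELECT region, SUM(amount) FROM orders "
        "GROUP BY region ORDER BY region"
    )
    return cursor.fetchall()


def run_pipeline():
    """Run extract -> transform -> load, then query the result."""
    conn = sqlite3.connect(":memory:")  # SQLite used only as a stand-in
    load(transform(extract(RAW_CSV)), conn)
    return revenue_by_region(conn)


if __name__ == "__main__":
    print(run_pipeline())
```

In a real deployment the extract step would read objects from cloud storage and the load step would write to a cloud warehouse through that platform's own connector; the separation into three small functions is the part that carries over.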
In conclusion, the session emphasized that modern data engineering relies heavily on cloud platforms and ETL tools to manage large-scale data effectively. Understanding tools like AWS S3 is essential for building efficient, automated, and scalable data pipelines.