Data Science for Computer Scientists SoSe 2025
(DS4CS)

Data Science for Computer Scientists offers a practical and theoretical foundation in data science, tailored for students with a computer science background. The course covers essential topics such as data preprocessing, statistical analysis, and machine learning. With an emphasis on hands-on experience, students will apply algorithms to various datasets, extract meaningful patterns from the data, and visualize the results.


Organization

  • Lecturer: Prof. Dr. Ziawasch Abedjan, D2IP
  • Teaching Assistants: Muaid Mughrabi & Fedor Turchenko, D2IP
    • muaid.mughrabi (at) tu-berlin.de
    • fedor.turchenko (at) tu-berlin.de
  • Lecture: Mondays 14:15 - 15:45, HFT-TA 131
  • Exercise: Wednesdays 8:15 - 9:45, HFT-TA 131
  • Registration: As places in the course are limited, registration takes place through a registration form. The form will be published after the first lecture (14.04.2025) and must be submitted before the first lab (16.04.2025, 08:00). Students will be accepted through a random selection process, based on the student groups defined in the form.

Content

  • Lectures: Weekly theoretical sessions covering key concepts. These sessions aim to build foundational knowledge and introduce topics relevant to the course.
  • Lab sessions: Hands-on practical exercises designed to apply lecture content. Students will work with datasets and different tools to enhance their understanding and develop technical skills. The lab sessions also serve as practical preparation for the assignments.
  • Assignments: Periodically assigned tasks designed to encourage deeper exploration of course materials and to evaluate comprehension and progress. The assignments are to be completed in groups of 2-3 students.

Deliverables

  • Assignments

Infrastructure

We will use JupyterHub as the shared environment for the lab sessions and assignments. It allows for seamless distribution of the materials and eases collaboration between students.

  • JupyterHub URL: http://130.149.21.99/
    • Connections from outside the TU Berlin network are only possible via VPN
    • If you are a macOS user, we recommend connecting to JupyterHub with the Safari browser
  • Your username is the part of your TU Berlin email address before the @
    • e.g., name@campus.tu-berlin.de -> the username is name
  • You create your password during the first login. It should be at least 7 characters long
  • If you have forgotten your credentials or have trouble logging in, please write an email to fedor.turchenko (at) tu-berlin.de
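
The username and password rules above can be illustrated with a minimal Python sketch. The function names jupyterhub_username and password_long_enough are hypothetical helpers written for this illustration; they are not part of JupyterHub or of any course tooling, and the hub may apply further checks that are not described here.

```python
def jupyterhub_username(email: str) -> str:
    """Return the part of the email address before the '@' (hypothetical helper)."""
    local_part, _, _domain = email.strip().partition("@")
    return local_part


def password_long_enough(password: str, min_length: int = 7) -> bool:
    """Check only the minimum length stated on this page (at least 7 characters)."""
    return len(password) >= min_length


if __name__ == "__main__":
    # Example address taken from the rule above.
    print(jupyterhub_username("name@campus.tu-berlin.de"))
    print(password_long_enough("abcdefg"))
```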


Schedule

Lecture Dates (Mondays):

  • Monday, 14.04.2025
  • Monday, 21.04.2025 – Cancelled
  • Monday, 28.04.2025
  • Monday, 05.05.2025
  • Monday, 12.05.2025
  • Monday, 19.05.2025
  • Monday, 26.05.2025
  • Monday, 02.06.2025
  • Monday, 09.06.2025 – Cancelled
  • Monday, 16.06.2025
  • Monday, 23.06.2025
  • Monday, 30.06.2025 – EB 222 (different room)
  • Monday, 07.07.2025
  • Monday, 14.07.2025

Lab Dates (Wednesdays):

  • Wednesday, 16.04.2025
  • Wednesday, 23.04.2025
  • Wednesday, 30.04.2025
  • Wednesday, 07.05.2025
  • Wednesday, 14.05.2025
  • Wednesday, 21.05.2025
  • Wednesday, 28.05.2025
  • Wednesday, 04.06.2025
  • Wednesday, 11.06.2025
  • Wednesday, 18.06.2025
  • Wednesday, 25.06.2025
  • Wednesday, 02.07.2025
  • Wednesday, 09.07.2025