Topics: applied machine learning, data leakage, reproducibility
Skills: Python, data analysis, machine learning
Difficulty: Medium
Size: Large (350 hours)
Mentors: Fraida Fund and Mohamed Saeed

Project Idea Description

Data leakage has been identified as a major cause of irreproducible findings in papers that apply machine learning techniques to scientific problems.