The aim of the proposed project is to interpret big and noisy visual data recorded in diverse environments with no predefined constraints. To this end, the goal is to develop and apply original data mining methods for extracting important knowledge and increasing the accessibility of such archives. In particular, we focus on summarization approaches, so that big visual data is more effectively structured and enriched with additional semantic information. The summarization approaches, which make use of the multi-modal nature of the data, will address three main problems: 1) learning semantic concepts and spatio-temporal attributes from big visual data; 2) organizing large photograph collections; 3) summarizing videos in large web archives. In all of these problems, big visual data and the accompanying information, referred to as metadata, will be handled together.
Diverse Neural Photo Album Summarization
International Conference on Image Processing Theory, Tools and Applications (IPTA 2019)
Yunus Emre Ozkose, Bora Celikkale, Erkut Erdem, Aykut Erdem

A Comparative Analysis of Practices in Training Deep Models for Fashion Attribute Detection (in Turkish)
27th IEEE Signal Processing and Communications Applications Conference (SIU) 2019
Mustafa Sercan Amac, Aykut Erdem, Erkut Erdem