2023
Autores
Castro, E; Ferreira, PM; Rebelo, A; Rio-Torto, I; Capozzi, L; Ferreira, MF; Goncalves, T; Albuquerque, T; Silva, W; Afonso, C; Sousa, RG; Cimarelli, C; Daoudi, N; Moreira, G; Yang, HY; Hrga, I; Ahmad, J; Keswani, M; Beco, S;
Publicação
MACHINE VISION AND APPLICATIONS
Abstract
Every year, the VISion Understanding and Machine intelligence (VISUM) summer school runs a competition where participants can learn and share knowledge about Computer Vision and Machine Learning in a vibrant environment. 2021 VISUM's focused on applying those methodologies in fashion. Recently, there has been an increase of interest within the scientific community in applying computer vision methodologies to the fashion domain. That is highly motivated by fashion being one of the world's largest industries presenting a rapid development in e-commerce mainly since the COVID-19 pandemic. Computer Vision for Fashion enables a wide range of innovations, from personalized recommendations to outfit matching. The competition enabled students to apply the knowledge acquired in the summer school to a real-world problem. The ambition was to foster research and development in fashion outfit complementary product retrieval by leveraging vast visual and textual data with domain knowledge. For this, a new fashion outfit dataset (acquired and curated by FARFETCH) for research and benchmark purposes is introduced. Additionally, a competitive baseline with an original negative sampling process for triplet mining was implemented and served as a starting point for participants. The top 3 performing methods are described in this paper since they constitute the reference state-of-the-art for this particular problem. To our knowledge, this is the first challenge in fashion outfit complementary product retrieval. Moreover, this joint project between academia and industry brings several relevant contributions to disseminating science and technology, promoting economic and social development, and helping to connect early-career researchers to real-world industry challenges.
2023
Autores
Corbetta, V; Beets-Tan, R; Silva, W;
Publicação
Lecture Notes in Computer Science - Machine Learning in Medical Imaging
Abstract
2023
Autores
Ribeiro, M; Nunes, I; Castro, L; Costa-Santos, C; Henriques, TS;
Publicação
FRONTIERS IN PUBLIC HEALTH
Abstract
IntroductionPerinatal asphyxia is one of the most frequent causes of neonatal mortality, affecting approximately four million newborns worldwide each year and causing the death of one million individuals. One of the main reasons for these high incidences is the lack of consensual methods of early diagnosis for this pathology. Estimating risk-appropriate health care for mother and baby is essential for increasing the quality of the health care system. Thus, it is necessary to investigate models that improve the prediction of perinatal asphyxia. Access to the cardiotocographic signals (CTGs) in conjunction with various clinical parameters can be crucial for the development of a successful model. ObjectivesThis exploratory work aims to develop predictive models of perinatal asphyxia based on clinical parameters and fetal heart rate (fHR) indices. MethodsSingle gestations data from a retrospective unicentric study from Centro Hospitalar e Universitario do Porto de Sao Joao (CHUSJ) between 2010 and 2018 was probed. The CTGs were acquired and analyzed by Omniview-SisPorto, estimating several fHR features. The clinical variables were obtained from the electronic clinical records stored by ObsCare. Entropy and compression characterized the complexity of the fHR time series. These variables' contribution to the prediction of asphyxia perinatal was probed by binary logistic regression (BLR) and Naive-Bayes (NB) models. ResultsThe data consisted of 517 cases, with 15 pathological cases. The asphyxia prediction models showed promising results, with an area under the receiver operator characteristic curve (AUC) >70%. In NB approaches, the best models combined clinical and SisPorto features. The best model was the univariate BLR with the variable compression ratio scale 2 (CR2) and an AUC of 94.93% [94.55; 95.31%]. ConclusionBoth BLR and Bayesian models have advantages and disadvantages. The model with the best performance predicting perinatal asphyxia was the univariate BLR with the CR2 variable, demonstrating the importance of non-linear indices in perinatal asphyxia detection. Future studies should explore decision support systems to detect sepsis, including clinical and CTGs features (linear and non-linear).
2023
Autores
Charlton, PH; Allen, J; Bailon, R; Baker, S; Behar, JA; Chen, F; Clifford, GD; Clifton, DA; Davies, HJ; Ding, C; Ding, XR; Dunn, J; Elgendi, M; Ferdoushi, M; Franklin, D; Gil, E; Hassan, MF; Hernesniemi, J; Hu, X; Ji, N; Khan, Y; Kontaxis, S; Korhonen, I; Kyriacou, PA; Laguna, P; Lazaro, J; Lee, CK; Levy, J; Li, YM; Liu, CY; Liu, J; Lu, L; Mandic, DP; Marozas, V; Mejía-Mejía, E; Mukkamala, R; Nitzan, M; Pereira, T; Poon, CCY; Ramella-Roman, JC; Saarinen, H; Shandhi, MMH; Shin, H; Stansby, G; Tamura, T; Vehkaoja, A; Wang, WK; Zhang, YT; Zhao, N; Zheng, DC; Zhu, TT;
Publicação
PHYSIOLOGICAL MEASUREMENT
Abstract
Photoplethysmography is a key sensing technology which is used in wearable devices such as smartwatches and fitness trackers. Currently, photoplethysmography sensors are used to monitor physiological parameters including heart rate and heart rhythm, and to track activities like sleep and exercise. Yet, wearable photoplethysmography has potential to provide much more information on health and wellbeing, which could inform clinical decision making. This Roadmap outlines directions for research and development to realise the full potential of wearable photoplethysmography. Experts discuss key topics within the areas of sensor design, signal processing, clinical applications, and research directions. Their perspectives provide valuable guidance to researchers developing wearable photoplethysmography technology.
2023
Autores
Salewski, L; Alaniz, S; Rio-Torto, I; Schulz, E; Akata, Z;
Publicação
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023)
Abstract
In everyday conversations, humans can take on different roles and adapt their vocabulary to their chosen roles. We explore whether LLMs can take on, that is impersonate, different roles when they generate text in-context. We ask LLMs to assume different personas before solving vision and language tasks. We do this by prefixing the prompt with a persona that is associated either with a social identity or domain expertise. In a multi-armed bandit task, we find that LLMs pretending to be children of different ages recover human-like developmental stages of exploration. In a language-based reasoning task, we find that LLMs impersonating domain experts perform better than LLMs impersonating non-domain experts. Finally, we test whether LLMs' impersonations are complementary to visual information when describing different categories. We find that impersonation can improve performance: an LLM prompted to be a bird expert describes birds better than one prompted to be a car expert. However, impersonation can also uncover LLMs' biases: an LLM prompted to be a man describes cars better than one prompted to be a woman. These findings demonstrate that LLMs are capable of taking on diverse roles and that this in-context impersonation can be used to uncover their strengths and hidden biases. Our code is available at https://github.com/ExplainableML/in-context-impersonation.
2023
Autores
Obereder A.; Bertram T.; Correia C.; Feldt M.; Raffetseder S.; Shatokhina J.; Steuer H.;
Publicação
7th Adaptive Optics for Extremely Large Telescopes Conference, AO4ELT7 2023
Abstract
METIS SCAO uses a wavefront control concept that deploys a 2-stage spatial reconstruction where the wavefront is first reconstructed on an intermediate space we call the virtual DM, and then projected onto the actual control space. This document addresses the projection of the wavefront estimation on the virtual deformable mirror (VDM) onto the control modes developed for METIS (Mid-infrared ELT Imager and Spectrograph). We present a new approach to project onto the control modes using an intermediate regularized projection on the M4 mirror and then convert to modes. This method enables us to utilise all modes for the projection and control in a stable manner, achieving high Strehl ratios for a wide range of conditions without the need for complex parameter tuning.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.