2023
Autores
Litvak, M; Rabaev, I; Campos, R; Jorge, M; Jatowt, A;
Publicação
CEUR Workshop Proceedings
Abstract
[No abstract available]
2023
Autores
Litvak, M; Rabaev, I; Campos, R; Jorge, AM; Jatowt, A;
Publicação
SIGIR Forum
Abstract
2023
Autores
Mansouri, B; Campos, R;
Publicação
CoRR
Abstract
2023
Autores
Mansouri, B; Durgin, S; Franklin, S; Fletcher, S; Campos, R;
Publicação
Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), Thessaloniki, Greece, September 18th to 21st, 2023.
Abstract
This paper describes the participation of the Artificial Intelligence and Information Retrieval (AIIR) Lab from the University of Southern Maine and the Laboratory of Artificial Intelligence and Decision Support (LIAAD) lab from INESC TEC in the CLEF 2023 SimpleText lab. There are three tasks defined for SimpleText: (T1) What is in (or out)?, (T2) What is unclear?, and (T3) Rewrite this!. Five runs were submitted for Task 1 using traditional Information Retrieval, and Sentence-BERT models. For Task 2, three runs were submitted, using YAKE! and KBIR keyword extraction models. Finally, for Task 3, two models were deployed, one using OpenAI Davinci embeddings and the other combining two unsupervised simplification models.
2023
Autores
Campos, R; Jorge, AM; Jatowt, A; Bhatia, S; Litvak, M;
Publicação
CEUR Workshop Proceedings
Abstract
[No abstract available]
2023
Autores
Mansouri, B; Campos, R; Jatowt, A;
Publicação
COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023
Abstract
Timeline summarization (TLS) is a challenging research task that requires researchers to distill extensive and intricate temporal data into a concise and easily comprehensible representation. This paper proposes a novel approach to timeline summarization using Abstract Meaning Representations (AMRs), a graphical representation of the text where the nodes are semantic concepts and the edges denote relationships between concepts. With AMR, sentences with different wordings, but similar semantics, have similar representations. To make use of this feature for timeline summarization, a two-step sentence selection method that leverages features extracted from both AMRs and the text is proposed. First, AMRs are generated for each sentence. Sentences are then filtered out by removing those with no named-entities and keeping the ones with the highest number of named-entities. In the next step, sentences to appear in the timeline are selected based on two scores: Inverse Document Frequency (IDF) of AMR nodes combined with the score obtained by applying a keyword extraction method to the text. Our experimental results on the TLS-Covid19 test collection demonstrate the potential of the proposed approach.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.