Publicaciones

Publications for 2025

Displaying 154 publication(s) for 2025

Compiling Gradual Types with Evidence

Authors: Matías Toro, Éric Tanter, José Luis Romero et al.
Year: 2025Source: arXiv (Cornell University)
Efficiently supporting sound gradual typing in a language with structural types is challenging. To date, the Grift compiler is the only close-to-the-metal implementation of gradual typing in this sett...

Criminalidad y Democracia en América Latina

Authors: Juan Pablo Luna, A. Feldmann
Year: 2025
En la última década, el crimen organizado ha dejado de ser un fenómeno localizado para convertirse en una amenaza estructural a las democracias de América Latina. Redes criminales diversificadas y...

Joint model shows association of Mapuche genetic ancestry and longitudinal BMI with early menarche

Authors: Danilo Alvares, S. Eyheramendy, Lucas Vicuña et al.
Year: 2025
Abstract The age at puberty onset varies greatly between individuals and ethnic populations, with significant health implications. Early menarche increases risk for breast cancer, cardiovascular disea...

Anesthésie dans la chirurgie de De Quervain : les avantages de la technique WALANT

Authors: Claudio Gutiérrez, Nicole Mercier Rodríguez
Year: 2025Source: Hand surgery & rehabilitation

Identifying a novel Mecp2-mediated epigenetic mechanism controlling Lonp1 in the hippocampus and its disruption by aging

Authors: Alejandra Loyola, Karina A Cicali, Jesús Llanquinao-Sandoval et al.
Year: 2025Source: Scientific Reports

New compressed indices for multijoins on graph databases

Authors: Diego Arroyuelo, Adrián Gómez‐Brandón, Gonzalo Navarro
Year: 2025Source: Information Systems

Active Learning of Symbolic Automata Over Rational Numbers

Authors: Cristian Riveros, Sebastian Hagedorn, Martı́n Muñoz et al.
Year: 2025Source: arXiv (Cornell University)
Automata learning has many applications in artificial intelligence and software engineering. Central to these applications is the $L^*$ algorithm, introduced by Angluin. The $L^*$ algorithm learns det...

Simulating conversations on social media with generative agent-based models

Authors: Marcelo Mendoza, Andrés Carvallo, Eliana Providel et al.
Year: 2025Source: EPJ Data Science
Large Language Models (LLMs) can generate realistic text resembling human-produced content. However, the ability of these models to simulate conversations on social media is still less explored. To in...

Query Answering Under Volume-Based Diversity Functions

Authors: Cristian Riveros, Marcelo Arenas, Reinhard Pichler et al.
Year: 2025Source: Proceedings of the ACM on Management of Data
When query evaluation produces too many tuples, a new approach in query answering is to retrieve a diverse subset of them. The standard approach for measuring the diversity of a set of tuples is to us...

Full Waveform Inversion via Optimal Transport with Sign-Sensitive Signal Decomposition

Authors: Juan Pablo Luna
Year: 2025
We developed a theoretical framework that encompasses a broad family of misfit functions between real and simulated seismogram data, including well-known examples such as the least-squares criterion. ...

User Perception of Attention Visualizations: Effects on Interpretability Across Evidence-Based Medical Documents

Authors: Vladimir Araujo, Andrés Carvallo, Hernan Valdivieso et al.
Year: 2025Source: Lecture notes in computer science

Perceptual Evaluation of GANs and Diffusion Models for Generating X-Rays

Authors: Cecilia Besa, Denis Parra, Gregory Schuit
Year: 2025Source: Lecture notes in computer science

Graph Querying or Similarity Search? Both!

Authors: Vicente Calisto, Gonzalo Navarro, Sebastián Ferrada et al.
Year: 2025Source: Lecture notes in computer science

Striving for excellence is striving for diversity

Authors: Magdalena Saldaña, Edson C. Tandoc, Kristy Hess et al.
Year: 2025
There is rising momentum within the fields of communication, media studies and (digital) journalism studies to enhance diversity of scholarship, away from a Western-centric gaze, and to be more inclus...

Using large language models for survey research in communication: opportunities and challenges

Authors: Stephan Winter, Sebastián Rivera, Sebastián Valenzuela
Year: 2025Source: Communication and Change
Abstract Artificial intelligence (AI) is transforming survey research, offering powerful tools like large language models (LLMs) to analyze human beliefs, opinions, and behaviors. As researchers incre...

Evaluating GPT-4o in high-stakes medical assessments: performance and error analysis on a Chilean anesthesiology exam

Authors: Marcelo Mendoza, Andrés Neyem, Fernando Altermatt et al.
Year: 2025Source: BMC Medical Education
Abstract Background Large language models (LLMs) such as GPT-4o have the potential to transform clinical decision-making, patient education, and medical research. Despite impressive performance in gen...

Can Large Language Models Compete with Specialized Models in Lexical Semantic Change Detection?

Authors: Nikolay Arefyev, Felipe Bravo-Márquez, Frank D. Zamora-Reina et al.
Year: 2025Source: Frontiers in artificial intelligence and applications
In this paper, we present a comprehensive comparison between specialized Lexical Semantic Change Detection (LSCD) models and Large Language Models (LLMs) for the LSCD task. In addition to comparing mo...

An empirical study of the effect of video encoders on Temporal Video Grounding

Authors: Edison Marrese-Taylor, Cristian Rodríguez-Opazo, Felipe Bravo-Márquez et al.
Year: 2025Source: arXiv (Cornell University)
Temporal video grounding is a fundamental task in computer vision, aiming to localize a natural language query in a long, untrimmed video. It has a key role in the scientific community, in part due to...

B-Call: integrating ideological position and voting cohesion in legislative behavior

Authors: Juan Reutter, Sergio Toro, Daniel Alcatruz et al.
Year: 2025Source: Frontiers in Political Science
This paper addresses two central dimensions of legislative behavior: ideological position and voting cohesion. Although both approaches have been widely used to analyze legislative behavior, no unifie...

Flexible and Expressive Typed Path Patterns for GQL

Authors: Manuel Rigger, Matías Toro, Wenjia Ye et al.
Year: 2025Source: Proceedings of the ACM on Programming Languages
Graph databases have become an important data management technology across various domains, including biology, sociology, industry (e.g. fraud detection, supply chain management, financial services), ...

Incremental Certified Programming

Authors: Éric Tanter, Kenji Maillard, Nicolas Tabareau et al.
Year: 2025Source: Proceedings of the ACM on Programming Languages
Certified programming, as carried out in proof assistants and dependently-typed programming languages, ensures that a software meets its requirements by supporting the definition of both specification...

CompactLTJ: Space & Time Efficient Leapfrog Triejoin on Graph Databases

Authors: Domagoj Vrgoč, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2025Source: The VLDB Journal
Abstract Leapfrog Triejoin (LTJ) is arguably the most practical and popular worst-case-optimal (wco) algorithm for solving basic graph patterns in graph databases. Its main drawback is that it needs t...

Human Response to Decision Support in Face Matching: The Influence of Task Difficulty and Machine Accuracy

Authors: Ricardo Baeza-Yates, Carlos Castillo, Marina Estévez-Almenzar
Year: 2025Source: Frontiers in artificial intelligence and applications
Decision support systems enhanced by Artificial Intelligence (AI) are increasingly being used in high-stakes scenarios where errors or biased outcomes can have significant consequences. In this work, ...

Smallest Suffixient Sets as a Repetitiveness Measure

Authors: Gonzalo Navarro, Cristian Urbina, Giuseppe Romana
Year: 2025Source: Lecture notes in computer science

Cache-Friendly Compressed Boolean Matrices

Authors: Gonzalo Navarro, Adrián Gómez-Brandón, Antonio Fariña et al.
Year: 2025Source: Lecture notes in computer science

Query Answering under Volume-Based Diversity Functions

Authors: Cristian Riveros, Marcelo Arenas, Reinhard Pichler et al.
Year: 2025Source: arXiv (Cornell University)
When query evaluation produces too many tuples, a new approach in query answering is to retrieve a diverse subset of them. The standard approach for measuring the diversity of a set of tuples is to us...

My Private–Public Sphere: Women’s Information Strategies in Times of News Mistrust

Authors: Magdalena Saldaña, Isabel Pavez, Claudia Lagos Lira et al.
Year: 2025Source: Journalism & Mass Communication Quarterly
Problematic information, such as mis- and disinformation, circulating in fragmented news ecosystems, has contributed to mistrust and information fatigue. Using survey data ( N = 2,117) and two focus g...

The Missing Link: Identifying Digital Intermediaries in E‐Government

Authors: Sergio Toro, Sebastián Valenzuela, Teresa Correa et al.
Year: 2025Source: Public Administration Review

Shrec 2025: Partial Retrieval Benchmark

Authors: Benjamín Bustos, Ivan Sipiran, Silvia Biasotti et al.
Year: 2025Source: Computers & Graphics
Partial retrieval is a long-standing problem in the 3D Object Retrieval community. Its main difficulties arise from how to define 3D local descriptors in a way that makes them effective for partial re...

Human-AI Coevolution (Abstract Reprint)

Authors: Ricardo Baeza-Yates, Dino Pedreschi, Alistair Knott et al.
Year: 2025
Human-AI coevolution, defined as a process in which humans and AI algorithms continuously influence each other, increasingly characterises our society, but is understudied in artificial intelligence a...

Artificial Intelligence and Peacebuilding: Opportunities and Challenges

Authors: Sebastián Valenzuela, Philip N. Howard, Fredrick Ogenga et al.
Year: 2025
A high-level précis of this Technical Paper can be found in the Summary for Policymakers report, Artificial Intelligence for Peacebuilding: Promises and Pitfalls. Artificial intelligence (AI) is rapi...

A Uniform Language for Safety, Robustness and Explainability

Authors: Pablo Barceló, Vaishak Belle
Year: 2025Source: Lecture notes in computer science

Personalized MRI-based characterization of subcortical anomalies in Ataxia-Telangiectasia using deep-learning

Authors: Denis Parra, Robert A. Dineen, Cristian Salazar-Vilches et al.
Year: 2025Source: PLoS ONE
Background Cerebellar atrophy is a known feature of ataxia-telangiectasia (A-T). However, basal ganglia dysfunction contributing to extrapyramidal movement disorders in A-T remains understudied. Objec...

Slicing of Probabilistic Programs: A Review of Existing Approaches

Authors: Federico Olmedo
Year: 2025Source: ACM Computing Surveys
Program slicing aims to simplify programs by identifying and removing non-essential parts while preserving program behavior. It is widely used for program understanding, debugging, and software mainte...

Engineering rank/select data structures for large-alphabet strings

Authors: Diego Arroyuelo, Erick Sepúlveda, Francisco Riveros et al.
Year: 2025Source: The Computer Journal
Abstract Large-alphabet strings, prevalent in information retrieval and natural language processing, pose unique storage and processing challenges. This paper explores the efficient implementation of ...

Introduction to the Special Issue on Temporal Web: Studying Time and the Temporal Dimension

Authors: Ricardo Baeza-Yates, Omar Alonso, Marc Spaniol
Year: 2025Source: ACM Transactions on the Web

On Computing Probabilistic Explanations for Decision Trees

Authors: Pablo Barceló, Marcelo Arenas, Bernardo Subercaseaux et al.
Year: 2025Source: Journal of Artificial Intelligence Research
Formal XAI (explainable AI) is a growing area that focuses on computing explanations with mathematical guarantees for the decisions made by ML models. Inside formal XAI, one of the most studied cases ...

WIP: Does this Course Need a Well-being Teaching Assistant?

Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo et al.
Year: 2025

Fast and Small Subsampled R-indexes

Authors: Gonzalo Navarro, Travis Gagie, Dustin Cobas
Year: 2025Source: ACM Transactions on Algorithms
The \(r\) -index (Gagie et al., JACM 2020) represented a breakthrough in compressed indexing of repetitive text collections, outperforming its alternatives by orders of magnitude in query time. Its sp...

Sex differences in work-related accidents extracted from free text in Spanish using natural language processing

Authors: Jocelyn Dunstan, Víctor Rocco, Daniela Moyano et al.
Year: 2025Source: BMC Public Health
By sharing our prompts and code, we aim to help other institutions and countries extract crucial information from free text to a controlled vocabulary of ILO. Future work includes the analysis of comm...

Uncovering the Hidden Biases in Personal Informatics

Authors: Ricardo Baeza-Yates, Athena Vakali, Pavlos Sermpezis et al.
Year: 2025Source: GetMobile Mobile Computing and Communications
Personal Informatics (PI) systems, such as apps and wearables that help users track physical activity, sleep, heart rate, or stress, have become critical tools for self-monitoring and health research....

Perceptual Evaluation of GANs and Diffusion Models for Generating X-rays

Authors: Cecilia Besa, Denis Parra, Gregory Schuit
Year: 2025Source: arXiv (Cornell University)
Generative image models have achieved remarkable progress in both natural and medical imaging. In the medical context, these techniques offer a potential solution to data scarcity-especially for low-p...

Corrections to “On the data complexity of consistent query answering over graph databases [Journal of Computer and System Sciences 88 (2017) 164–194]”

Authors: Pablo Barceló, Gaëlle Fontaine, Sophie Tison et al.
Year: 2025Source: Journal of Computer and System Sciences

Robust Dynamic Embedding for Gradual Typing

Authors: Matías Toro, Eric Tanter, Nicolas Tabareau et al.
Year: 2025Source: Proceedings of the ACM on Programming Languages
Gradual typing has long been advocated as a means to bridge the gap between static and dynamic typing disciplines, enabling a range of use cases such as the gradual migration of existing dynamically t...

Are Your Fairness Metrics Accurate? A Semi-Supervised Approach to Improving Fairness Estimates Under Sample Selection Bias

Authors: Ricardo Baeza-Yates, M. Clara De Paolis Kaluza, Shantanu Jain et al.
Year: 2025

CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray

Authors: Denis Parra, Pablo Messina, Álvaro Soto et al.
Year: 2025Source: Medical Image Analysis

ChatGPT as a Stable and Fair Tool for Automated Essay Scoring

Authors: Marcelo Mendoza, Miguél Nussbaum, Zvi Bekerman et al.
Year: 2025Source: Education Sciences
The evaluation of open-ended questions is typically performed by human instructors using predefined criteria to uphold academic standards. However, manual grading presents challenges, including high c...

(Worst-case) Optimal Adaptive Dynamic Bitvectors

Authors: Gonzalo Navarro
Year: 2025Source: Theory of Computing Systems

Public Knowledge and Expertise Under Authoritarian Siege: A Defense of Academic Freedom from Digital Journalism Studies

Authors: Magdalena Saldaña, Ramón Salaverría, Oscar Westlund et al.
Year: 2025Source: Digital Journalism
This article addresses the growing global assault on academic freedom—a cornerstone of democratic societies now under increasing threat from authoritarian regimes. It highlights a global decline in ...

Reducing urban speed limits decreases work-related traffic injury severity: Evidence from Santiago, Chile

Authors: Matías Toro, Eduardo Graells-Garrido, Gabriel Mansilla et al.
Year: 2025Source: Travel Behaviour and Society

Cross-Lingual Cross-Domain Transfer Learning for Rumor Detection

Authors: Marcelo Mendoza, Mauricio Solar, Eliana Providel
Year: 2025Source: Future Internet
This study introduces a novel method that merges propagation-based transfer learning with word embeddings for rumor detection. This approach aims to use data from languages with abundant resources to ...

Querying Graph Data: Where We Are and Where To Go

Authors: Domagoj Vrgoč, Leonid Libkin, Wim Martens et al.
Year: 2025
Although graph query languages such as Cypher, SQL/PGQ, and GQL take inspiration from theoretical languages such as conjunctive regular path queries (CRPQs), their pattern matching facilities are sign...

Rel: A Programming Language for Relational Data

Authors: Domagoj Vrgoč, Leonid Libkin, Wim Martens et al.
Year: 2025
Rel is a new relational language whose key design goal is to allow both database querying and programming in the large without relying on the currently dominant paradigm in which a query sublanguage i...

CORE+: A Complex Event Recognition Engine in C++

Authors: Cristian Riveros, Stijn Vansummeren, Vicente Calisto et al.
Year: 2025
Complex Event Recognition (CER) refers to the activity of analyzing streams of continuously arriving event data, to recognize collections of events that satisfy user-defined patterns. CER is known to ...

Editorial

Authors: Bárbara Poblete, Makoto P. Kato, H. Liu et al.
Year: 2025Source: Information Retrieval Research
This editorial celebrates the first issue of the Information Retrieval Research Journal, IRRJ.

A Systematic Review of User-Centred Evaluation of Explainable AI in Healthcare

Authors: Kristýna Sirka Kacafírková, Maxwell Szymanski, Katrien Verbert et al.
Year: 2025Source: arXiv (Cornell University)
Despite promising developments in Explainable Artificial Intelligence, the practical value of XAI methods remains under-explored and insufficiently validated in real-world settings. Robust and context...

Gradual Sensitivity Typing

Authors: Matías Toro, Eric Tanter, Damián Árquez et al.
Year: 2025

Using publicly available data for predicting socioeconomic values in urban context

Authors: Juan Reutter, Mario Miguel Ojeda, Juan L. Reutter
Year: 2025Source: Computational Urban Science
Abstract Urban transportation networks are recognized for their pivotal role in forecasting city indicators and facilitating efficient planning and management. However, despite the increase of methodo...

Complex Event Recognition under Time Constraints: Towards a Formal Framework for Efficient Query Evaluation

Authors: Cristian Riveros, Jaime García
Year: 2025Source: Proceedings of the ACM on Management of Data
Complex Event Recognition (CER) establishes a relevant solution for processing streams of events, giving users timely information. CER systems detect patterns in real-time, producing complex events an...

Accurate and Efficient Solid Waste Recognition: A Novel Approach Using Google Teachable Machine Based on Convolutional Neural Network (CNN)

Authors: Marcelo Mendoza, László Duma
Year: 2025

Explaining k -Nearest Neighbors: Abductive and Counterfactual Explanations

Authors: Pablo Barceló, Bernardo Subercaseaux, Miguel Romero et al.
Year: 2025Source: Proceedings of the ACM on Management of Data
Despite the wide use of k -Nearest Neighbors as classification models, their explainability properties remain poorly understood from a theoretical perspective. While nearest neighbors classifiers offe...

Characterizing Knowledge Manipulation in a Russian Wikipedia Fork

Authors: Ricardo Baeza-Yates, Diego Sáez-Trumper, Pablo Aragón et al.
Year: 2025Source: Proceedings of the International AAAI Conference on Web and Social Media
Wikipedia is powered by MediaWiki, a free and open-source software that is also the infrastructure for many other wiki-based online encyclopedias. These include the recently launched website Ruwiki, w...

SPLASH-SegFormer Pipeline: A Transformer-Based Approach for High-Resolution and Low-Cost Laser Scanner Seafloor Mapping

Authors: Hans Löbel, Javiera Fuentes-Guíñez, Giancarlo Troni
Year: 2025Source: IEEE Robotics and Automation Letters
High-resolution seafloor mapping continues to be challenging, primarily due to the high costs and complexity of traditional sensors. Laser scanners offer a more affordable alternative, using a monocul...

Information Integrity about Climate Science: A Systematic Review

Authors: Heather Ford, Eni Mustafaraj, Gizem Ceylan et al.
Year: 2025
A high-level précis of this Synthesis Report can be found in the Summary for Policymakers report, Facts, Fakes, and Climate Science. The human response to the climate crisis is being obstructed and d...

Facts, Fakes, and Climate Science: Recommendations for Improving Information Integrity about Climate Science

Authors: Sebastián Valenzuela, Philip N. Howard, Jusen Asuka et al.
Year: 2025
This Summary for Policymakers provides a high-level précis of the Synthesis Report, Information Integrity about Climate Science: A Systematic Review. The human response to the climate crisis is being...

Evaluating the Performance of Large Language Models on the CONACEM Anesthesiology Certification Exam: A Comparison with Human Participants

Authors: Marcelo Mendoza, Andrés Neyem, Fernando Altermatt et al.
Year: 2025Source: Applied Sciences
Large Language Models (LLMs) have demonstrated strong performance on English-language medical exams, but their effectiveness in non-English, high-stakes environments is less understood. This study ben...

Regulatory Initiatives in AI

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025Source: CRC Press eBooks

Explainable Artificial Intelligence

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025Source: CRC Press eBooks

Perspectives and Challenges

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025Source: CRC Press eBooks

Fairness, Accountability, and Transparency in AI

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025Source: CRC Press eBooks

What is AI Ethics?

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025Source: CRC Press eBooks

Beyond the Mainstream: Sustainability and the Replicability Crisis

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025Source: CRC Press eBooks

Bias in Al

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025Source: CRC Press eBooks

NLP and Representational Bias

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025Source: CRC Press eBooks

Transformers and Generative AI

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025Source: CRC Press eBooks

Benefits and Risks of LLMs

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025Source: CRC Press eBooks

Ethics in Artificial Intelligence and Information Technologies

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025

Visual Transformers and the Rise of Multimodality

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025Source: CRC Press eBooks

A Sociotechnical Approach to Integrate Ethics into AI Projects

Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025Source: CRC Press eBooks

Practical Adaptive Dynamic Bitvectors

Authors: Gonzalo Navarro
Year: 2025Source: Software Practice and Experience
ABSTRACT Introduction While operations rank and select on static bitvectors can be supported in constant time, lower bounds show that this is impossible when supporting updates; practical implementati...

Large Language Models in Crisis Informatics for Zero and Few-Shot Classification

Authors: Bárbara Poblete, Andrés Abeliuk, Cinthia Sánchez
Year: 2025Source: ACM Transactions on the Web
This article presents an exploration of the use of pre-trained Large Language Models (LLMs) for crisis classification to address labeled data dependency issues. We present a methodology that enhances ...

The Role of Organizations in Networked Mobilization: Examining the 2011 Chilean Student Movement Through The Logic of Connective Action

Authors: Denis Parra, Carolina Pérez-Arredondo, Diego Gómez-Zará
Year: 2025
This study examines the communication mechanisms that shape the formation of digitally-enabled mobilization networks. Informed by the logic of connective action, we postulate that the emergence of net...

Novel SIMEX algorithm for autoregressive models to estimate AGN variability

Authors: Susana Eyheramendy, Wilfredo Palma, Felipe Elorrieta et al.
Year: 2025Source: Monthly Notices of the Royal Astronomical Society
Abstract The origin of the variability in accretion disks of active galactic nuclei (AGN) is still unknown, but its behavior can be characterized by modeling the time series of optical wavelength flux...

Foreword to the special section on 3D object retrieval 2024 symposium (3DOR2024)

Authors: Benjamín Bustos, Ivan Sipiran, Tobias Schreck et al.
Year: 2025Source: Computers & Graphics

Imitating Human Reasoning to Extract 5W1H in News

Authors: Marcelo Mendoza, Hans Löbel, Carlos Muñoz et al.
Year: 2025
Extracting key information from news articles is crucial for advancing search systems.Historically, the 5W1H framework, which organises information based on 'Who', 'What', 'When', 'Where', 'Why', and ...

15th Temporal Web Analytics Workshop (TempWeb) Overview

Authors: Ricardo Baeza-Yates, Omar Alonso, Marc Spaniol
Year: 2025

Performance of single-agent and multi-agent language models in Spanish language medical competency exams

Authors: Marcelo Mendoza, Andrés Neyem, Fernando Altermatt et al.
Year: 2025Source: BMC Medical Education
Abstract Background Large language models (LLMs) like GPT-4o have shown promise in advancing medical decision-making and education. However, their performance in Spanish-language medical contexts rema...

Elucidating Type Conversions in SQL Engines

Authors: Matías Toro, Eric Tanter, Claudio Gutiérrez et al.
Year: 2025Source: Lecture notes in computer science
Abstract Practical SQL engines differ in subtle ways in their handling of typing constraints and implicit type casts. These issues, usually not considered in formal accounts of SQL, directly affect th...

The Role of Generative AI Use in 2024 Elections Worldwide

Authors: Sebastián Valenzuela, Philip N. Howard, Inga Kristina Trauthig
Year: 2025
A high-level précis of the Technical Paper can be found in the Summary for Policymakers report, Generative AI in Electoral Campaigns: Mapping Global Patterns. GenAI is being deployed in many ways dur...

Generative AI in Electoral Campaigns: Mapping Global Patterns

Authors: Sebastián Valenzuela, Philip N. Howard, Inga Kristina Trauthig
Year: 2025
This Summary for Policymakers provides a high-level précis of the Technical Paper, The Role of Generative AI Use in 2024 Elections Worldwide. GenAI is being deployed in many ways during elections, ra...

Correction: Cross-lingual hate speech detection using domain-specific word embeddings

Authors: Bárbara Poblete, Ayme Arango Monnar, Jorge Perez Rojas
Year: 2025Source: PLoS ONE
[This corrects the article DOI: 10.1371/journal.pone.0306521.].

Worst-Case-Optimal Joins on Graphs with Topological Relations

Authors: Aidan Hogan, Juan Reutter, Gonzalo Navarro et al.
Year: 2025
Spatial data play an important role in many applications built over knowledge graphs, and are frequently referenced in queries posed to public query services, such as that of Wikidata.Querying for spa...

Repetitiveness Measures Based on String Morphisms

Authors: Gonzalo Navarro, Cristian Urbina
Year: 2025Source: Theoretical Computer Science

Probabilistic Explanations for Linear Models

Authors: Marcelo Arenas, Bernardo Subercaseaux, Kuldeep S. Meel
Year: 2025Source: Proceedings of the AAAI Conference on Artificial Intelligence
Formal XAI is an emerging field that focuses on providing explanations with mathematical guarantees for the decisions made by machine learning models. A significant amount of work in this area is cent...

Complex event recognition under time constraints: towards a formal framework for efficient query evaluation

Authors: Cristian Riveros, Jaime García
Year: 2025Source: arXiv (Cornell University)
This work studies Complex Event Recognition (CER) under time constraints regarding its query language, computational models, and streaming evaluation algorithms. We start by introducing an extension o...

Advancing the Study of Political Misinformation Across Countries and Platforms—Introduction to the Special Issue

Authors: Sebastián Valenzuela, Edson C. Tandoc, Frank Esser et al.
Year: 2025Source: The International Journal of Press/Politics
The global spread of political misinformation poses serious challenges to democracies, eroding trust and distorting public discourse. However, research has largely focused on WEIRD countries—Western...

Bridging Inequality Gaps: Sustainable Journalism in the News Coverage of Education Policies

Authors: Magdalena Saldaña, Valentina Proust, Cristian Cabalín et al.
Year: 2025Source: Journalism Practice
By conducting a content analysis of 331 news stories, this study observed how six news organizations covered Chile's new school admission system (SAE) for enrolling in K-12 schools. To identify the pr...

Causality-Based Scores Alignment in Explainable Data Management

Authors: Felipe Azua, Leopoldo Bertossi
Year: 2025Source: arXiv (Cornell University)
Different attribution scores have been proposed to quantify the relevance of database tuples for query answering in databases; e.g. Causal Responsibility, the Shapley Value, the Banzhaf Power-Index, a...

HealthIUI: Workshop on Intelligent and Interactive Health User Interfaces

Authors: Denis Parra, Peter Brusilovsky, Shriti Raj et al.
Year: 2025

Logical Expressiveness of Graph Neural Networks on Knowledge Graphs

Authors: Pablo Barceló, Miguel Romero, İsmail İlkan Ceylan et al.
Year: 2025Source: Frontiers in artificial intelligence and applications
Graph neural networks are prominent models for representation learning over graph-structured data. While the capabilities and limitations of these models are well-understood for simple graphs, our und...

Dialogue on difference: Identity and political communication

Authors: Rachel Griffin, Teresa Y. Smith, Magdalena Saldaña et al.
Year: 2025Source: UNC Libraries
Identity is a crucial force in every facet of contemporary politics, but political communication research has too often addressed it only superficially, excluded it from the subfield’s primary f...

NLP modeling recommendations for restricted data availability in clinical settings

Authors: Felipe Bravo-Márquez, Jocelyn Dunstan, Fabián Villena
Year: 2025Source: BMC Medical Informatics and Decision Making
Abstract Background Clinical decision-making in healthcare often relies on unstructured text data, which can be challenging to analyze using traditional methods. Natural Language Processing (NLP) has ...

A Shopping Agent for Addressing Subjective Product Needs

Authors: Bárbara Poblete, Preetam Prabhu Srikar Dammu, Omar Alonso
Year: 2025
In e-commerce, customers often struggle to find relevant items when their needs involve subjective properties characterized by personal or collective perception, tastes, and opinions, which are typica...

Dialogue on difference: Identity and political communication

Authors: Magdalena Saldaña, Sebastián Valenzuela, Khadijah Costley White et al.
Year: 2025Source: Communication Monographs

Constant-delay enumeration for SLP-compressed documents

Authors: Cristian Riveros, Martı́n Muñoz
Year: 2025Source: Logical Methods in Computer Science
We study the problem of enumerating results from a query over a compressed document. The model we use for compression are straight-line programs (SLPs), which are defined by a context-free grammar tha...

How Expressive are Knowledge Graph Foundation Models?

Authors: Pablo Barceló, Juan Reutter, Michael M. Bronstein et al.
Year: 2025Source: arXiv (Cornell University)
Knowledge Graph Foundation Models (KGFMs) are at the frontier for deep learning on knowledge graphs (KGs), as they can generalize to completely novel knowledge graphs with different relational vocabul...

A Comparison of Human and Machine Learning Errors in Face Recognition

Authors: Ricardo Baeza-Yates, Carlos Castillo, Marina Estévez-Almenzar
Year: 2025Source: arXiv (Cornell University)
Machine learning applications in high-stakes scenarios should always operate under human oversight. Developing an optimal combination of human and machine intelligence requires an understanding of the...

Patterns of Persistence: Studying News Repertoires Before, During, and After Covid-19

Authors: Sebastián Valenzuela, Ingrid Bachmann, Natalia Solís Valdés
Year: 2025Source: Journalism Studies
In the realm of news consumption, individuals often establish recurrent patterns, integrating diverse sources into distinct repertoires. However, these patterns can change during unprecedented events ...

Generalized straight-line programs

Authors: C. Urbina, Gonzalo Navarro, Francisco Javier Vidal Olivares
Year: 2025Source: Acta Informatica

Enhancing contact recommendation in social platforms through mental health awareness: Exploring Anorexia Nervosa as a case study

Authors: Ricardo Baeza-Yates, Diana Ramírez‐Cifuentes, Ana Freire et al.
Year: 2025Source: PLoS ONE
We analyze and propose a solution for the exposure of vulnerable users to harmful content during their interaction with contact recommender systems in social platforms. Our approach is dedicated to ma...

Preface to the special issue on “Artificial Intelligence‐driven Decision Making in Health and Medicine”

Authors: Leopoldo Bertossi, Herb Kunze, Davide La Torre et al.
Year: 2025Source: International Transactions in Operational Research

Digital Journalism (Studies): An Agenda for the Future

Authors: Magdalena Saldaña, Ramón Salaverría, Oscar Westlund et al.
Year: 2025Source: Digital Journalism
Digital Journalism has an important role to play in encouraging and publishing research with societal relevance that advances digital journalism studies as a field. In this article we discuss the mult...

The Causal-Effect Score in Data Management

Authors: Leopoldo Bertossi, Felipe Azua
Year: 2025Source: arXiv (Cornell University)
The Causal Effect (CE) is a numerical measure of causal influence of variables on observed results. Despite being widely used in many areas, only preliminary attempts have been made to use CE as an at...

Developing and Validating an Automatic Support System for Tumor Coding in Pathology Reports in Spanish

Authors: Jocelyn Dunstan, Fabián Villena, Matías Rojas et al.
Year: 2025Source: JCO Clinical Cancer Informatics
These results demonstrate the feasibility of implementing natural language processing tools in the routine of a cancer center to extract and code valuable information from pathology reports. Our recom...

Towards A Global AI Auditing Framework: Assessment and Recommendations

Authors: Marcelo Mendoza, Sebastián Valenzuela, Janaki Srinivasan et al.
Year: 2025
A high-level précis of the Synthesis Report can be found in the Summary for Policymakers Recommendations for a Global AI Auditing Framework: Summary of Standards and Features. The growing integration...

Towards Computer-Using Personal Agents

Authors: Aidan Hogan, Katja Hose, Olaf Hartig et al.
Year: 2025Source: arXiv (Cornell University)
Computer-Using Agents (CUA) enable users to automate increasingly-complex tasks using graphical interfaces such as browsers. As many potential tasks require personal data, we propose Computer-Using Pe...

A Comunication Framework for Compositional Generation

Authors: Denis Parra, Rafael Elberg, Mircea Petrache
Year: 2025Source: arXiv (Cornell University)
Compositionality and compositional generalization--the ability to understand novel combinations of known concepts--are central characteristics of human language and are hypothesized to be essential fo...

Semantic Web and Creative AI -- A Technical Report from ISWS 2023

Authors: Frank van Harmelen, Anna Sofia Lippolis, John Domingue et al.
Year: 2025Source: arXiv (Cornell University)
The International Semantic Web Research School (ISWS) is a week-long intensive program designed to immerse participants in the field. This document reports a collaborative effort performed by ten team...

A frustratingly easy way of extracting political networks from text

Authors: Naim Bro
Year: 2025Source: PLoS ONE
This study demonstrates the use of GPT-4 and variants, advanced language models readily accessible to many social scientists, in extracting political networks from text. This approach showcases the no...

Novel SIMEX algorithm for autoregressive models to estimate AGN variability

Authors: Felipe Elorrieta, E. Camacho, S. Eyheramendy et al.
Year: 2025Source: arXiv (Cornell University)
The origin of the variability in accretion disks of active galactic nuclei (AGN) is still unknown, but its behavior can be characterized by modeling the time series of optical wavelength fluxes coming...

Ehrenfeucht-Haussler Rank and Chain of Thought

Authors: Pablo Barceló, Tomasz Steifer, Alexander Kozachinskiy
Year: 2025Source: arXiv (Cornell University)
The notion of rank of a Boolean function has been a cornerstone in the theory of PAC learning, enabling quasipolynomial-time learning algorithms for polynomial-size decision trees. We present a novel ...

FairXAI -A Taxonomy and Framework for Fairness and Explainability Synergy in Machine Learning

Authors: Ricardo Baeza-Yates, Fredrik Heintz, Resmi Ramachandranpillai et al.
Year: 2025Source: IEEE Transactions on Neural Networks and Learning Systems
Explainable artificial intelligence (XAI) and fair learning have made significant strides in various application domains, including criminal recidivism predictions, healthcare settings, toxic comment ...

B-Call: Integrating Ideological Position and Political Cohesion in Legislative Voting Models

Authors: Sergio Toro, Daniel Alcatruz, M. Aníbal Valenzuela et al.
Year: 2025Source: arXiv (Cornell University)
This paper combines two significant areas of political science research: measuring individual ideological position and cohesion. Although both approaches help analyze legislative behaviors, no unified...

The Missing Link: Identifying Digital Intermediaries in E-Government

Authors: Sergio Toro, Sebastián Valenzuela, Teresa Correa et al.
Year: 2025Source: arXiv (Cornell University)
The digitalization of public administration has advanced significantly on a global scale. Many governments now view digital platforms as essential for improving the delivery of public services and fos...

Unsupervised Framing Analysis for Social Media Discourse in Polarizing Events

Authors: Hernán Sarmiento, Felipe Bravo-Márquez, Sebastián Valenzuela et al.
Year: 2025Source: ACM Transactions on the Web
This study investigates the concept of frames in the realm of online polarization, with a focus on social media platforms. The research extends the understanding of how frames–emerging, complex, and...

Mid-Career Reflections: Climbing the Academic Ladder Without a Safety Net

Authors: Marcelo Arenas
Year: 2025Source: ACM SIGMOD Record
As I supposed everyone else did when asked by Tamer to write an article with Advice to Mid-Career Researchers, I read all the previous articles in this series to understand what this paper should be a...

Explaining k-Nearest Neighbors: Abductive and Counterfactual Explanations

Authors: Pablo Barceló, Miguel Romero Orth, Alexander Kozachinskiy et al.
Year: 2025Source: arXiv (Cornell University)
Despite the wide use of $k$-Nearest Neighbors as classification models, their explainability properties remain poorly understood from a theoretical perspective. While nearest neighbors classifiers off...

Curcumin Improves Hippocampal Cell Bioenergetics, Redox and Inflammatory Markers, and Synaptic Proteins, Regulating Mitochondrial Calcium Homeostasis

Authors: Sebastián Valenzuela, Alfonso González, Cláudio Retamal et al.
Year: 2025Source: Neurotoxicity Research

All Your Base Are Belong to Us: Sort Polymorphism for Proof Assistants

Authors: Gaëtan Gilbert, Josselin Poiret, Éric Tanter et al.
Year: 2025Source: Proceedings of the ACM on Programming Languages
Proof assistants based on dependent type theory, such as Coq, Lean and Agda, use different universes to classify types, typically combining a predicative hierarchy of universes for computationally-rel...

DWUG ES: Diachronic Word Usage Graphs for Spanish

Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2025Source: Zenodo (CERN European Organization for Nuclear Research)
This data collection contains diachronic Word Usage Graphs (WUGs) for Spanish. Find a description of the data format, code to process the data and further datasets on the WUGsite. Please find more inf...

When is the Computation of a Feature Attribution Method Tractable?

Authors: Pablo Barceló, Micaela Morgado, Roberto Cominetti
Year: 2025Source: arXiv (Cornell University)
Feature attribution methods have become essential for explaining machine learning models. Many popular approaches, such as SHAP and Banzhaf values, are grounded in power indices from cooperative game ...

TOI-4504: Exceptionally Large Transit Timing Variations Induced by Two Resonant Warm Gas Giants in a Three-planet System

Authors: Susana Eyheramendy, Andrés Jordán, Néstor Espinoza et al.
Year: 2025Source: The Astrophysical Journal Letters
Abstract We present a joint analysis of transit timing variations (TTVs) and Doppler data for the transiting exoplanet system TOI-4504. TOI-4504 c is a warm Jupiter-mass planet that exhibits the large...

14 Kg of CO2: Analyzing the Carbon Footprint and Performance of Session-Based Recommendation Algorithms

Authors: Jessie Gil, Alejandro Plaza, Denis Parra
Year: 2025Source: Communications in computer and information science

Correction to: The Semantic Web – ISWC 2024

Authors: Aidan Hogan, Daniel Hernández, Katja Hose et al.
Year: 2025Source: Lecture notes in computer science

Top-k Document Retrieval in Compressed Space

Authors: Gonzalo Navarro, Yakov Nekrich
Year: 2025Source: Society for Industrial and Applied Mathematics eBooks
Let 𝓓 be a collection of D strings of total length n over an alphabet of size σ. We consider the so-called top-k document retrieval problem: given a short string P and an integer k, list the ident...

A Theoretical Bound which Improves the Performance of Compilation-Based Multi-Agent Path Finding

Authors: Jorge Baier, Roberto Asín‐Achá, Rodrigo López
Year: 2025Source: IEEE Access

Benchmarking zero-shot biomedical relation triplet extraction across language model architectures

Authors: Marcelo Mendoza, Frederik Steensgaard Gade, Ole Lund
Year: 2025

Graph-Linguistic Fusion: Using Language Models for Wikidata Vandalism Detection

Authors: Ricardo Baeza-Yates, Mykola Trokhymovych, Diego Sáez Trumper et al.
Year: 2025

In-Memory Object Graph Stores

Authors: Benjamin A. Steer, Minh-Duc Pham, Josep Lluís Larriba Pey et al.
Year: 2025Source: arXiv (Cornell University)
We present a design and implementation of an in-memory object graph store, dubbed εStore. Our key innovation is a storage model - epsilon store - that equates an object on the heap to a node in a gra...

Artificial Intelligence Enhanced Colposcopy Supports Early Detection of High Grade Cervical Intraepithelial Neoplasia in HPV Positive Individuals

Authors: Claudio Gutiérrez, Andrea Weitoschova, Sandhya Yerra et al.
Year: 2025Source: International Journal of Research Studies in Microbiology and Biotechnology

Hybrid framework for automated generation of mammography radiology reports

Authors: Denis Parra, Eduardo Godoy, Rodrigo Salas et al.
Year: 2025Source: Computational and Structural Biotechnology Journal
Breast cancer remains a significant health concern for women at various stages of life, impacting both productivity and reproductive health. Recent advancements in deep learning (DL) have enabled subs...

Complexity of Consistent Query Answering in Databases under Cardinality-Based and Incremental Repair Semantics (extended version)

Authors: Leopoldo Bertossi, Andrei Lopatenko
Year: 2025Source: arXiv (Cornell University)
A database D may be inconsistent wrt a given set IC of integrity constraints. Consistent Query Answering (CQA) is the problem of computing from D the answers to a query that are consistent wrt IC . Co...

Screening Dyslexia Using Visual Auditory Computer Games and Machine Learning

Authors: Ricardo Baeza-Yates, Luz Rello, Maria Rauschenberger et al.
Year: 2025Source: IEEE Access
Reading acquisition is one the main keys for school success and a crucial component for empowering individuals to participate meaningfully in society. Yet, it is still a challenging skill to acquire f...

Advancing AI Incidents Classification: Leveraging LLMs with Strategic Prompting

Authors: Ricardo Baeza-Yates, Yian Chen, Lana Do et al.
Year: 2025Source: Communications in computer and information science

A Framework for Extraction and Transformation of Documents

Authors: Cristian Riveros, Nicole Schweikardt, Markus L. Schmid
Year: 2025Source: arXiv (Cornell University)
We present a theoretical framework for the extraction and transformation of text documents. We propose to use a two-phase process where the first phase extracts span-tuples from a document, and the se...

Adapting Bias Evaluation to Domain Contexts using Generative Models

Authors: Valentin Barrière, Tamara Quiroga, Felipe Bravo-Márquez
Year: 2025

Optimizing the Performance of the FM-Index for Large-Scale Data

Authors: Dustin Cobas, Gonzalo Navarro, Travis Gagie
Year: 2025Source: arXiv (Cornell University)
The FM-index is a fundamental data structure used in bioinformatics to efficiently search for strings and index genomes. However, the FM-index can pose computational challenges, particularly in the co...

A TWO-STAGE STOCHASTIC PROGRAMMING MODEL FOR THE MID-TERM OIL REFINERY PLANNING UNDER UNCERTAIN DEMAND

Authors: Juan Pablo Luna, Virgílio José Martins Ferreira Filho, Leonardo Nascimento
Year: 2025Source: Anais do Simpósio Brasileiro de Pesquisa Operacional

Output Bounds for Conjunctions of Path Queries

Authors: Juan Reutter, Domagoj Vrgoč, Tamara Cucumides
Year: 2025Source: SSRN Electronic Journal

IALab UC at BEA 2025 Shared Task: LLM-Powered Expert Pedagogical Feature Extraction

Authors: Jorge Baier, Sofía Correa Busquets, Valentina Córdova Véliz
Year: 2025

Database Theory in Action: Cypher, GQL, and Regular Path Queries

Authors: Domagoj Vrgoč, Oskar van Rest, Stefan Plantikow et al.
Year: 2025Source: arXiv (Cornell University)
Cypher has so far been the most commonly used query language for property graphs, and served as the foundation of the recently standardized graph query language GQL. In designing the features of GQL, ...

Assessment of the acceptability, proximate properties, and product cost of amylase-enhanced mixed cassava and sweet potato syrup

Authors: Marcelo Mendoza
Year: 2025Source: Pantao, international journal of the humanities and social sciences
The variety of goods obtained from root crops, particularly cassava and sweet potatoes, is getting low, thereby affecting their sustainability. The researcher has produced a syrup by combining cassava...

Dynamic Direct Access of MSO Query Evaluation over Strings

Authors: Cristian Riveros, Pierre Bourhis, Stefan Mengel et al.
Year: 2025Source: arXiv (Cornell University)
We study the problem of evaluating a Monadic Second Order (MSO) query over strings under updates in the setting of direct access. We present an algorithm that, given an MSO query with first-order free...

Modeling and Comparative Scenario - Based Simulation of SmartBottle+: An Artificial Intelligence (AI) - Powered Recycling Reward System versus Hungary’s Conventional Reverse Vending Machines (RVMs)

Authors: Marcelo Mendoza, Mohamed Ammar Ahmed
Year: 2025Source: Procedia Computer Science
The growing need for sustainable waste management calls for more efficient and accessible recycling systems. This paper introduces SmartBottle+, an artificial intelligence (AI) powered recycling rewar...

Publications for 2024

Displaying 166 publication(s) for 2024

Probabilistic Explanations for Linear Models

Authors: Marcelo Arenas, Bernardo Subercaseaux, Kuldeep S. Meel
Year: 2024Source: arXiv (Cornell University)
Formal XAI is an emerging field that focuses on providing explanations with mathematical guarantees for the decisions made by machine learning models. A significant amount of work in this area is cent...

Clinical analogy resolution performance for foundation language models

Authors: Jocelyn Dunstan, Fabián Villena, Tamara Quiroga
Year: 2024Source: ACM Transactions on Computing for Healthcare
Using extensive data sources to create foundation language models has revolutionized the performance of deep learning-based architectures. This remarkable improvement has led to state-of-the-art resul...

Competing Frames and Melodrama: The Effects of Facebook Posts on Policy Preferences about COVID-19

Authors: Sebastián Valenzuela, Ingrid Bachmann, Daniel Halpern et al.
Year: 2024Source: Routledge eBooks
The tension between health and economic considerations regarding COVID-19 has resulted in a framing contest, in which proponents and adversaries of strong containment measures hold oppositional frames...

Glycemic Control With Layperson-Delivered Telephone Calls vs Usual Care for Patients With Diabetes

Authors: Sebastián Valenzuela, Mathew Sither, Rhonda Aubrey et al.
Year: 2024Source: JAMA Network Open
Importance Diabetes is associated with emotional distress and poor mental health, especially for individuals with low income, hindering patients’ ability to manage their condition. The health care s...

Optimization of Bias Mitigation in Word Embeddings: a Methodological Approach

Authors: Felipe Bravo-Márquez, Mayteé Zambrano
Year: 2024

A Comparative Analysis of Offensive Discourse in the 2021 Chilean Presidential Campaign on Twitter and WhatsApp

Authors: Hernán Sarmiento, Felipe Bravo-Márquez, Sebastián Valenzuela et al.
Year: 2024

TOI-4504: Exceptionally large Transit Timing Variations induced by two resonant warm gas giants in a three planet system

Authors: Trifon Trifonov, M. Skarka, Néstor Espinoza et al.
Year: 2024Source: arXiv (Cornell University)
We present a joint analysis of TTVs and Doppler data for the transiting exoplanet system TOI-4504. TOI-4504 c is a warm Jupiter-mass planet that exhibits the largest known transit timing variations (T...

Bias in Retrieval Systems

Authors: Ricardo Baeza-Yates, Shiran Dudy, Leena Murgai
Year: 2024Source: Information Retrieval

Gradual C0: Symbolic Execution for Gradual Verification

Authors: Eric Tanter, Joshua Sunshine, Jonathan Aldrich et al.
Year: 2024Source: ACM Transactions on Programming Languages and Systems
Current static verification techniques such as separation logic support a wide range of programs. However, such techniques only support complete and detailed specifications, which places an undue burd...

Your house won’t be yours anymore!” Effects of Misinformation, News Use, and Media Trust on Chile’s Constitutional Referendum

Authors: Magdalena Saldaña, Sebastián Rivera, Ximena Orchard et al.
Year: 2024Source: The International Journal of Press/Politics
News consumption and voting behavior are interlinked and particularly important in elections where traditional political cleavages are not easily applicable. This relationship becomes more complex and...

Multi-label learning on low label density sets with few examples

Authors: Benjamín Bustos, Ivan Sipiran, Tobias Schreck et al.
Year: 2024Source: Expert Systems with Applications

ML-Based Classification of Hamstring Strain Injury from Nonlinear Features of Surface Electromyography Signals

Authors: Marcelo Mendoza, Ma. Belinda C. Fidel, Gian Angelo A. Calumpang et al.
Year: 2024Source: TENCON 2021 - 2021 IEEE Region 10 Conference (TENCON)

Recommendations for a Global AI Auditing Framework: Summary of Standards and Features

Authors: Marcelo Mendoza, Sebastián Valenzuela, Janaki Srinivasan et al.
Year: 2024
This Summary for Policymakers provides a high-level précis of the Synthesis Report Towards A Global AI Auditing Framework: Assessment and Recommendations. The growing integration of artificial intell...

Report on the 14th Workshop on Temporal Web Analytics (TempWeb 2024) at WWW 2024

Authors: Ricardo Baeza-Yates, Marc Spaniol, Ómar Alonso
Year: 2024Source: ACM SIGIR Forum
The TempWeb workshop (series) is an established co-located event at The Web Conference that aims at bringing together researchers and practitioners across various domains, taking the constantly evolvi...

PD155 RedETS Horizon Scanning: Impact In The Decision-Making Process

Authors: Gonzalo Navarro, Roland Pastells‐Peiró, Maria-Dolors Estrada et al.
Year: 2024Source: International Journal of Technology Assessment in Health Care
Introduction The RedETS horizon scanning (HS) program in Spain is focused on identifying non-pharmaceutical emerging health technologies. HS is organized in three steps: (i) identification using diffe...

Public adherence to the principles of criminal law in Chile: Shaping factors and consequences for trust in the criminal justice system

Authors: Magdalena Saldaña, Rodrigo González-Fuente, Omar A. Barriga et al.
Year: 2024Source: Política criminal

Adaptive Plane Reformatting for 4D Flow MRI using Deep Reinforcement Learning

Authors: Denis Parra, Cristián Tejos, Sergio Uribe et al.
Year: 2024Source: Proceedings on CD-ROM - International Society for Magnetic Resonance in Medicine. Scientific Meeting and Exhibition/Proceedings of the International Society for Magnetic Resonance in Medicine, Scientific Meeting and Exhibition
Motivation: The standard approach for plane reformatting in 4D flow MRI is manual, leading to time-consuming and user-dependent results. Goal(s): Our goal was to enhance plane reformatting in 4D flow ...

Evaluating regular path queries on compressed adjacency matrices

Authors: Gonzalo Navarro, Diego Arroyuelo, Adrián Gómez-Brandón et al.
Year: 2024Source: The VLDB Journal

PathFinder: Returning Paths in Graph Queries

Authors: Domagoj Vrgoč, Carlos Rojas, Wim Martens et al.
Year: 2024Source: Lecture notes in computer science

Restructuring Tractable Probabilistic Circuits

Authors: Marcelo Arenas, Guy Van den Broeck, Honghua Zhang et al.
Year: 2024Source: arXiv (Cornell University)
Probabilistic circuits (PCs) are a unifying representation for probabilistic models that support tractable inference. Numerous applications of PCs like controllable text generation depend on the abili...

Influence of regional anesthesia on fall risk in adults over 60 years

Authors: Aidan Hogan, Ursula Trinler, Paul Alfred Grützner et al.
Year: 2024Source: Clinical Biomechanics

Human-AI Coevolution

Authors: Paul Lukowicz, Frank Dignum, Albert‐László Barabási et al.
Year: 2024Source: Artificial Intelligence
Human-AI coevolution, defined as a process in which humans and AI algorithms continuously influence each other, increasingly characterises our society, but is understudied in artificial intelligence a...

Static Slicing for Probabilistic Programs: An Overview

Authors: Federico Olmedo
Year: 2024Source: Lecture notes in computer science

Reducing Interpretative Ambiguity in an educational environment with ChatGPT.

Authors: Marcelo Mendoza, Miguél Nussbaum, Zvi Bekerman et al.
Year: 2024Source: Computers & Education

Enhancing commit message quality in software capstone projects with generative AI

Authors: Marcelo Mendoza, Andrés Neyem, Juan Pablo Sandoval Alcocer et al.
Year: 2024Source: SoftwareX
Software Capstone Projects provide valuable hands-on experience for students in software development, and creating effective commit messages is an essential, though often challenging, part of this pro...

Towards Tractability of the Diversity of Query Answers: Ultrametrics to the Rescue

Authors: Cristian Riveros, Marcelo Arenas, Reinhard Pichler et al.
Year: 2024Source: Proceedings of the ACM on Management of Data
The set of answers to a query may be very large, potentially overwhelming users when presented with the entire set. In such cases, presenting only a small subset of the answers to the user may be pref...

Complex Event Recognition meets Hierarchical Conjunctive Queries

Authors: Cristian Riveros, Dante Pinto
Year: 2024Source: Proceedings of the ACM on Management of Data
Hierarchical conjunctive queries (HCQ) are a subclass of conjunctive queries (CQ) with robust algorithmic properties. Among others, Berkholz, Keppeler, and Schweikardt have shown that HCQ is the subcl...

Trends in the Global Information Environment: 2024 Expert Survey Results

Authors: Sebastián Valenzuela, Philip N. Howard, Sacha Altay
Year: 2024
The global information environment is under great pressure. How do experts around the world perceive the features of and threats to the information environment in the countries they study? In June 202...

Disjointed Polarization in Chile’s Enduring Crisis of Representation – ERRATUM

Authors: Juan Pablo Luna
Year: 2024Source: Latin American Politics and Society
An abstract is not available for this content. As you have access to this content, full HTML content is provided on this page. A PDF of this content is also available in through the 'Save PDF' action ...

BWBEV: A Bitwise Query Processing Algorithm for Approximate Prefix Search

Authors: Ricardo Baeza-Yates, Edleno Silva de Moura, Berg Ferreira et al.
Year: 2024Source: Journal of the Brazilian Computer Society
We tackle the challenge of conducting an approximate prefix search within datasets of strings. We explore using a bit-parallelism technique to compute the edit distance between distinct strings and il...

A Uniform Language to Explain Decision Trees

Authors: Pablo Barceló, Marcelo Arenas, Bernardo Subercaseaux et al.
Year: 2024
The formal XAI community has studied a plethora of interpretability queries aiming to understand the classifications made by decision trees. However, a more uniform understanding of what questions we ...

Computing MEMs and Relatives on Repetitive Text Collections

Authors: Gonzalo Navarro
Year: 2024Source: ACM Transactions on Algorithms
We consider the problem of computing the Maximal Exact Matches (MEMs) of a given pattern \(P[1\mathinner{.. }m]\) on a large repetitive text collection \(T[1\mathinner{.. }n]\) over an alphabet of siz...

Streaming enumeration on nested documents

Authors: Cristian Riveros, Martín Muñoz
Year: 2024Source: ACM Transactions on Database Systems
Some of the most relevant document schemas used online, such as XML and JSON, have a nested format. In the last decade, the task of extracting data from nested documents over streams has become especi...

Survey of data stories: Guidelines for data story authoring

Authors: Denis Parra, Manuela Garretón, Daniela Moyano et al.
Year: 2024Source: Information Visualization
Data stories are sequences of data facts connected through a meaningful narrative and combine data visualizations and storytelling to convey information effectively. They have gained popularity due to...

Correction: AI content detection in the emerging information ecosystem: new obligations for media and tech companies

Authors: Ricardo Baeza-Yates, David Eyers, Susan Leavy et al.
Year: 2024Source: Ethics and Information Technology

The Distributional Uncertainty of the SHAP Score in Explainable Machine Learning

Authors: Leopoldo Bertossi, Miguel Romero, Nina Pardal et al.
Year: 2024Source: Frontiers in artificial intelligence and applications
Attribution scores reflect how important the feature values in an input entity are for the output of a machine learning model. One of the most popular attribution scores is the SHAP score, which is an...

ARCHIE: Articulated Robot for Collaborative Highly Integrated Education

Authors: Marcelo Mendoza, E. E. Mendoza, Juan Saeteros et al.
Year: 2024

Merging Gradual Typing

Authors: Matías Toro, Wenjia Ye, Bruno C. d. S. Oliveira
Year: 2024Source: Proceedings of the ACM on Programming Languages
Programming language mechanisms with a type-directed semantics are nowadays common and widely used. Such mechanisms include gradual typing, type classes, implicits and intersection types with a merge ...

Dynamic direct access of MSO query evaluation over strings

Authors: Cristian Riveros, Pierre Bourhis, Stefan Mengel et al.
Year: 2024Source: arXiv (Cornell University)
We study the problem of evaluating a Monadic Second Order (MSO) query over strings under updates in the setting of direct access. We present an algorithm that, given an MSO query with first-order free...

Dynamic Direct Access of MSO Query Evaluation over Strings

Authors: Cristian Riveros, Pierre Bourhis, Stefan Mengel et al.
Year: 2024Source: arXiv (Cornell University)
We study the problem of evaluating a Monadic Second Order (MSO) query over strings under updates in the setting of direct access. We present an algorithm that, given an MSO query with first-order free...

Fast and Small Subsampled R-indexes

Authors: Gonzalo Navarro, Travis Gagie, Dustin Cobas
Year: 2024Source: arXiv (Cornell University)
The $r$-index represented a breakthrough in compressed indexing of repetitive text collections, outperforming its alternatives by orders of magnitude in query time. Its space usage, $O(r)$ where $r$ i...

AI content detection in the emerging information ecosystem: new obligations for media and tech companies

Authors: Ricardo Baeza-Yates, David Eyers, Susan Leavy et al.
Year: 2024Source: Ethics and Information Technology
The world is about to be swamped by an unprecedented wave of AI-generated content. We need reliable ways of identifying such content, to supplement the many existing social institutions that enable tr...

Sense through time: diachronic word sense annotations for word sense induction and Lexical Semantic Change Detection

Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg et al.
Year: 2024Source: Language Resources and Evaluation
Abstract There has been extensive work on human word sense annotation, i.e., manually labeling word uses in natural texts according to their senses. Such labels were primarily created for the tasks of...

Adaptive Dynamic Bitvectors

Authors: Gonzalo Navarro
Year: 2024Source: Lecture notes in computer science

Compressed Graph Representations for Evaluating Regular Path Queries

Authors: Gonzalo Navarro, J Robert
Year: 2024Source: Lecture notes in computer science

Political participation and technology

Authors: Sebastián Valenzuela, Marcelo Santos
Year: 2024Source: Routledge eBooks

Repetitive Patterns Recognition in Textures of Ancient Peruvian Pottery

Authors: Benjamín Bustos, Ivan Sipiran, Sebastian Sepulveda
Year: 2024Source: Journal on Computing and Cultural Heritage
We present a study and comparison of computer vision methods for the task of finding repetitive motifs in ancient Peruvian pottery. Under this context, the main difficulties for solving the task are t...

Expert Survey on the Global Information Environment 2024: Searching for Solutions

Authors: Sebastián Valenzuela, Philip N. Howard, Sacha Altay
Year: 2024
The global information environment is under significant pressure from the development of new technologies and shifting public policies. How do experts around the world perceive the varied features of,...

Responsible AI Day

Authors: Ricardo Baeza-Yates, Nataly Buslón
Year: 2024Source: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
We summarize the goals of the Responsible AI day, giving a glimpse on the program as well as a short biography of the organizers.

Automatic knowledge-graph creation from historical documents: The Chilean dictatorship as a case study

Authors: Antonia Fonck, Camila Díaz, Alejandro Grez et al.
Year: 2024Source: arXiv (Cornell University)
We present our results regarding the automatic construction of a knowledge graph from historical documents related to the Chilean dictatorship period (1973-1990). Our approach consists on using LLMs t...

Microbiota-dependent T-cell response to α-synuclein-derived antigens triggers the development of hypersensitivity and neuroinflammation associated with Parkinson's Disease

Authors: Sebastián Valenzuela, M Ricca, Valentina Ugalde et al.
Year: 2024Source: Research Square (Research Square)
<title>Abstract</title> <bold>Background</bold>. Previous evidence has shown that both the T-cell response and the microbiota play fundamental roles on the development of Parkinson's Disease (PD), whi...

Gradual Indexed Inductive Types

Authors: Eric Tanter, Kenji Maillard, Nicolas Tabareau et al.
Year: 2024Source: Proceedings of the ACM on Programming Languages
Indexed inductive types are essential in dependently-typed programming languages, enabling precise and expressive specifications of data structures and properties. Recognizing that programming and pro...

Self-Supervised Learning Applied to Variable Star Semi-Supervised Classification Using LSTM and GRU Networks

Authors: Hans Löbel, R. B. Merino, Billy Peralta et al.
Year: 2024
Recognizing variable stars is a task of interest in the astronomy community. Currently, this task has taken advantage of deep learning algorithms. However, these algorithms require a large amount of d...

Movelet Trees

Authors: Gonzalo Navarro, Travis Gagie, Giovanni Manzini et al.
Year: 2024Source: arXiv (Cornell University)
We combine Nishimoto and Tabei's move structure with a wavelet tree to show how, if $T [1..n]$ is over a constant-sized alphabet and its Burrows-Wheeler Transform (BWT) consists of $r$ runs, then we c...

WIP: Evaluation of the Third Design Cycle of the Wellbeing Teaching Assistant (WTA): Understanding What Type of Cases are Served Through a Categorization Analysis

Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo et al.
Year: 2024
Abstract Well-being is increasingly recognized as a key element to foster within higher education. In this context, our institution—a prominent engineering school in Latin America—has created the ...

Unpacking Student Workload through Elicitation Techniques: Perspectives from Engineering Faculty and Students

Authors: Jorge Baier, Isabel Hilliger, Erick Svec et al.
Year: 2024
Abstract This is a work-in-progress about student workload. Over the past two decades, practitioners and researchers have shown concern for student workload within engineering programs. Since the late...

WIP: Exploring the Effects of a Purpose-in-Life Reflection Activity in an Introductory Artificial Intelligence Course

Authors: Jorge Baier, Trini Balart, Kristi Shryock et al.
Year: 2024
Abstract Sense of purpose in life is related to actively choosing to work for the benefit of society and has been recognized as a key influencer of well-being which in turn has been established to be ...

WIP: Traditional Engineering Assessments Challenged by ChatGPT: An Evaluation of its Performance on a Fundamental Competencies Exam

Authors: Jorge Baier, Trini Balart, Martín Castillo
Year: 2024
Abstract ChatGPT, a chatbot which produces text with remarkable coherence, is leading higher education institutions to question the relevance of the current model of engineering education and, particu...

Complex event recognition meets hierarchical conjunctive queries

Authors: Cristian Riveros, Dante Pinto
Year: 2024Source: arXiv (Cornell University)
Hierarchical conjunctive queries (HCQ) are a subclass of conjunctive queries (CQ) with robust algorithmic properties. Among others, Berkholz, Keppeler, and Schweikardt have shown that HCQ is the subcl...

Towards Tractability of the Diversity of Query Answers: Ultrametrics to the Rescue

Authors: Cristian Riveros, Marcelo Arenas, Reinhard Pichler et al.
Year: 2024Source: arXiv (Cornell University)
The set of answers to a query may be very large, potentially overwhelming users when presented with the entire set. In such cases, presenting only a small subset of the answers to the user may be pref...

Global Approaches to Auditing Artificial Intelligence: A Literature Review

Authors: Marcelo Mendoza, Sebastián Valenzuela, Janaki Srinivasan et al.
Year: 2024
This Synthesis Report is a literature review outlining the regulatory, industry, and academic approaches to AI audits. We review 78 articles published in peer-reviewed journals and as preprints, 21 do...

Telar and TelarKG: Data-Driven Insights into Chile’s Constitutional Process

Authors: Aidan Hogan, Juan Reutter, Sergio Toro et al.
Year: 2024Source: Communications of the ACM

Response to Kempf et al on Methodological and Practical Aspects of a Distant Metastasis Detection Model

Authors: Jocelyn Dunstan, Pablo Báez, Ricardo Ahumada et al.
Year: 2024Source: JCO Clinical Cancer Informatics

Human-AI Coevolution (Abstract Reprint)

Authors: Ricardo Baeza-Yates, Dino Pedreschi, Alistair Knott et al.
Year: 2024
Human-AI coevolution, defined as a process in which humans and AI algorithms continuously influence each other, increasingly characterises our society, but is understudied in artificial intelligence a...

“The more official, the less I believe”: Using focus groups to explore public opinion formation in politically polarized contexts

Authors: Magdalena Saldaña, Andrés Scherman, Cristian Cabalín et al.
Year: 2024Source: Social Science Quarterly
Abstract Introduction Public opinion studies have traditionally relied on survey analyses. However, a qualitative approach is needed to address opinion formation's multidimensional and contextual natu...

New Compressed Indices for Multijoins on Graph Databases

Authors: Gonzalo Navarro, Diego Arroyuelo, Adrián Gómez-Brandón et al.
Year: 2024Source: arXiv (Cornell University)
A recent surprising result in the implementation of worst-case-optimal (wco) multijoins in graph databases (specifically, basic graph patterns) is that they can be supported on graph representations t...

Gradual Differentially Private Programming

Authors: Matías Toro, Eric Tanter, Federico Olmedo et al.
Year: 2024Source: Communications of the ACM

Welcome

Authors: Bárbara Poblete, Fábio Kon, Sebastián Uchitel
Year: 2024Source: Communications of the ACM
T I S W I T H great pleasure that we introduce the second edition of the Communications of the ACM Latin American Regional Special Section.In this edition, we showcase some of the region's most intere...

A pseudonymized corpus of occupational health narratives for clinical entity recognition in Spanish

Authors: Jocelyn Dunstan, Víctor Rocco, Fabián Villena et al.
Year: 2024Source: BMC Medical Informatics and Decision Making
Despite the high creation cost, annotated corpora are indispensable for robust natural language processing systems. In the clinical field, in addition to annotating medical entities, corpus creators m...

Dynamic compact data structure for temporal reachability with unsorted contact insertions

Authors: Gonzalo Navarro, Bruno Augusto Nassif Travençolo, Marcelo Keese Albertini et al.
Year: 2024Source: The Computer Journal
Abstract Temporal graphs represent interactions between entities over time. Deciding whether entities can reach each other through temporal paths is useful for various applications such as in communic...

LB1040 Machine learning-based predictive model with routine blood work identifies moderate-severe alopecia areata

Authors: Marcelo Mendoza, Tarun Sharma, Ross O’Hagan et al.
Year: 2024Source: Journal of Investigative Dermatology

Tackling Challenges in Implementing Large-Scale Graph Databases

Authors: Aidan Hogan, Juan Reutter, Domagoj Vrgoč et al.
Year: 2024Source: Communications of the ACM

The economics of ethnic marriages: Endogamy and the social status of minority groups

Authors: Naim Bro, Liran Morav
Year: 2024Source: British Journal of Sociology
Abstract This study examines the relationship between ethnic endogamy and socioeconomic status (SES) within the socioeconomically divergent Jewish and Native‐Chilean Mapuche communities of Santiago,...

Joint models reveal genetic architecture of pubertal stage transitions and their association with BMI in admixed Chilean population

Authors: Susana Eyheramendy, Lucas Vicuña, Verónica Mericq et al.
Year: 2024Source: Human Molecular Genetics
Early or late pubertal onset can lead to disease in adulthood, including cancer, obesity, type 2 diabetes, metabolic disorders, bone fractures, and psychopathologies. Thus, knowing the age at which pu...

Path-based Algebraic Foundations of Graph Query Languages

Authors: Renzo Angles, Domagoj Vrgoč, Angela Bonifati et al.
Year: 2024Source: arXiv (Cornell University)
Graph databases are gaining momentum thanks to the flexibility and expressiveness of their data model and query languages. A standardization activity driven by the ISO/IEC standardization body is also...

Cuando los algoritmos son editores: Cómo las redes sociales, la IA y la desinformación alteran el consumo de noticias

Authors: Sebastián Valenzuela
Year: 2024Source: Comunicación y Medios
Esta es una versión editada de la charla magistral del autor en la inauguración de la Conferencia Académica por el Día Mundial de la Libertad de Prensa de UNESCO 2024 y que organizaron la Universi...

Entity normalization in a Spanish medical corpus using a UMLS-based lexicon: findings and limitations

Authors: Jocelyn Dunstan, Pablo Báez, Fredy Núñez Torres et al.
Year: 2024Source: Language Resources and Evaluation

Extracting and Encoding: Leveraging Large Language Models and Medical Knowledge to Enhance Radiological Text Representation

Authors: René Víctor Valqui Vidal, Álvaro Soto, Denis Parra et al.
Year: 2024Source: arXiv (Cornell University)
Advancing representation learning in specialized fields like medicine remains challenging due to the scarcity of expert annotations for text and images. To tackle this issue, we present a novel two-st...

Taxonomic classification with maximal exact matches in KATKA kernels and minimizer digests.

Authors: Gonzalo Navarro, Travis Gagie, Giovanni Manzini et al.
Year: 2024Source: PubMed
For taxonomic classification, we are asked to index the genomes in a phylogenetic tree such that later, given a DNA read, we can quickly choose a small subtree likely to contain the genome from which ...

Adversarial Pairwise Multimodal Recommendation

Authors: Denis Parra, Ricardo Ñanculef, Mario Mallea
Year: 2024Source: 2022 International Joint Conference on Neural Networks (IJCNN)

Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning

Authors: Denis Parra, Rafael Elberg, Mircea Petrache
Year: 2024
Image and multimodal machine learning tasks are very challenging to solve in the case of poorly distributed data. In particular, data availability and privacy restrictions exacerbate these hurdles in ...

ERDoc: A Web Interface for Entity-Relation Modelling

Authors: Aidan Hogan, Sebastián Ferrada, Matias Lopez
Year: 2024

The Generalized Causal-Effect Score in Data Management (short paper)

Authors: Leopoldo Bertossi, Felipe Azua
Year: 2024

TelarKG: a Knowledge Graph of Chile's Constitutional Process

Authors: Aidan Hogan, Juan Reutter, Renzo Angles et al.
Year: 2024
In this paper we present TelarKG, a knowledge graph (KG) that consolidates multiple sources of information regarding the Chilean Constitutional process, particularly about the work of the members of t...

Physics-informed neural networks for parameter estimation in blood flow models

Authors: Jocelyn Dunstan, Sergio Uribe, Jeremías Garay et al.
Year: 2024Source: Computers in Biology and Medicine

Space & Time Efficient Leapfrog Triejoin

Authors: Domagoj Vrgoč, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2024
Leapfrog Triejoin (LTJ) is arguably the most practical and popular worst-case-optimal (wco) algorithm for solving basic graph patterns in graph databases. Its main drawback is that it needs the databa...

Gender Representation Across Online Retail Products

Authors: Bárbara Poblete, Dana Pessach
Year: 2024Source: 2022 ACM Conference on Fairness, Accountability, and Transparency
We present a broad characterization of gender representation in a large heterogeneous sample of retail products. In particular, we study online product textual information, such as titles and descript...

A New Upper Bound for the Makespan of Cost-Optimal Solutions for Multi-Agent Path Finding (Extended Abstract)

Authors: Jorge Baier, Rodrigo López, Roberto Asín‐Achá
Year: 2024Source: Proceedings of the International Symposium on Combinatorial Search
A well-known approach to solving Multi-Agent Path Finding (MAPF) optimally is compilation to Boolean Satisfiability or Answer Set Programming (ASP). Such compilation-based approaches are superior to o...

Finding a Small, Diverse Subset of the Pareto Solution Set in Bi-Objective Search (Extended Abstract)

Authors: Jorge Baier, Nicolás Rivera, Pablo Araneda et al.
Year: 2024Source: Proceedings of the International Symposium on Combinatorial Search
Bi-objective search requires computing a Pareto solution set which contains a set of paths. In real-world applications, Pareto solution sets may contain several tens or even hundreds of solutions. For...

Counting on General Run-Length Grammars

Authors: Gonzalo Navarro, Alejandro Pacheco
Year: 2024Source: arXiv (Cornell University)
We introduce a data structure for counting pattern occurrences in texts compressed with any run-length context-free grammar. Our structure uses space proportional to the grammar size and counts the oc...

Querying Graph Databases at Scale

Authors: Aidan Hogan, Domagoj Vrgoč
Year: 2024
The tutorial provides an in-depth overview of recent advances in algorithms and data structures for processing graph database queries. The focus will be on scalable algorithms that have been demonstra...

MillenniumDB: A Multi-modal, Multi-model Graph Database

Authors: Aidan Hogan, Marcelo Arenas, Juan Reutter et al.
Year: 2024
Current knowledge graphs encompass diverse data formats, including images, text, tables, audio files, and videos. Additionally, the graph database ecosystem is required to support multiple co-existing...

The Limitations of Data, Machine Learning and Us

Authors: Ricardo Baeza-Yates
Year: 2024
Machine learning (ML), particularly deep learning, is being used everywhere. However, not always is applied well or has ethical and/or scientific issues. In this keynote we first do a deep dive in the...

Demonstrating REmatch: A Novel RegEx Engine for Finding all Matches

Authors: Cristian Riveros, Domagoj Vrgoč, Vicente Calisto et al.
Year: 2024
In this demonstration we showcase REmatch, a regular expression (RegEx) engine built to find all matches of a given pattern in a document. REmatch is based on the theory of enumeration algorithms, and...

A Data Management Approach to Explainable AI

Authors: Marcelo Arenas
Year: 2024
In recent years, there has been a growing interest in developing methods to explain individual predictions made by machine learning models. This has led to the development of various notions of explan...

A Principled Approach for a New Bias Measure

Authors: Ricardo Baeza-Yates, Bruno Scarone, Alfredo Viola
Year: 2024Source: arXiv (Cornell University)
The widespread use of machine learning and data-driven algorithms for decision making has been steadily increasing over many years. The areas in which this is happening are diverse: healthcare, employ...

A framework for extraction and transformation of documents

Authors: Cristian Riveros, Nicole Schweikardt, Markus L. Schmid
Year: 2024Source: arXiv (Cornell University)
We present a theoretical framework for the extraction and transformation of text documents. We propose to use a two-phase process where the first phase extracts span-tuples from a document, and the se...

Using Color Refinement to Boost Enumeration and Counting for Acyclic CQs of Binary Schemas

Authors: Cristian Riveros, Nicole Schweikardt, Benjamin Scheidt
Year: 2024Source: arXiv (Cornell University)
We present an index structure, called the color-index, to boost the evaluation of acyclic conjunctive queries (ACQs) over binary schemas. The color-index is based on the color refinement algorithm, a ...

A Framework for Extraction and Transformation of Documents

Authors: Cristian Riveros, Nicole Schweikardt, Markus L. Schmid
Year: 2024Source: arXiv (Cornell University)
We present a theoretical framework for the extraction and transformation of text documents. We propose to use a two-phase process where the first phase extracts span-tuples from a document, and the se...

Compact Path Representations for Graph Database Pattern Matching

Authors: Domagoj Vrgoč, Carlos Rojas, Stijn Vansummeren et al.
Year: 2024
Modern graph database query languages such as GQL, SQL/PGQ, and Cypher allow regular path queries to return entire paths, as opposed to only their endpoints. This is challenging for query evaluation, ...

14th Temporal Web Analytics Workshop (TempWeb)

Authors: Ricardo Baeza-Yates, Omar Alonso, Marc Spaniol
Year: 2024
The TempWeb workshop series is an established co-located event at The Web Conference that aims at bringing together researchers and practitioners across various domains. Naturally, submissions address...

Implications of Regulations on the Use of AI and Generative AI for Human-Centered Responsible Artificial Intelligence

Authors: Ricardo Baeza-Yates, Marios Constantinides, Michael Madaio et al.
Year: 2024
With the upcoming AI regulations (e.g., EU AI Act) and rapid advancements in generative AI, new challenges emerge in the area of Human-Centered Responsible Artificial Intelligence (HCR-AI). As AI beco...

Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning

Authors: Mircea Petrache, Denis Parra, Rafael Elberg
Year: 2024Source: arXiv (Cornell University)
Image and multimodal machine learning tasks are very challenging to solve in the case of poorly distributed data. In particular, data availability and privacy restrictions exacerbate these hurdles in ...

Disjointed Polarization in Chile’s Enduring Crisis of Representation

Authors: Juan Pablo Luna
Year: 2024Source: Latin American Politics and Society
Abstract This analytical essay proposes the notion of disjointed polarization to characterize the nature of polarization in contemporary Chile. In disjointed polarization, elite-level polarization doe...

Is the change deforestation? Using time-series analysis of satellite data to disentangle deforestation from other forest degradation causes

Authors: Susana Eyheramendy, Javier Lopatin, Ignacio Fuentes et al.
Year: 2024Source: Remote Sensing Applications Society and Environment

Augmented non-hallucinating large language models as medical information curators

Authors: Aidan Hogan, Jakob Nikolas Kather, Stephen Gilbert
Year: 2024Source: npj Digital Medicine
Reliably processing and interlinking medical information has been recognized as a critical foundation to the digital transformation of medical workflows, and despite the development of medical ontolog...

U Can't Gen This? A Survey of Intellectual Property Protection Methods for Data in Generative AI

Authors: Andreas Rauber, Tanja Šarčević, Rudolf Mayer et al.
Year: 2024Source: arXiv (Cornell University)
Large Generative AI (GAI) models have the unparalleled ability to generate text, images, audio, and other forms of media that are increasingly indistinguishable from human-generated content. As these ...

SpatialCluster: A Python library for urban clustering

Authors: Marcelo Mendoza, Hans Löbel, Naim Bro et al.
Year: 2024Source: SoftwareX
This paper introduces SpatialCluster, a Python library developed for clustering urban areas using geolocated data. The library integrates a range of methods for urban clustering, including Deep Modula...

A Circus of Circuits: Connections Between Decision Diagrams, Circuits, and Automata

Authors: Benjie Wang, YooJung Choi, Mikaël Monet et al.
Year: 2024Source: arXiv (Cornell University)
This document is an introduction to two related formalisms to define Boolean functions: binary decision diagrams, and Boolean circuits. It presents these formalisms and several of their variants studi...

Generalized Straight-Line Programs

Authors: Gonzalo Navarro, Francisco Javier Vidal Olivares, C. Urbina
Year: 2024Source: arXiv (Cornell University)
It was recently proved that any Straight-Line Program (SLP) generating a given string can be transformed in linear time into an equivalent balanced SLP of the same asymptotic size. We generalize this ...

(Don’t) Stop Believing: A Signal Detection Approach to Risk and Protective Factors for Engagement with Politicized (Mis)Information in Social Media

Authors: Sebastián Valenzuela, Marcelo Santos, Tobias Rothmund et al.
Year: 2024
Prior misinformation research often lacks comparisons with the processing of true information and specifically focuses on the dangers of right-wing or conservative misinformation. By employing a signa...

A Self-Righteous, Not a Virtuous, Circle: Proposing a New Framework for Studying Media Effects on Knowledge and Political Participation in a Social Media Environment

Authors: Sebastián Valenzuela, Sangwon Lee
Year: 2024Source: Social Media + Society
To explain the participatory effects of news exposure, communication scholars have long relied upon the “virtuous circle” framework of media use and civic participation. That is, news consumption ...

Detection and impact estimation of social bots in the Chilean Twitter network

Authors: Marcelo Mendoza, Sebastián Valenzuela, Marcelo Santos et al.
Year: 2024Source: Scientific Reports
Abstract The rise of bots that mimic human behavior represents one of the most pressing threats to healthy information environments on social media. Many bots are designed to increase the visibility o...

Faster Maximal Exact Matches with Lazy LCP Evaluation

Authors: Nathaniel K. Brown, Jan Fostier, Lore Depuydt et al.
Year: 2024
MONI (Rossi et al., <i>JCB</i> 2022) is a BWT-based compressed index for computing the matching statistics and maximal exact matches (MEMs) of a pattern (usually a DNA read) with respect to a highly r...

A simpler data structure for dynamic strings

Authors: Gonzalo Navarro, Zsuzsanna Lipták, Francesco Masillo
Year: 2024Source: arXiv (Cornell University)
We consider the problem of maintaining a collection of strings while efficiently supporting splits and concatenations on them, as well as comparing two substrings, and computing the longest common pre...

BAT-LZ Out of Hell

Authors: Gonzalo Navarro, Zsuzsanna Lipták, Francesco Masillo
Year: 2024Source: arXiv (Cornell University)
Despite consistently yielding the best compression on repetitive text collections, the Lempel-Ziv parsing has resisted all attempts at offering relevant guarantees on the cost to access an arbitrary s...

Worst-Case-Optimal Similarity Joins on Graph Databases

Authors: Aidan Hogan, Juan Reutter, Benjamín Bustos et al.
Year: 2024Source: Proceedings of the ACM on Management of Data
We extend the concept of worst-case optimal equijoins in graph databases to the case where some nodes are required to be within the k-nearest neighbors (kNN) of others under some similarity function. ...

Similarity joins and clustering for SPARQL

Authors: Aidan Hogan, Benjamín Bustos, Sebastián Ferrada
Year: 2024Source: Semantic Web
The SPARQL standard provides operators to retrieve exact matches on data, such as graph patterns, filters and grouping. This work proposes and evaluates two new algebraic operators for SPARQL 1.1 that...

Exploring the Impact of Generative AI for StandUp Report Recommendations in Software Capstone Project Development

Authors: Marcelo Mendoza, Andrés Neyem, Juan Pablo Sandoval Alcocer et al.
Year: 2024
StandUp Reports play an important role in capstone software engineering courses, facilitating progress tracking, obstacle identification, and team collaboration. However, despite their significance, s...

Introduction to Responsible AI

Authors: Ricardo Baeza-Yates, Ricardo Baeza‐Yates
Year: 2024
In the first part of this tutorial we define responsible AI and we discuss the problems embedded in terms like ethical or trustworthy AI. In the second part, to set the stage, we cover irresponsible A...

Stronger and Safer Together: Motivations for and Challenges of (Trans)National Collaboration in Investigative Reporting in Latin America

Authors: Magdalena Saldaña, Lourdes M. Cueva Chacón
Year: 2024Source: Routledge eBooks
Despite the growing scholarship on investigative journalism in Latin America, very few studies have addressed collaboration across newsrooms in the region. By analyzing the responses of 251 journalist...

Implications of Regulations on the Use of AI and Generative AI for Human-Centered Responsible Artificial Intelligence

Authors: Mohammad Tahaei, Edyta P. Bogucka, Seán Kennedy et al.
Year: 2024Source: arXiv (Cornell University)
With the upcoming AI regulations (e.g., EU AI Act) and rapid advancements in generative AI, new challenges emerge in the area of Human-Centered Responsible Artificial Intelligence (HCR-AI). As AI beco...

The Threat of Misinformation on Journalism’s Epistemology: Exploring the Gap between Journalist’s and Audience’s Expectations when Facing Fake Content

Authors: Marcelo Mendoza, Sebastián Valenzuela, Eliana Providel et al.
Year: 2024Source: Digital Journalism
This study analyzes the discourse of reporters, editors and audiences in focus groups and in-depth interviews, examining the expectations on journalists when facing misinformation. While both groups a...

A Family of Centrality Measures for Graph Data Based on Subgraphs

Authors: Cristian Riveros, Sebastián Bugedo, Jorge Salas
Year: 2024Source: ACM Transactions on Database Systems
We present the theoretical foundations and first experimental study of a new approach in centrality measures for graph data. The main principle is straightforward: the more relevant subgraphs around a...

Work in Progress: A Cross-sectional Survey Study for Understanding and Addressing the Needs of Engineering Students During COVID-19

Authors: Jorge Baier, Isabel Hilliger, Constanza Melian et al.
Year: 2024Source: 2020 ASEE Virtual Annual Conference Content Access Proceedings
His research focuses on areas of automated reasoning in Artificial Intelligence; specifically, automated planning, search and knowledge representation.Currently his research focuses on understanding h...

Iterated Straight-Line Programs

Authors: Gonzalo Navarro, C. Urbina
Year: 2024Source: arXiv (Cornell University)
We explore an extension to straight-line programs (SLPs) that outperforms, for some text families, the measure $\delta$ based on substring complexity, a lower bound for most measures and compressors e...

Stronger compact representations of object trajectories

Authors: Gonzalo Navarro, Travis Gagie, Adrián Gómez-Brandón et al.
Year: 2024Source: Geo-spatial Information Science
GraCT and ContaCT were the first compressed data structures to represent object trajectories, demonstrating that it was possible to use orders of magnitude less space than classical indexes while stay...

The Ring: Worst-case Optimal Joins in Graph Databases using (Almost) No Extra Space

Authors: Aidan Hogan, Juan Reutter, Gonzalo Navarro et al.
Year: 2024Source: ACM Transactions on Database Systems
We present an indexing scheme for triple-based graphs that supports join queries in worst-case optimal (wco) time within compact space. This scheme, called a ring , regards each triple as a cyclic str...

Social ties, mental well-being and academic self-regulation. Exploring effects through Structural Equation Modeling.

Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo et al.
Year: 2024
A long tradition of studies in both psychology and sociology has shown that social ties have positive effects on mental well-being of both the population in general and in educational contexts in part...

The Well-being Teaching Assistant: A Proactive Approach to Caring for Students with Academic and Personal Difficulties in Massive Courses

Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo et al.
Year: 2024
Abstract Since the covid pandemic, some higher education institutions have promoted a flexible evaluation approach for students who face a variety of problems. Instructors willing to implement such fl...

Link Prediction with Relational Hypergraphs

Authors: Pablo Barceló, Michael M. Bronstein, Miguel Romero Orth et al.
Year: 2024Source: arXiv (Cornell University)
Link prediction with knowledge graphs has been thoroughly studied in graph machine learning, leading to a rich landscape of graph neural network architectures with successful applications. Nonetheless...

WIP: Exploring differences in student sense of belonging inside and outside the engineering classroom

Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo et al.
Year: 2024
Abstract This Work-in-Progress (WIP) aims to explore differences in engineering students' sense of belonging. By sense of belonging, researchers have referred to the feeling of mattering to a communit...

The Distributional Uncertainty of the SHAP score in Explainable Machine Learning

Authors: Leopoldo Bertossi, Nina Pardal, Santiago Cifuentes et al.
Year: 2024Source: arXiv (Cornell University)
Attribution scores reflect how important the feature values in an input entity are for the output of a machine learning model. One of the most popular attribution scores is the SHAP score, which is an...

The problem of estimation and forecasting of obesity prevalence using sparsely collected data

Authors: Jocelyn Dunstan, Cristóbal Cuadrado, Luis Rojo-González et al.
Year: 2024Source: Engineering Applications of Artificial Intelligence

The Meso News-Space as a Framework for Studying Mobile Instant Messaging Services

Authors: Sebastián Valenzuela, Marcelo Santos
Year: 2024Source: Digital Journalism

Automatic Detection of Distant Metastasis Mentions in Radiology Reports in Spanish

Authors: Jocelyn Dunstan, Matías Rojas, Pablo Báez et al.
Year: 2024Source: JCO Clinical Cancer Informatics
A critical task in oncology is extracting information related to cancer metastasis from electronic health records. Metastasis-related information is crucial for planning treatment, evaluating patient ...

A pseudonymized corpus of occupational health narratives for clinical entity recognition in Spanish

Authors: Jocelyn Dunstan, Víctor Rocco, Fabián Villena et al.
Year: 2024Source: Research Square (Research Square)
<title>Abstract</title> Despite the high creation cost, annotated corpora are indispensable for robust natural language processing systems. In the clinical field, apart from annotating medical entitie...

Securing Verified IO Programs Against Unverified Code in F*

Authors: Eric Tanter, Cătălin Hriţcu, Ştefan Ciobâcă et al.
Year: 2024Source: Proceedings of the ACM on Programming Languages
We introduce SCIO*, a formally secure compilation framework for statically verified programs performing input-output (IO). The source language is an F* subset in which a verified program interacts wit...

Responsible AI in Farming: A Multi-Criteria Framework for Sustainable Technology Design

Authors: Ricardo Baeza-Yates, Kevin Mallinger, Ricardo Baeza‐Yates
Year: 2024Source: Applied Sciences
The continuous fusion of artificial intelligence (AI) and autonomous farming machinery (e.g., drones and field robots) provides a significant shift in the daily work experience of farmers. Faced with ...

A Transforming Digital Journalism Editorial Team Calls for a Tribute and a Welcome

Authors: Magdalena Saldaña, Oscar Westlund
Year: 2024Source: Digital Journalism

Toward an AI Knowledge Assistant for Context-Aware Learning Experiences in Software Capstone Project Development

Authors: Marcelo Mendoza, Andrés Neyem, Juan Pablo Sandoval Alcocer et al.
Year: 2024Source: IEEE Transactions on Learning Technologies
Software assistants have significantly impacted software development for both practitioners and students, particularly in capstone projects. The effectiveness of these tools varies based on their know...

Cross-Lingual Cross-Domain Transfer Learning for Rumor Detection

Authors: Marcelo Mendoza, Mauricio Solar, Eliana Providel
Year: 2024

All Models are Wrong, But Some are Deadly: Inconsistencies in Emotion Detection in Suicide-related Tweets

Authors: Ricardo Baeza-Yates, Resmi Ramachandranpillai, Annika Marie Schoene et al.
Year: 2024

A Credibility Divide? Discerning Truth From Misinformation in Chile

Authors: Sebastián Valenzuela, Ingrid Bachmann, Daniel Halpern et al.
Year: 2024Source: International Journal of Public Opinion Research
Abstract Studies on misinformation often overlook people’s assessment of true information, focusing instead on beliefs in and sharing of false content. This is problematic, as it limits scholars’ ...

Overconfidence is Key: Verbalized Uncertainty Evaluation in Large Language and Vision-Language Models

Authors: Matías Toro, Tobias Groot
Year: 2024

Post-processing of Medical Image for Neurosurgical Planning with Academic Purposes

Authors: Pablo Barceló, Rocío Buenamaizón, Ricardo Berjano et al.
Year: 2024Source: IFMBE proceedings

Post-Processing Applied to Brain Tumor Surgery: Case studies

Authors: Pablo Barceló, Rocío Buenamaizón, Ricardo Berjano et al.
Year: 2024Source: IFMBE proceedings

An optimized relational database for querying structural patterns in proteins

Authors: Renzo Angles, Roberto García, Mauricio Arenas‐Salinas et al.
Year: 2024Source: Database
Abstract A database is an essential component in almost any software system, and its creation involves more than just data modeling and schema design. It also includes query optimization and tuning. T...

YARS-PG: Property Graphs Representation for Publication and Exchange

Authors: Renzo Angles, Dominik Tomaszuk, Łukasz Szeremeta
Year: 2024Source: IEEE Access
Graph serialization is a critical aspect of advancing graph-oriented systems and applications. Despite the importance of standardized serialization for property graphs, there is a lack of a universal ...

Path Querying in Graph Databases: A Systematic Mapping Study

Authors: Renzo Angles, Roberto García
Year: 2024Source: IEEE Access
Path querying refers to the evaluation of path queries in a graph database. New research in this topic is crucial for the development of graph database systems as path queries are associated with rele...

The Property Graph Data Format (PGDF)

Authors: Renzo Angles, Sebastián Ferrada, Ignacio Burgos
Year: 2024Source: IEEE Access
Property graphs are popular in both industry and academia due to their versatility in modeling complex data across diverse application domains, ranging from social networks to knowledge graphs. Despit...

Responsible AI: An Urgent Mandate

Authors: Ricardo Baeza-Yates, Usama M. Fayyad, Ricardo Baeza‐Yates
Year: 2024Source: IEEE Intelligent Systems
AI is rapidly becoming essential in various industries, raising societal expectations. AI's societal consequences include impacts on mental health; misinformation; workforce displacement; and economic...

Iterated Straight-Line Programs

Authors: Gonzalo Navarro, C. Urbina
Year: 2024Source: Lecture notes in computer science

Space-Efficient Conversions from SLPs

Authors: Gonzalo Navarro, Travis Gagie, Adrián Goga et al.
Year: 2024Source: Lecture notes in computer science

Speedy Gonzales: A Collection of Fast Task-Specific Models for Spanish

Authors: José Cañete, Felipe Bravo-Márquez
Year: 2024

Wheeler Maps

Authors: Gonzalo Navarro, Travis Gagie, Jouni Sirén et al.
Year: 2024Source: Lecture notes in computer science

News Gathering: Leveraging Transformers to Rank News

Authors: Marcelo Mendoza, Hans Löbel, Maximiliano Ojeda et al.
Year: 2024Source: Lecture notes in computer science

Extracting and Encoding: Leveraging Large Language Models and Medical Knowledge to Enhance Radiological Text Representation

Authors: Denis Parra, Pablo Messina, Álvaro Soto et al.
Year: 2024Source: Findings of the Association for Computational Linguistics: ACL 2022

A Privacy-Preserving Corpus for Occupational Health in Spanish: Evaluation for NER and Classification Tasks

Authors: Jocelyn Dunstan, Víctor Rocco, Fabián Villena et al.
Year: 2024
Claudio Aracena, Luis Miranda, Thomas Vakili, Fabián Villena, Tamara Quiroga, Fredy Núñez-Torres, Victor Rocco, Jocelyn Dunstan. Proceedings of the 6th Clinical Natural Language Processing Workshop...

Chile: La deriva del sistema político y el fracaso del nuevo proceso constitucional

Authors: Sergio Toro, AGUSTINA NOGUERA
Year: 2024Source: Revista de ciencia política

Enumeration and Updates for Conjunctive Linear Algebra Queries Through Expressibility

Authors: Thomas Muñoz, Cristian Riveros, Stijn Vansummeren
Year: 2024Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
Due to the importance of linear algebra and matrix operations in data analytics, there is significant interest in using relational query optimization and processing techniques for evaluating (sparse) ...

Geospatial Raster Data Processing Applying Neural Networks

Authors: Magdalena Saldaña, Carlos Guzmán Sanchéz-Mejorada, Rolando Quintero et al.
Year: 2024Source: Communications in computer and information science

iHealth-Chile-3&2 at RRG24: Template Based Report Generation

Authors: Denis Parra, Pablo Messina, Álvaro Soto et al.
Year: 2024

iHealth-Chile-1 at RRG24: In-context Learning and Finetuning of a Large Multimodal Model for Radiology Report Generation

Authors: Denis Parra, Pablo Messina, Rafael Elberg et al.
Year: 2024

How Could Be Used Student Comments for Delivering Feedback to Instructors in Higher Education?

Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo
Year: 2024Source: Communications in computer and information science

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

Authors: Toqeer Ehsan, Jiahui Geng, Tiago Timponi Torrent et al.
Year: 2024
Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual a...

Publications for 2023

Displaying 159 publication(s) for 2023

ACHS-Privacy Corpus

Authors: Jocelyn Dunstan, Víctor Rocco, Fabián Villena et al.
Year: 2023Source: Zenodo (CERN European Organization for Nuclear Research)

ACHS-Privacy Corpus

Authors: Jocelyn Dunstan, Víctor Rocco, Fabián Villena et al.
Year: 2023Source: Zenodo (CERN European Organization for Nuclear Research)

A Panel Study on the Dynamics of Social Media Use and Conspiracy Thinking

Authors: Sebastián Valenzuela, Daniel Halpern, Sangwon Lee et al.
Year: 2023Source: Media Psychology
Studies exploring the association between social media use and belief in conspiracy theories have yielded mixed evidence. To address this inconsistency, we focus on conspiracy thinking – a predispos...

K-Focal Search for Slow Learned Heuristics

Authors: Jorge Baier, Carlos Hernández, Jorge Toro et al.
Year: 2023Source: IEEE Access
Bounded suboptimal heuristic search is a family of search algorithms capable of solving hard combinatorial problems, returning suboptimal solutions within a given bound.Recent machine learning approac...

Predicting disease severity in multiple sclerosis using multimodal data and machine learning

Authors: Ricardo Baeza-Yates, Ana Freire, Priscilla Bäcker‐Koduah et al.
Year: 2023Source: Journal of Neurology
Multiple sclerosis patients would benefit from machine learning algorithms that integrates clinical, imaging and multimodal biomarkers to define the risk of disease activity.

Unveiling Backbone Effects in CLIP: Exploring Representational Synergies and Variances

Authors: Felipe Bravo-Márquez, Edison Marrese-Taylor, I. Jara et al.
Year: 2023Source: arXiv (Cornell University)
Contrastive Language-Image Pretraining (CLIP) stands out as a prominent method for image representation learning. Various neural architectures, spanning Transformer-based models like Vision Transforme...

Ciencias, golpe de Estado y Dictadura en Chile

Authors: Claudio Gutiérrez
Year: 2023Source: Anales de la Universidad de Chile
de «limpieza» física.En la tercera, abordamos la «limpieza» ideológica y disciplinaria.En la cuarta, tratamos la

Bias and the Web

Authors: Ricardo Baeza-Yates, Leena Murgai
Year: 2023
Abstract Bias is everywhere, sometimes blatantly explicit, but most of the time it’s hidden, as it often arises from that which is missing, the gaps in our knowledge or data. In this chapter, we cov...

Measuring Bias

Authors: Ricardo Baeza-Yates, Aida Sharif Rohani
Year: 2023Source: 2021 IEEE International Conference on Big Data (Big Data)
The extensive use of machine learning (ML) for supporting or making major decisions such as employment, credit card approval, or juridical decisions has resulted in rising concerns over the widespread...

Differential privacy and SPARQL

Authors: Federico Olmedo, Carlos Buil-Aranda, Jorge Lobo
Year: 2023Source: Semantic Web
Differential privacy is a framework that provides formal tools to develop algorithms to access databases and answer statistical queries with quantifiable accuracy and privacy guarantees. The notions o...

Sherlock-wannabes or when the audience fact-checks. How ideology, education, and alternative media use explain fact-checking behaviors

Authors: Magdalena Saldaña, Marcelo Santos
Year: 2023Source: Estudios sobre el Mensaje Periodístico
When confronted with suspicious information, the most common advice is to rely on trusted, well-known news media outlets to verify it. However, in a high-choice, fragmented media ecosystem, news reade...

Local Government, Social Media and Management of COVID-19: The Case of Chilean Mayoral Communication

Authors: Sergio Toro, Sebastián Valenzuela, Juan Pablo Luna et al.
Year: 2023Source: Political Communication
Most research on governments' use of social media focuses on the national or federal level. We therefore know little about the way local authorities harness social media platforms to communicate with ...

Report on the 46th ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023): Reflections from the Program Co-Chairs

Authors: Bárbara Poblete, Josiane Mothe, Makoto P. Kato
Year: 2023Source: ACM SIGIR Forum
The ACM SIGIR Conference on Research and Development in Information Retrieval has been experiencing significant growth over the past few years. In 2023, SIGIR received a total of 822 full paper submis...

Report on the 13th Workshop on Temporal Web Analytics (TempWeb 2023) at WWW 2023

Authors: Ricardo Baeza-Yates, Omar Alonso, Marc Spaniol
Year: 2023Source: ACM SIGIR Forum
TempWeb is an established Workshop (series) with a long-standing tradition as a co-located event at The Web Conference. Considering the constantly evolving Web as a primary object of research, TempWeb...

Politics and Media in Journalism & Mass Communication Quarterly: A Centennial Research Retrospective

Authors: Sebastián Valenzuela, Homero Gil de Zúñiga, Ingrid Bachmann et al.
Year: 2023Source: Journalism & Mass Communication Quarterly
Based on computerized and manual content analyses, we examined the theories, methods, topics, and authors’ backgrounds of the empirical articles revolving around politics and media published by Jour...

FairXAI - A Taxonomy and Framework for Fairness and Explainability Synergy in Machine Learning

Authors: Ricardo Baeza-Yates, Fredrik Heintz, Resmi Ramachandranpillai
Year: 2023
<p>Explainable Artificial Intelligence (XAI) and Fair Learning have made significant strides in various application domains, including criminal recidivism predictions, healthcare settings, toxic...

FairXAI - A Taxonomy and Framework for Fairness and Explainability Synergy in Machine Learning

Authors: Ricardo Baeza-Yates, Fredrik Heintz, Resmi Ramachandranpillai
Year: 2023
<p>Explainable Artificial Intelligence (XAI) and Fair Learning have made significant strides in various application domains, including criminal recidivism predictions, healthcare settings, toxic...

Near-Optimal Search Time in $$\delta $$-Optimal Space, and Vice Versa

Authors: Gonzalo Navarro, Tomasz Kociumaka, Francisco Javier Vidal Olivares
Year: 2023Source: Algorithmica

Generative AI models should include detection mechanisms as a condition for public release

Authors: Ricardo Baeza-Yates, Raja Chatila, David Eyers et al.
Year: 2023Source: Ethics and Information Technology
Abstract The new wave of ‘foundation models’—general-purpose generative AI models, for production of text (e.g., ChatGPT) or images (e.g., MidJourney)—represent a dramatic advance in the state...

A comparative dataset: Bridging COVID-19 and other diseases through epistemonikos and CORD-19 evidence

Authors: Denis Parra, Hans Löbel, Andrés Carvallo et al.
Year: 2023Source: Data in Brief
The COVID-19 pandemic has underlined the need for reliable information for clinical decision-making and public health policies. As such, evidence-based medicine (EBM) is essential in identifying and e...

Bias Invariant Approaches for Improving Word Embedding Fairness

Authors: Bárbara Poblete, Vanessa Murdock, Rongting Zhang et al.
Year: 2023
Many public pre-trained word embeddings have been shown to encode different types of biases. Embeddings are often obtained from training on large pre-existing corpora, and therefore resulting biases c...

A Uniform Language to Explain Decision Trees

Authors: Pablo Barceló, Marcelo Arenas, Bernardo Subercaseaux et al.
Year: 2023Source: arXiv (Cornell University)
The recent development of formal explainable AI has disputed the folklore claim that "decision trees are readily interpretable models", showing different interpretability queries that are computationa...

A neuro-symbolic framework for answering conjunctive queries

Authors: Pablo Barceló, Juan Reutter, Floris Geerts et al.
Year: 2023Source: arXiv (Cornell University)
The challenge of answering graph queries over incomplete knowledge graphs is gaining significant attention in the machine learning community. Neuro-symbolic models have emerged as a promising approach...

Logical Languages Accepted by Transformer Encoders with Hard Attention

Authors: Pablo Barceló, Anthony W. Lin, Alexander Kozachinskiy et al.
Year: 2023Source: arXiv (Cornell University)
We contribute to the study of formal languages that can be recognized by transformer encoders. We focus on two self-attention mechanisms: (1) UHAT (Unique Hard Attention Transformers) and (2) AHAT (Av...

Natural language processing analysis of the psychosocial stressors of mental health disorders during the pandemic

Authors: Susana Eyheramendy, Maria Paz Hermosilla, Isidora Paiva-Mack et al.
Year: 2023Source: npj Mental Health Research
Abstract Over the past few years, the COVID-19 pandemic has exerted various impacts on the world, notably concerning mental health. Nevertheless, the precise influence of psychosocial stressors on thi...

Evaluation of 3D Reconstruction for Cultural Heritage Applications

Authors: Benjamín Bustos, Ivan Sipiran, Cristián Llull et al.
Year: 2023
In recent years, we have seen the emergence of methods for creating 3D digital reproductions of objects using photos. These techniques, particularly when combined with handheld video devices like smar...

An empirical study of the effect of video encoders on Temporal Video Grounding

Authors: Felipe Bravo-Márquez, Edison Marrese-Taylor, I. Jara et al.
Year: 2023
Temporal video grounding is a fundamental task in computer vision, aiming to localize a natural language query in a long, untrimmed video. It has a key role in the scientific community, in part due to...

On the Power of the Weisfeiler-Leman Test for Graph Motif Parameters

Authors: Pablo Barceló, Matthias Lanzinger
Year: 2023Source: arXiv (Cornell University)
Seminal research in the field of graph neural networks (GNNs) has revealed a direct correspondence between the expressive capabilities of GNNs and the $k$-dimensional Weisfeiler-Leman ($k$WL) test, a ...

No Agreement Without Loss: Learning and Social Choice in Peer Review

Authors: Pablo Barceló, Tomasz Steifer, Cristóbal Rojas et al.
Year: 2023Source: Frontiers in artificial intelligence and applications
In peer review systems, reviewers are often asked to evaluate various features of submissions, such as technical quality or novelty. A score is given to each of the predefined features and based on th...

Uncovering Bias in Personal Informatics

Authors: Ricardo Baeza-Yates, Athena Vakali, Pavlos Sermpezis et al.
Year: 2023Source: Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies
Personal informatics (PI) systems, powered by smartphones and wearables, enable people to lead healthier lifestyles by providing meaningful and actionable insights that break down barriers between use...

2.2 kW single-mode narrow-linewidth laser delivery through a hollow-core fiber

Authors: Denis Parra, Matthew Cooper, Joseph Wahlen et al.
Year: 2023Source: Optica
Antiresonant hollow-core fibers (AR-HCFs) have opened up exciting possibilities for high-energy and high-power laser delivery because of their exceptionally low nonlinearities and high damage threshol...

Los presidencialismos y la inestabilidad política en América Latina: Contención e incorporación del conflicto durante el siglo XIX

Authors: Sergio Toro, Juan Carlos Arellano González, Alejandro Olivares
Year: 2023Source: Revista Chilena de Derecho y Ciencia Política
Una de las principales características de los presidencialismos de América Latina es que, a lo largo de la historia, se han mostrado diversos momentos de inestabilidad. En búsqueda de algunas expli...

Optimizing RPQs over a compact graph representation

Authors: Aidan Hogan, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2023Source: The VLDB Journal

Truth be told: How “true” and “false” labels influence user engagement with fact-checks

Authors: Sebastián Valenzuela, Ingrid Bachmann, Tiago Ventura et al.
Year: 2023Source: New Media & Society
When do users share fact-checks on social media? We describe a survey experiment conducted during the 2019 election in Argentina measuring the propensity of voters to share corrections to political mi...

SparqLog: A System for Efficient Evaluation of SPARQL 1.1 Queries via Datalog

Authors: Renzo Angles, Georg Gottlob, Reinhard Pichler et al.
Year: 2023Source: Proceedings of the VLDB Endowment
Over the past decade, Knowledge Graphs have received enormous interest both from industry and from academia. Research in this area has been driven, above all, by the Database (DB) community and the Se...

Trends in the Global Information Environment: 2023 Expert Survey Results

Authors: Sebastián Valenzuela, Wendy Hui Kyong Chun, Philip N. Howard et al.
Year: 2023
The information environment is rapidly evolving, with algorithmic bias, manipulation and misinformation having a significant impact on public life. The global network of researchers is an important so...

Expert Survey on the Global Information Environment 2023: Lessons for Technology Policy and Design

Authors: Sebastián Valenzuela, Wendy Hui Kyong Chun, Philip N. Howard et al.
Year: 2023
The global information environment is impacted by both technology design and public policy. This Summary for Policymakers summarizes Trends in the Global Information Environment: 2023 Expert Survey Re...

‘Does she know how to read?’ An intersectional perspective to explore Twitter users’ portrayal of women Mapuche leaders

Authors: Magdalena Saldaña, Ximena Orchard, Isabel Pavez et al.
Year: 2023Source: Information Communication & Society
ABSTRACTSocial media offer new opportunities for women in politics, but also new ground for the expression of bias and stereotypes. Drawing upon literature about mediated representations of women in p...

Efficient construction of the BWT for repetitive text using string compression

Authors: Gonzalo Navarro, Diego Díaz-Domínguez
Year: 2023Source: Information and Computation
We present a new semi-external algorithm that builds the Burrows–Wheeler transform variant of Bauer et al. (a.k.a., BCR BWT) in linear expected time. Our method uses compression techniques to reduce...

Artificial intelligence-based decision-making: can ChatGPT replace a multidisciplinary tumour board?

Authors: Sebastián Valenzuela, Javier Vela Ulloa, Christophe Riquoir Altamirano et al.
Year: 2023Source: British journal of surgery
Artificial intelligence (AI) has been around for a while.Recent reports 1,2 have evaluated its role in assisting clinical decision-making, with promising results.After its launch in 2022 by OpenAI (Sa...

DIVERGÊNCIA KULLBACK-LEIBLER APLICADA A FWI

Authors: Juan Pablo Luna, Gilberto Barbosa Neto Carvalho, Virgílio José Martins Ferreira Filho
Year: 2023Source: Revista Contemporânea
A FWI (FUll-Waveform Inversion) é um dos métodos mais robustos para extrair informações sísmicas. Contudo, a norma L2 usada para medir a diferença entre dados sísmicos nem sempre é a melhor op...

The Shapley Value in Database Management

Authors: Leopoldo Bertossi, Mikaël Monet, Ester Livshits et al.
Year: 2023Source: ACM SIGMOD Record
Attribution scores can be applied in data management to quantify the contribution of individual items to conclusions from the data, as part of the explanation of what led to these conclusions. In Arti...

Fair Multilingual Vandalism Detection System for Wikipedia

Authors: Ricardo Baeza-Yates, Diego Sáez-Trumper, Mykola Trokhymovych et al.
Year: 2023Source: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
This paper presents a novel design of the system aimed at supporting the Wikipedia community in addressing vandalism on the platform. To achieve this, we collected a massive dataset of 47 languages, a...

Gradual Sensitivity Typing

Authors: Matías Toro, Eric Tanter, Damián Árquez et al.
Year: 2023Source: arXiv (Cornell University)
Reasoning about the sensitivity of functions with respect to their inputs has interesting applications in various areas, such as differential privacy. In order to check and enforce sensitivity, severa...

Physics-informed neural networks for blood flow inverse problems

Authors: Jocelyn Dunstan, Sergio Uribe, Jeremías Garay et al.
Year: 2023Source: arXiv (Cornell University)
Physics-informed neural networks (PINNs) have emerged as a powerful tool for solving inverse problems, especially in cases where no complete information about the system is known and scatter measureme...

LECTURE HELD AT THE ACADEMIA EUROPAEA BUILDING BRIDGES CONFERENCE 2022

Authors: Ricardo Baeza-Yates, Ricardo Baeza‐Yates
Year: 2023Source: European Review
Artificial intelligence (AI) has finally reached most people on our planet thanks to generative AI tools for text and other media. This has started a controversy about the possible benefits and risks,...

Towards a Comprehensive Human-Centred Evaluation Framework for Explainable AI

Authors: Denis Parra, Katrien Verbert, Ivania Donoso-Guzmán et al.
Year: 2023Source: arXiv (Cornell University)
While research on explainable AI (XAI) is booming and explanation techniques have proven promising in many application domains, standardised human-centred evaluation procedures are still missing. In a...

Attribution-Scores in Data Management and Explainable Machine Learning

Authors: Leopoldo Bertossi
Year: 2023Source: arXiv (Cornell University)
We describe recent research on the use of actual causality in the definition of responsibility scores as explanations for query answers in databases, and for outcomes from classification models in mac...

Influence of quality of reduction using radiological criteria on kinematics and kinetics in ankle fractures with unstable syndesmotic injury

Authors: Aidan Hogan, Ursula Trinler, Paul Alfred Grützner et al.
Year: 2023Source: Clinical Biomechanics
Although, the data did not show that radiological reduction criteria have a statistically significant effect on active functional outcome after a mean follow up time of 5.7 years, tendencies for a bet...

Evaluating Regular Path Queries on Compressed Adjacency Matrices

Authors: Gonzalo Navarro, Diego Arroyuelo, Adrián Gómez-Brandón
Year: 2023Source: arXiv (Cornell University)
Regular Path Queries (RPQs), which are essentially regular expressions to be matched against the labels of paths in labeled graphs, are at the core of graph database query languages like SPARQL. A way...

How are AI assistants changing higher education?

Authors: Maria Rauschenberger, Ricardo Baeza‐Yates, Ricardo Baeza-Yates et al.
Year: 2023Source: Frontiers in Computer Science
Context Higher education is changing at an accelerating pace due to the widespread use of digital teaching and emerging technologies. In particular, AI assistants such as ChatGPT pose significant chal...

Wikipedia Multilingual Vandalism Detection Dataset

Authors: Ricardo Baeza-Yates, Diego Sáez-Trumper, Mykola Trokhymovych et al.
Year: 2023Source: Zenodo (CERN European Organization for Nuclear Research)
This dataset accompanies a research paper that introduces a novel system designed to support the Wikipedia community in combating vandalism on the platform. The dataset has been prepared to enhance th...

Wikipedia Multilingual Vandalism Detection Dataset

Authors: Ricardo Baeza-Yates, Diego Sáez-Trumper, Mykola Trokhymovych et al.
Year: 2023Source: Zenodo (CERN European Organization for Nuclear Research)
This dataset accompanies a research paper that introduces a novel system designed to support the Wikipedia community in combating vandalism on the platform. The dataset has been prepared to enhance th...

A transcription and information extraction system to facilitate EHR documentation in Spanish

Authors: Jocelyn Dunstan, Fabián Villena, Matías Rojas et al.
Year: 2023Source: Research Square (Research Square)
<title>Abstract</title> The large and diverse access to data sources in healthcare has boosted the application of novel computer techniques that can extract meaningful information to improve patients'...

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval

Authors: Bárbara Poblete, Hsin‐Hsi Chen, Josiane Mothe et al.
Year: 2023
International audience

RiverText: A Python Library for Training and Evaluating Incremental Word Embeddings from Text Data Streams

Authors: Felipe Bravo-Márquez, Gabriel Iturra-Bocaz
Year: 2023Source: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
Word embeddings have become essential components in various information retrieval and natural language processing tasks, such as ranking, document classification, and question answering. However, desp...

Criminal Politics and Botched Development in Contemporary Latin America

Authors: Juan Pablo Luna, Andreas Emil Feldmann
Year: 2023Source: Cambridge University Press eBooks
This Element investigates the relationship between the narcotics industry and politics and assesses how it influences domestic political dynamics, including economic development prospects in Latin Ame...

SparqLog: A System for Efficient Evaluation of SPARQL 1.1 Queries via Datalog [Experiment, Analysis and Benchmark]

Authors: Renzo Angles, Georg Gottlob, Reinhard Pichler et al.
Year: 2023Source: arXiv (Cornell University)
Over the past decade, Knowledge Graphs have received enormous interest both from industry and from academia. Research in this area has been driven, above all, by the Database (DB) community and the Se...

State Responses to Autonomy Demands: Indigenous Movements and Regional Threats in Bolivia and Ecuador

Authors: Carla Alberti, Shannan Mattiace
Year: 2023Source: Journal of Politics in Latin America
In this paper, we examine the political factors that explain state responses to demands for indigenous territorial autonomy in Ecuador and Bolivia. Specifically, we aim to explain why the 2009 Bolivia...

Automatic Coding at Scale: Design and Deployment of a Nationwide System for Normalizing Referrals in the Chilean Public Healthcare System

Authors: Jocelyn Dunstan, Fabián Villena, Matías Rojas et al.
Year: 2023Source: arXiv (Cornell University)
The disease coding task involves assigning a unique identifier from a controlled vocabulary to each disease mentioned in a clinical document. This task is relevant since it allows information extracti...

The Impact of the Web on Information Retrieval

Authors: Ricardo Baeza-Yates, Peter Mika
Year: 2023Source: ACM eBooks
chapter Share on The Impact of the Web on Information Retrieval Authors: Peter Mika Search about this author , Ricardo Baeza-Yates Search about this author Authors Info & Claims Linking the World's In...

Uncovering Bias in Personal Informatics

Authors: Ricardo Baeza-Yates, Athena Vakali, Pavlos Sermpezis et al.
Year: 2023Source: Zenodo (CERN European Organization for Nuclear Research)

REmatch: A Novel Regex Engine for Finding All Matches

Authors: Cristian Riveros, Domagoj Vrgoč, Nicolás Van Sint Jan
Year: 2023Source: Proceedings of the VLDB Endowment
In this paper, we present the REmatch system for information extraction. REmatch is based on a recently proposed enumeration algorithm for evaluating regular expressions with capture variables support...

Visual Exploration of Repetitive Patterns on Ancient Peruvian Pottery

Authors: Benjamín Bustos, Ivan Sipiran, Tobias Schreck et al.
Year: 2023Source: Journal of WSCG
The analysis and understanding of artefact properties and their relationships is a key goal in archaeological analysis of cultural heritage objects. There are many aspects of concern, including shape ...

Strategies for Improving the Global Information Environment: Results from a Systematic Review and Meta-Analysis

Authors: Sebastián Valenzuela, Wendy Hui Kyong Chun, Philip N. Howard et al.
Year: 2023
Which design solutions mitigate the impact of misinformation on social media platforms, according to the latest scientific research? This IPIE Summary for Policymakers presents the main findings of tw...

Joint models reveal genetic architecture of transitions between pubertal stages and their association with BMI in a Latino population

Authors: Susana Eyheramendy, Lucas Vicuña, Verónica Mericq et al.
Year: 2023Source: medRxiv (Cold Spring Harbor Laboratory)
Abstract Early or late pubertal onset can lead to disease in adulthood, including cancer, obesity, type 2 diabetes, metabolic disorders, bone fractures and psychopathologies. Thus, knowing the age at ...

Special Issue on “Artificial Intelligence‐Driven Decision Making in Health and Medicine”

Authors: Leopoldo Bertossi, Herb Kunze, Marc Poulin et al.
Year: 2023Source: International Transactions in Operational Research
Artificial Intelligence (AI) refers to an interdisciplinary area which embraces computer science, robotics, engineering, mathematics, and statistics, and is largely based on the ability of a machine t...

Radiografía de un mito: la representación estereotipada de los videojugadores puesta a prueba en Chile

Authors: Magdalena Saldaña, Marco Jaramillo
Year: 2023Source: Comunicación y Medios
Los jugadores de videojuegos han sido representados en medios de comunicación y en productos culturales como niños u hombres jóvenes con escasa vida social, sin participación en la vida comunitari...

Social AI and the Challenges of the Human-AI Ecosystem

Authors: Ricardo Baeza-Yates, Dino Pedreschi, Alistair Knott et al.
Year: 2023Source: arXiv (Cornell University)
The rise of large-scale socio-technical systems in which humans interact with artificial intelligence (AI) systems (including assistants and recommenders, in short AIs) multiplies the opportunity for ...

Evaluating Pre-training Strategies for Collaborative Filtering

Authors: Denis Parra, Leandro Balby Marinho, Rodrygo L. T. Santos et al.
Year: 2023
Pre-training is essential for effective representation learning models, especially in natural language processing and computer vision-related tasks. The core idea is to learn representations, usually ...

From Database Repairs to Causality in Databases and Beyond

Authors: Leopoldo Bertossi
Year: 2023Source: arXiv (Cornell University)
We describe some recent approaches to score-based explanations for query answers in databases. The focus is on work done by the author and collaborators. Special emphasis is placed on the use of count...

A Copernican Revolution in Data

Authors: Claudio Gutiérrez
Year: 2023Source: arXiv (Cornell University)
Half a century ago, Charles Bachman foresaw the significance and centrality of data in the digital world. In this short paper, we delve into the evolution of these ideas within the database community ...

MillenniumDB: An Open-Source Graph Database System

Authors: Cristian Riveros, Aidan Hogan, Carlos Buil-Aranda et al.
Year: 2023Source: Data Intelligence
Abstract In this systems paper, we present MillenniumDB: a novel graph database engine that is modular, persistent, and open source. MillenniumDB is based on a graph data model, which we call domain g...

PG-Schema: Schemas for Property Graphs

Authors: Renzo Angles, Domagoj Vrgoč, Juan Sequeda et al.
Year: 2023Source: Proceedings of the ACM on Management of Data
Property graphs have reached a high level of maturity, witnessed by multiple robust graph database systems as well as the ongoing ISO standardization effort aiming at creating a new standard Graph Que...

Do Fiscal Transfers Affect Local Democracy? Lessons from Chilean Municipalities

Authors: Carla Alberti, Diego Díaz Rioseco, Ignacio Riveros
Year: 2023Source: Latin American Politics and Society
ABSTRACT Extant literature concurs that fiscal transfers affect local democracy when they grant subnational governments nontax revenue. Yet there is nonetheless a mismatch between this concept and exi...

Evaluating Regular Path Queries in GQL and SQL/PGQ: How Far Can The Classical Algorithms Take Us?

Authors: Domagoj Vrgoč, Carlos Rojas, Benjamín Farías
Year: 2023Source: arXiv (Cornell University)
Path queries are a core feature of modern graph query languages such as Cypher, SQL/PGQ, and GQL. These languages provide a rich set of features for matching paths, such as restricting to certain path...

Cross-Lingual and Cross-Domain Crisis Classification for Low-Resource Scenarios

Authors: Jorge Pérez, Bárbara Poblete, Hernán Sarmiento et al.
Year: 2023Source: Proceedings of the International AAAI Conference on Web and Social Media
Social media data has emerged as a useful source of timely information about real-world crisis events. One of the main tasks related to the use of social media for disaster management is the automatic...

Characterizing and Identifying Socially Shared Self-Descriptions in Product Reviews

Authors: Bárbara Poblete, Vanessa Murdock, Chia-Jung Lee et al.
Year: 2023Source: Proceedings of the International AAAI Conference on Web and Social Media
Online e-commerce product reviews can be highly influential in a customer's decision-making processes. Reviews often describe personal experiences with a product and provide candid opinions about a pr...

Fair multilingual vandalism detection system for Wikipedia

Authors: Ricardo Baeza-Yates, Diego Sáez-Trumper, Mykola Trokhymovych et al.
Year: 2023Source: arXiv (Cornell University)
This paper presents a novel design of the system aimed at supporting the Wikipedia community in addressing vandalism on the platform. To achieve this, we collected a massive dataset of 47 languages, a...

GPC: A Pattern Calculus for Property Graphs

Authors: Domagoj Vrgoč, Leonid Libkin, Wim Martens et al.
Year: 2023
International audience

The ACM PODS Alberto O. Mendelzon Test-of-Time Award 2023

Authors: Marcelo Arenas, Wenfei Fan, Frank Neven
Year: 2023
Citations for the The ACM PODS Alberto O. Mendelzon Test-of-Time Award 2023

Data Stories of Water: Studying the Communicative Role of Data Visualizations within Long‐form Journalism

Authors: Denis Parra, Manuela Garretón, Francesca Morini et al.
Year: 2023Source: Computer Graphics Forum
Abstract We present a methodology for making sense of the communicative role of data visualizations in journalistic storytelling and share findings from surveying water‐related data stories. Data st...

Framing school choice and merit: news media coverage of an education policy in Chile

Authors: Magdalena Saldaña, Cristian Cabalín, M. Beatriz Fernández
Year: 2023Source: Discourse Studies in the Cultural Politics of Education
School choice is a controversial issue in the public discussion of education. In Chile, the new School Admission System (SAE) was recently implemented to gradually reverse the country’s high educati...

The value of mathematical modelling approaches in epidemiology for public health decision making

Authors: Ricardo Baeza-Yates, Martha Ospina, Oscar H. Franco et al.
Year: 2023Source: Colombian Journal of Anesthesiology
It is discussed the relevance of quantitative approaches, specifically mathematical modelling in epidemiology, in the public health decision-making process. This topic is discussed here based on the e...

Engineering Rank/Select Data Structures for Big-Alphabet Strings

Authors: Diego Arroyuelo, Erick Sepúlveda, Francisco Riveros et al.
Year: 2023Source: arXiv (Cornell University)
Big-alphabet strings are common in several scenarios such as information retrieval and natural-language processing. The efficient storage and processing of such strings usually introduces several chal...

Separating Automatic Relations

Authors: Pablo Barceló, Diego Figueira, Rémi Morvan
Year: 2023Source: arXiv (Cornell University)
We study the separability problem for automatic relations (i.e., relations on finite words definable by synchronous automata) in terms of recognizable relations (i.e., finite unions of products of reg...

MUSIB: musical score inpainting benchmark

Authors: Denis Parra, Felipe Bravo-Márquez, Rodrigo F. Cádiz et al.
Year: 2023Source: EURASIP Journal on Audio Speech and Music Processing
Abstract Music inpainting is a sub-task of automated music generation that aims to infill incomplete musical pieces to help musicians in their musical composition process. Many methods have been devel...

Uneven States, Unequal Societies, and Democracy’s Unfulfilled Promises: Citizenship Rights in Chile and Contemporary Latin America

Authors: Juan Pablo Luna, Rodrigo M. Medel
Year: 2023Source: Latin American Politics and Society
ABSTRACT In contemporary Latin America, deep-seated social discontent with political elites and institutions has been, paradoxically, the counterpart of democratic stability and resilience. This parad...

RDF Playground: An Online Tool for Learning about the Semantic Web

Authors: Aidan Hogan, Raúl Cid, Bastián Inostroza
Year: 2023
We present RDF Playground: a web-based tool to assist those who wish to learn or teach about the Semantic Web. The tool integrates functionalities relating to the key features of RDF, allowing users t...

Templet: A Collaborative System for Knowledge Graph Question Answering over Wikidata

Authors: Aidan Hogan, Francisca Suárez
Year: 2023
We present Templet: an online question answering (QA) system for Wikidata. Templet is based on the collaboratively-edited repository QAWiki, which collects questions in multiple natural languages alon...

Wikidata Atlas: Putting Wikidata on the Map

Authors: Aidan Hogan, Benjamín Del Pino
Year: 2023
Wikidata Atlas is an online system that allows users to explore Wikidata items on an interactive global map; for example, users can explore the global distribution of all lighthouses described by Wiki...

A convolutional architecture for 3D model embedding using image views

Authors: Benjamín Bustos, Ivan Sipiran, Arniel Labrada
Year: 2023Source: The Visual Computer

13th Temporal Web Analytics Workshop (TempWeb) Overview

Authors: Ricardo Baeza-Yates, Omar Alonso, Marc Spaniol
Year: 2023
International audience

Lacking time: A case study of student and faculty perceptions of academic workload in the COVID‐19 pandemic

Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo
Year: 2023Source: Journal of Engineering Education
Abstract Background To avoid the spread of COVID‐19, most engineering programs rapidly shifted to emergency online education, and prior research has associated online education with academic overloa...

Using diversity as a source of scientific innovation for the Web

Authors: Bárbara Poblete
Year: 2023Source: Proceedings of the ACM Web Conference 2022
The Web has become a resource that allows us to make sense of social phenomena around the world. This started the moment users became content creators, and has grown with the emergence of social platf...

A Study on Information Disorders on Social Networks during the Chilean Social Outbreak and COVID-19 Pandemic

Authors: Marcelo Mendoza, Sebastián Valenzuela, Claudia López et al.
Year: 2023Source: Applied Sciences
Information disorders on social media can have a significant impact on citizens’ participation in democratic processes. To better understand the spread of false and inaccurate information online, th...

10 Years of Digital Journalism (Studies): The Past, the Present, the Future

Authors: Magdalena Saldaña, Oscar Westlund, Edson C. Tandoc et al.
Year: 2023Source: Digital Journalism
The Digital Journalism editorial team is thrilled to introduce this 10th anniversary special issue. At the beginning of 2022, we invited our international editorial board to contribute to this importa...

Digital Journalism: The Journal and the Path that Brought us Here

Authors: Magdalena Saldaña, Oscar Westlund, Edson C. Tandoc et al.
Year: 2023Source: Digital Journalism
Click to increase image sizeClick to decrease image size Disclosure StatementNo potential conflict of interest was reported by the author(s).

Human-Centered Responsible Artificial Intelligence: Current & Future Trends

Authors: Jessica Vitak, Mohammad Tahaei, Seán Kennedy et al.
Year: 2023
In recent years, the CHI community has seen significant growth in research on\nHuman-Centered Responsible Artificial Intelligence. While different research\ncommunities may use different terminology t...

The long memory of the land: Pre-colonial origins of Mapuche mobilization in Chile

Authors: Sergio Toro, Carla Alberti, Juan Pablo Luna et al.
Year: 2023Source: Political Geography

Contextual Linear Types for Differential Privacy

Authors: Matías Toro, Eric Tanter, Federico Olmedo et al.
Year: 2023Source: ACM Transactions on Programming Languages and Systems
Language support for differentially private programming is both crucial and delicate. While elaborate program logics can be very expressive, type-system-based approaches using linear types tend to be ...

A Gradual Probabilistic Lambda Calculus

Authors: Matías Toro, Federico Olmedo, Wenjia Ye
Year: 2023Source: Proceedings of the ACM on Programming Languages
Probabilistic programming languages have recently gained a lot of attention, in particular due to their applications in domains such as machine learning and differential privacy. To establish invarian...

GenoVi, an open-source automated circular genome visualizer for bacteria and archaea

Authors: Carlos Buil-Aranda, Mauricio Araya, Nicolás Jara et al.
Year: 2023Source: PLoS Computational Biology
The increase in microbial sequenced genomes from pure cultures and metagenomic samples reflects the current attainability of whole-genome and shotgun sequencing methods. However, software for genome v...

Studying the Downstream Effects of Fact-Checking on Social Media: Experiments on Correction Formats, Belief Accuracy, and Media Trust

Authors: Sebastián Valenzuela, Ingrid Bachmann
Year: 2023Source: Social Media + Society
Repeated exposure to misinformation not only reduces the accuracy of people’s beliefs, but it also decreases confidence in institutions such as the news media. Can fact-checking—journalism’s mai...

Compact representations of spatial hierarchical structures with support for topological queries

Authors: Gonzalo Navarro, José Fuentes‐Sepúlveda, Diego Seco et al.
Year: 2023Source: Information and Computation

A Researcher's Digest of GQL

Authors: Domagoj Vrgoč, Leonid Libkin, Wim Martens et al.
Year: 2023Source: HAL (Le Centre pour la Communication Scientifique Directe)
GQL (Graph Query Language) is being developed as a new ISO standard for graph query languages to play the same role for graph databases as SQL plays for relational. In parallel, an extension of SQL fo...

Three iterations of $(d-1)$-WL test distinguish non isometric clouds of $d$-dimensional points

Authors: Pablo Barceló, Mircea Petrache, Alexander Kozachinskiy et al.
Year: 2023Source: arXiv (Cornell University)
The Weisfeiler--Lehman (WL) test is a fundamental iterative algorithm for checking isomorphism of graphs. It has also been observed that it underlies the design of several graph neural network archite...

The Chilean Waiting List sub-Corpus with medical entities normalized to UMLS terminology

Authors: Jocelyn Dunstan, Pablo Báez, Leonardo Campillos Llanos
Year: 2023Source: Zenodo (CERN European Organization for Nuclear Research)
A collection of 2000 medical referrals from the Chilean Waiting List Corpus, manually annotated with six entity types (Finding, Procedure, Disease, Family Member, Body Part, and Medication) and manual...

The Chilean Waiting List sub-Corpus with medical entities normalized to UMLS terminology

Authors: Jocelyn Dunstan, Pablo Báez, Leonardo Campillos Llanos
Year: 2023Source: Zenodo (CERN European Organization for Nuclear Research)
A collection of 2000 medical referrals from the Chilean Waiting List Corpus, manually annotated with six entity types (Finding, Procedure, Disease, Family Member, Body Part, and Medication) and manual...

Efficient Computation of Shap Explanation Scores for Neural Network Classifiers via Knowledge Compilation

Authors: Leopoldo Bertossi, Jorge E. Leon
Year: 2023Source: arXiv (Cornell University)
The use of Shap scores has become widespread in Explainable AI. However, their computation is in general intractable, in particular when done with a black-box classifier, such as neural network. Recen...

A named entity recognition framework using transformers to identify relevant clinical findings from mammographic radiological reports

Authors: Denis Parra, Eduardo Godoy, Alejandro Veloz et al.
Year: 2023
Detecting and extracting findings in a radiological report is crucial for text mining tasks in several applications. In this case, a labeled process for the image associated with the radiological repo...

Attribution-Scores and Causal Counterfactuals as Explanations in Artificial Intelligence

Authors: Leopoldo Bertossi
Year: 2023Source: arXiv (Cornell University)
In this expository article we highlight the relevance of explanations for artificial intelligence, in general, and for the newer developments in {\em explainable AI}, referring to origins and connecti...

Representing Paths in Graph Database Pattern Matching

Authors: Domagoj Vrgoč, Carlos Rojas, Stijn Vansummeren et al.
Year: 2023Source: Proceedings of the VLDB Endowment
Modern graph database query languages such as GQL, SQL/PGQ, and their academic predecessor G-Core promote paths to first-class citizens in the sense that their pattern matching facility can return pat...

Attitudinal effects of data visualizations and illustrations in data stories

Authors: Denis Parra, Manuela Garretón, Francesca Morini et al.
Year: 2023Source: IEEE Transactions on Visualization and Computer Graphics
Journalism has become more data-driven and inherently visual in recent years. Photographs, illustrations, infographics, data visualizations, and general images help convey complex topics to a wide aud...

Influence of surgical reduction on dynamic balance in patients after unstable ankle fracture.

Authors: Aidan Hogan, Ursula Trinler, Sven Y. Vetter et al.
Year: 2023Source: Gait & Posture

A Theory of Link Prediction via Relational Weisfeiler-Leman on Knowledge Graphs

Authors: Pablo Barceló, Miguel Romero Orth, Xingyue Huang et al.
Year: 2023Source: arXiv (Cornell University)
Graph neural networks are prominent models for representation learning over graph-structured data. While the capabilities and limitations of these models are well-understood for simple graphs, our und...

New insights from GWAS on BMI-related growth traits in a longitudinal cohort of admixed children with Native American and European ancestry

Authors: Susana Eyheramendy, Lucas Vicuña, Tomás Norambuena et al.
Year: 2023Source: iScience
Body-mass index (BMI) is a hallmark of adiposity. In contrast with adulthood, the genetic architecture of BMI during childhood is poorly understood. The few genome-wide association studies (GWAS) on c...

Online estimation methods for irregular autoregressive models

Authors: Susana Eyheramendy, Wilfredo Palma, Felipe Elorrieta et al.
Year: 2023Source: arXiv (Cornell University)
In the last decades, due to the huge technological growth observed, it has become increasingly common that a collection of temporal data rapidly accumulates in vast amounts. This provides an opportuni...

Predicting no-show appointments in a pediatric hospital in Chile using machine learning

Authors: Jocelyn Dunstan, Fabián Villena, Juan Peypouquet et al.
Year: 2023Source: Health Care Management Science
The Chilean public health system serves 74% of the country's population, and 19% of medical appointments are missed on average because of no-shows. The national goal is 15%, which coincides with the a...

The Personal Is the Political? What Do WhatsApp Users Share and How It Matters for News Knowledge, Polarization and Participation in Chile

Authors: Sebastián Valenzuela, Matías Bargsted, Ingrid Bachmann
Year: 2023Source: Routledge eBooks

Stronger and Safer Together

Authors: Magdalena Saldaña, Lourdes M. Cueva Chacón
Year: 2023Source: Routledge eBooks

Indigenous autonomy and Latin American state security in contexts of criminal violence: the cases of Cauca in Colombia and Guerrero in Mexico

Authors: Carla Alberti, Shannan Mattiace
Year: 2023Source: Latin American and Caribbean Ethnic Studies
Scholars writing on Indigenous autonomy in the Americas have focused mainly on social movement demands and on the implementation of laws that enshrine autonomy rights. The motives of state officials i...

Predicting disease severity in Multiple Sclerosis using multimodal data and machine learning

Authors: Nicole Kerlero de Rosbo, Janina Behrens, Susanna Asseyer et al.
Year: 2023Source: Research Square (Research Square)
Abstract Background Multiple Sclerosis patients would benefit from machine learning algorithms that integrates clinical, imaging, and multimodal biomarkers to define the risk of disease activity. Meth...

Enumeration and updates for conjunctive linear algebra queries through expressibility

Authors: Cristian Riveros, Stijn Vansummeren, Thomas Muñoz
Year: 2023Source: arXiv (Cornell University)
Due to the importance of linear algebra and matrix operations in data analytics, there is significant interest in using relational query optimization and processing techniques for evaluating (sparse) ...

Towards a Comprehensive Human-Centred Evaluation Framework for Explainable AI

Authors: Denis Parra, Katrien Verbert, Ivania Donoso-Guzmán et al.
Year: 2023Source: Communications in computer and information science

MillenniumDB: An Open-Source Graph Database System

Authors: Cristian Riveros, Aidan Hogan, Carlos Buil-Aranda et al.
Year: 2023Source: Data Intelligence
ABSTRACT In this systems paper, we present MillenniumDB: a novel graph database engine that is modular, persistent, and open source. MillenniumDB is based on a graph data model, which we call domain g...

Evaluating Regular Path Queries on Compressed Adjacency Matrices

Authors: Gonzalo Navarro, Diego Arroyuelo, Adrián Gómez-Brandón et al.
Year: 2023Source: Lecture notes in computer science

A Comprehensive and Curated Dataset of Covid-19 and Epistemonikos Evidence

Authors: Denis Parra, Hans Löbel, Andrés Carvallo et al.
Year: 2023Source: SSRN Electronic Journal
The emergence of COVID-19 has highlighted the importance of reliable information for clinical decision-making and public health policies. Evidence-based medicine (EBM) seeks to identify and evaluate s...

Uncovering Bias in Personal Informatics

Authors: Ricardo Baeza-Yates, Athena Vakali, Pavlos Sermpezis et al.
Year: 2023Source: arXiv (Cornell University)
Personal informatics (PI) systems, powered by smartphones and wearables, enable people to lead healthier lifestyles by providing meaningful and actionable insights that break down barriers between use...

Understanding Search Behavior Bias in Wikipedia

Authors: Ricardo Baeza-Yates, Bruno Scarone, Erik Bernhardson
Year: 2023Source: Communications in computer and information science

Constant Time and Space Updates for the Sigma-Tau Problem

Authors: Gonzalo Navarro, Aaron Williams, Zsuzsanna Lipták et al.
Year: 2023Source: Lecture notes in computer science

Dynamic Compact Data Structure for Temporal Reachability with Unsorted Contact Insertions

Authors: Gonzalo Navarro, Bruno Augusto Nassif Travençolo, Marcelo Keese Albertini et al.
Year: 2023Source: arXiv (Cornell University)
Temporal graphs represent interactions between entities over time. Deciding whether entities can reach each other through temporal paths is useful for various applications such as in communication net...

Wheeler maps

Authors: Gonzalo Navarro, Travis Gagie, Jouni Sirén et al.
Year: 2023Source: arXiv (Cornell University)
Motivated by challenges in pangenomic read alignment, we propose a generalization of Wheeler graphs that we call Wheeler maps. A Wheeler map stores a text $T[1..n]$ and an assignment of tags to the ch...

Maintaining the cycle structure of dynamic permutations

Authors: Gonzalo Navarro, Zsuzsanna Lipták, Francesco Masillo
Year: 2023Source: arXiv (Cornell University)
We present a new data structure for maintaining dynamic permutations, which we call a $\textit{forest of splay trees (FST)}$. The FST allows one to efficiently maintain the cycle structure of a permut...

A Simple Grammar-Based Index for Finding Approximately Longest Common Substrings

Authors: Gonzalo Navarro, Travis Gagie, Sana Kashgouli
Year: 2023Source: Lecture notes in computer science

Bimodal Neural Style Transfer for Image Generation Based on Text Prompts

Authors: Marcelo Mendoza, Diego Gutiérrez
Year: 2023Source: Lecture notes in computer science

Supporting Users in Refining and Comparing Topic Models: An Experimental Study

Authors: Marcelo Mendoza, Evangelos Milios, Fernando V. Paulovich et al.
Year: 2023
Topic modeling is a statistical approach for extracting themes from high volumes of textual data. Humans are needed to interpret its outputs, which include sets of terms and scores. Lately, visualizat...

Bimodal Style Transference from Musical Composition to Image Using Deep Generative Models

Authors: Marcelo Mendoza, María José Apolo
Year: 2023Source: Lecture notes in computer science

Work-in-Progress: Decision Support System for the Process of Student Academic Registration

Authors: Renzo Angles, Luís Silvestre, Fabian Olivares et al.
Year: 2023Source: Lecture notes in networks and systems

Countermeasures for Mitigating Digital Misinformation: A Systematic Review

Authors: Sebastián Valenzuela, Wendy Hui Kyong Chun, Philip N. Howard et al.
Year: 2023
This Synthesis Report provides a formal systematic review of scientific literature on countermeasures for mitigating digital misinformation. We focus on 588 peer-reviewed publications, drawn from arou...

Platform Responses to Misinformation: A Meta-Analysis of Data

Authors: Sebastián Valenzuela, Wendy Hui Kyong Chun, Philip N. Howard et al.
Year: 2023
Digital misinformation is a critical issue affecting the global information environment. Countering misinformation and its effects is a major objective of governments, international organizations, con...

How Do Centrality Measures Choose the Root of Trees?

Authors: Cristian Riveros, Oskar Skibski, Jorge Salas
Year: 2023Source: arXiv (Cornell University)
Centrality measures are widely used to assign importance to graph-structured data. Recently, understanding the principles of such measures has attracted a lot of attention. Given that measures are div...

Separating Automatic Relations

Authors: Pablo Barceló, Diego Figueira, Rémi Morvan
Year: 2023Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
We study the separability problem for automatic relations (i.e., relations on finite words definable by synchronous automata) in terms of recognizable relations (i.e., finite unions of products of reg...

Compact Data Structures Meet Databases (Invited Talk)

Authors: Cristian Riveros, Aidan Hogan, Carlos Buil-Aranda et al.
Year: 2023Source: arXiv (Cornell University)
We describe two success stories on the application of compact data structures (cds) to solve the problem of the excessively redundant space requirements posed by worst-case-optimal (wco) algorithms fo...

A Novel First-Order Autoregressive Moving Average Model to Analyze Discrete-Time Series Irregularly Observed

Authors: Susana Eyheramendy, Wilfredo Palma, César Ojeda et al.
Year: 2023Source: Contributions to statistics
A novel first-order autoregressive moving average model for analyzing discrete-time series observed at irregularly spaced times is introduced. Under Gaussianity, it is established that the model is st...

Extending time-series models for irregular observational gaps with a moving average structure for astronomical sequences

Authors: Susana Eyheramendy, Wilfredo Palma, César Ojeda et al.
Year: 2023Source: RAS Techniques and Instruments
ABSTRACT In this study, we introduce a novel moving-average model for analyzing stationary time-series observed irregularly in time. The process is strictly stationary and ergodic under normality and ...

Online Estimation Methods for Irregular Autoregressive Models

Authors: Susana Eyheramendy, Wilfredo Palma, Felipe Elorrieta et al.
Year: 2023Source: Contributions to statistics
In the last decades, due to the huge technological growth observed, it has become increasingly common that a collection of temporal data rapidly accumulates in vast amounts. This provides an opportuni...

Size Bounds and Algorithms for Conjunctive Regular Path Queries

Authors: Aidan Hogan, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2023Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
Conjunctive regular path queries (CRPQs) are one of the core classes of queries over graph databases. They are join intensive, inheriting their structure from the relational setting, but they also all...

r-indexing without backward searching

Authors: Gonzalo Navarro, Travis Gagie, Nicola Prezza et al.
Year: 2023Source: arXiv (Cornell University)
Suppose we are given a text $T$ of length $n$ and a straight-line program for $T$ with $g$ rules. Let $\bar{r}$ be the number of runs in the Burrows-Wheeler Transform of the reverse of $T$. We can ind...

Faster Maximal Exact Matches with Lazy LCP Evaluation

Authors: Gonzalo Navarro, Travis Gagie, Adrián Goga et al.
Year: 2023Source: arXiv (Cornell University)
MONI (Rossi et al., {\it JCB} 2022) is a BWT-based compressed index for computing the matching statistics and maximal exact matches (MEMs) of a pattern (usually a DNA read) with respect to a highly re...

Pre-trained language models in Spanish for health insurance coverage

Authors: Jocelyn Dunstan, Víctor Rocco, Claudio Aracena et al.
Year: 2023
The field of clinical natural language processing (NLP) can extract useful information from clinical text. Since 2017, the NLP field has shifted towards using pre-trained language models (PLMs), impro...

Development of pre-trained language models for clinical NLP in Spanish

Authors: Jocelyn Dunstan, Claudio Aracena
Year: 2023
Clinical natural language processing aims to tackle language and prediction tasks using text from medical practice, such as clinical notes, prescriptions, and discharge summaries. Several approaches h...

Automatic Coding at Scale: Design and Deployment of a Nationwide System for Normalizing Referrals in the Chilean Public Healthcare System

Authors: Jocelyn Dunstan, Fabián Villena, Matías Rojas et al.
Year: 2023
The disease coding task involves assigning a unique identifier from a controlled vocabulary to each disease mentioned in a clinical document. This task is relevant since it allows information extracti...

Globalization & The Challenging Political Economy of Governing (and Researching) Islands in Contemporary Times

Authors: Juan Pablo Luna
Year: 2023Source: Social and ecological interactions in the Galapagos Islands

Attribution-Scores and Causal Counterfactuals as Explanations in Artificial Intelligence

Authors: Leopoldo Bertossi
Year: 2023Source: Lecture notes in computer science

From Database Repairs to Causality in Databases and Beyond

Authors: Leopoldo Bertossi
Year: 2023Source: Lecture notes in computer science

Reasoning Web. Causality, Explanations and Declarative Knowledge

Authors: Leopoldo Bertossi, Guohui Xiao
Year: 2023Source: Lecture notes in computer science

Efficient Computation of Shap Explanation Scores for Neural Network Classifiers via Knowledge Compilation

Authors: Leopoldo Bertossi, Jorge Esquiche León
Year: 2023Source: Lecture notes in computer science

Attribution-Scores in Data Management and Explainable Machine Learning

Authors: Leopoldo Bertossi
Year: 2023Source: Lecture notes in computer science

Publications for 2022

Displaying 210 publication(s) for 2022

Medios de comunicación y confianza política en América Latina: análisis individual y contextual del rol de las noticias en la confianza en el gobierno y el Estado

Authors: Sebastián Valenzuela, Ingrid Bachmann, Daniela Grassau et al.
Year: 2022Source: Revista Internacional de Sociología
¿Cuál es la asociación entre exposición a noticias y confianza política en Latinoamérica? ¿Hay diferencias según la libertad del sistema de medios y los niveles de polarización política? Par...

Practical Random Access to SLP-Compressed Texts

Authors: Gonzalo Navarro, Travis Gagie, Giovanni Manzini et al.
Year: 2022Source: Lecture notes in computer science
Grammar-based compression is a popular and powerful approach to compressing repetitive texts but until recently its relatively poor time-space trade-offs during real-life construction made it impracti...

On Dynamic Succinct Graph Representations

Authors: Gonzalo Navarro, Guillermo de Bernardo, Susana Ladra et al.
Year: 2022
We address the problem of representing dynamic graphs using k <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> -trees. The k <sup xmlns:mml="http:...

Approximating Optimal Bidirectional Macro Schemes

Authors: Gonzalo Navarro, Luís M. S.Russo, Alexandre P. Francisco et al.
Year: 2022
Lempel-Ziv is an easy-to-compute member of a wide family of so-called macro schemes; it restricts pointers to go in one direction only. Optimal bidirectional macro schemes are NP-complete to find, but...

Two-Dimensional Block Trees

Authors: Gonzalo Navarro, Travis Gagie, Adrián Gómez-Brandón et al.
Year: 2022Source: The Computer Journal
Abstract The Block Tree is a data structure for representing repetitive sequences in compressed space, which reaches space comparable with that of Lempel–Ziv compression while retaining fast direct ...

Learning to cluster urban areas: two competitive approaches and an empirical validation

Authors: Marcelo Mendoza, Sergio Toro, Hans Löbel et al.
Year: 2022Source: EPJ Data Science
Abstract Urban clustering detects geographical units that are internally homogeneous and distinct from their surroundings. It has applications in urban planning, but few studies compare the effectiven...

Exploration Trade-offs in Web Recommender Systems

Authors: Ricardo Baeza-Yates, Giovanni Delnevo, Ricardo Baeza‐Yates
Year: 2022Source: 2021 IEEE International Conference on Big Data (Big Data)
One of the main problems of web recommender systems is exposure bias, due to the fact that the web system itself is partly generating its own future, as users can only click on items shown to them. Th...

A scalable and energy efficient GPU thread map for m-simplex domains

Authors: Benjamín Bustos, Felipe A. Quezada, Cristóbal A. Navarro et al.
Year: 2022Source: Future Generation Computer Systems

Test datasets for GenoVi: draft and complete genomes

Authors: Carlos Buil-Aranda, Mauricio Araya, Nicolás Jara et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Test dataset for GenoVi (Genome Visualizer).<br> All the genomes available in this repository were used to create all the analysis done by Cumsille et al., 2022 for the publication of GenoVi.

Test datasets for GenoVi: draft and complete genomes

Authors: Carlos Buil-Aranda, Mauricio Araya, Nicolás Jara et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Test dataset for GenoVi (Genome Visualizer).<br> All the genomes available in this repository were used to create all the analysis done by Cumsille et al., 2022 for the publication of GenoVi.

Extending Sticky-Datalog+/- via Finite-Position Selection Functions: Tractability, Algorithms, and Optimization

Authors: Leopoldo Bertossi, Mostafa Milani
Year: 2022Source: Information Systems

Trie-Compressed Intersectable Sets

Authors: Diego Arroyuelo, Juan P. Castillo
Year: 2022Source: arXiv (Cornell University)
We introduce space- and time-efficient algorithms and data structures for the offline set intersection problem. We show that a sorted integer set $S \subseteq [0{..}u)$ of $n$ elements can be represen...

Human vs. Artificial Intelligence

Authors: Ricardo Baeza-Yates, Pablo Villoslada
Year: 2022
In this essay we compare human and artificial intelligence from two points of view: computational and neuroscience. We discuss the differences and limitations of AI with respect to our intelligence, e...

Report on the 12th Temporal Web Analytics Workshop (TempWeb 2022) at WWW 2022

Authors: Ricardo Baeza-Yates, Omar Alonso, Marc Spaniol
Year: 2022Source: ACM SIGIR Forum
TempWeb focuses on investigating infrastructures, scalable methods, and innovative software for aggregating, querying, and analyzing heterogeneous data at Web scale. Emphasis is given to data analysis...

Using Automated Planning to Provide Feedback during Collaborative Problem-Solving

Authors: Jorge Baier, Miguél Nussbaum, María Fernanda Rodríguez et al.
Year: 2022Source: International Journal of Artificial Intelligence in Education

Weisfeiler and Leman Go Relational

Authors: Pablo Barceló, Mikhail Galkin, Christopher G. Morris et al.
Year: 2022Source: arXiv (Cornell University)
Knowledge graphs, modeling multi-relational data, improve numerous applications such as question answering or graph logical reasoning. Many graph neural networks for such data emerged recently, often ...

LSQ 2.0: A linked dataset of SPARQL query logs

Authors: Aidan Hogan, Carlos Buil-Aranda, Axel-Cyrille Ngonga Ngomo et al.
Year: 2022Source: Semantic Web
We present the Linked SPARQL Queries (LSQ) dataset, which currently describes 43.95 million executions of 11.56 million unique SPARQL queries extracted from the logs of 27 different endpoints. The LSQ...

Gradual System F

Authors: Elizabeth Labrada, Matías Toro, Eric Tanter et al.
Year: 2022Source: Journal of the ACM
Bringing the benefits of gradual typing to a language with parametric polymorphism like System F, while preserving relational parametricity, has proven extremely challenging: first attempts were formu...

Fiscal Origins of Subnational Democracy: Evidence from Argentina

Authors: Carla Alberti, Diego Díaz Rioseco
Year: 2022Source: Politics & Society
Subnational governments are generally funded by fiscal rents, that is, transfers of centrally levied taxes. Existing literature concurs that fiscal federalism breeds rentierism and, consequently, hind...

Toward a Definitive Compressibility Measure for Repetitive Sequences

Authors: Nicola Prezza, Tomasz Kociumaka, Gonzalo Navarro
Year: 2022Source: IEEE Transactions on Information Theory
While the <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$k$ </tex-math></inline-formula> th order empirical entr...

Counting the Answers to a Query

Authors: Cristian Riveros, Marcelo Arenas, Rajesh Jayaram et al.
Year: 2022Source: ACM SIGMOD Record
Counting the answers to a query is a fundamental problem in databases, with several applications in the evaluation, optimization, and visualization of queries. Unfortunately, counting query answers is...

PG-Schema: Schemas for Property Graphs

Authors: Renzo Angles, Domagoj Vrgoč, Juan Sequeda et al.
Year: 2022Source: arXiv (Cornell University)
Property graphs have reached a high level of maturity, witnessed by multiple robust graph database systems as well as the ongoing ISO standardization effort aiming at creating a new standard Graph Que...

Issue Information

Authors: Gonzalo Navarro, Christoph Lange, Han‐Na Kim et al.
Year: 2022Source: European Journal Of Haematology

“What a nasty girl!” incivility and gendered symbolic violence in news discussions

Authors: Magdalena Saldaña, Valentina Proust
Year: 2022Source: Feminist Media Studies
This study examines conversations developed in the virtual public sphere to identify if a user’s gender affects the presence of incivility in news comment sections. By relying on a mixed-method anal...

No Agreement Without Loss: Learning and Social Choice in Peer Review

Authors: Pablo Barceló, Tomasz Steifer, Cristóbal Rojas et al.
Year: 2022Source: arXiv (Cornell University)
In peer review systems, reviewers are often asked to evaluate various features of submissions, such as technical quality or novelty. A score is given to each of the predefined features and based on th...

La salud en la era digital

Authors: Claudio Gutiérrez, Mercedes López
Year: 2022Source: Revista Médica Clínica Las Condes
¿Qué cambios trae el mundo digital a la forma como abordamos la salud? ¿Cómo están incidiendo las tecnologías digitales en la medicina? Este artículo presenta una panorámica sobre estos temas,...

Aplicaciones de aprendizaje automático en salud

Authors: Jocelyn Dunstan, Fabián Villena, Claudio Aracena et al.
Year: 2022Source: Revista Médica Clínica Las Condes
Resumen: El presente trabajo tiene por objetivo mostrar algunas aplicaciones recientes de aprendizaje automático en el área de la salud. El aprendizaje automático o machine learning es una rama de ...

Procesamiento de lenguaje natural para texto clínico en español: el caso de las listas de espera en Chile

Authors: Jocelyn Dunstan, Pablo Báez, Fredy Núñez Torres et al.
Year: 2022Source: Revista Médica Clínica Las Condes
The waiting lists not covered by the Explicit Health Guarantee Plan for new specialty consultation in Chile increased due to the effects of the SARS-CoV-2 coronavirus (COVID-19) pandemic. This represe...

On the expressiveness of Lara: A proposal for unifying linear and relational algebra

Authors: Pablo Barceló, Nelson Higuera, Jorge Pérez et al.
Year: 2022Source: Theoretical Computer Science

On Computing Probabilistic Explanations for Decision Trees

Authors: Pablo Barceló, Bernardo Subercaseaux, Marcelo Arenas et al.
Year: 2022Source: Conference on Neural Information Processing Systems (NeurIPS 2022)

GPC: A Pattern Calculus for Property Graphs

Authors: Nadime Francis, Victor Marsault, Paolo Guagliardo et al.
Year: 2022Source: arXiv (Cornell University)
The development of practical query languages for graph databases runs well ahead of the underlying theory. The ISO committee in charge of database query languages is currently developing a new standar...

Datasets of Time- and Space-Efficient Regular Path Queries

Authors: Aidan Hogan, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Datasets that were used in the experiments of our work <em>Time- and Space-Efficient Regular Path Queries.</em>

Datasets of Time- and Space-Efficient Regular Path Queries

Authors: Aidan Hogan, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Datasets that were used in the experiments of our work <em>Time- and Space-Efficient Regular Path Queries.</em>

Space/time-efficient RDF stores based on circular suffix sorting

Authors: Gonzalo Navarro, Guillermo de Bernardo, Nieves R. Brisaboa et al.
Year: 2022Source: The Journal of Supercomputing

CLNews: The First Dataset of the Chilean Social Outbreak for Disinformation Analysis

Authors: Marcelo Mendoza, Eliana Providel, Daniel Toro-González et al.
Year: 2022Source: Proceedings of the 31st ACM International Conference on Information & Knowledge Management
Disinformation is one of the main threats that loom on social networks. Detecting disinformation is not trivial and requires training and maintaining fact-checking teams, which is labor-intensive. Rec...

Simple and efficient bi-objective search algorithms via fast dominance checks

Authors: Jorge Baier, Carlos Hernández, Luis Suazo et al.
Year: 2022Source: Artificial Intelligence

Another Violent Protest? New Perspectives to Understand Protest Coverage

Authors: Magdalena Saldaña, Valentina Proust
Year: 2022Source: Media and Communication
This study assesses the relationship between two well-established sets of frames to better understand the news coverage of massive political protests. By relying on Semetko and Valkenburg’s generic ...

Gradual C0: Symbolic Execution for Gradual Verification

Authors: Eric Tanter, Joshua Sunshine, Jonathan Aldrich et al.
Year: 2022Source: arXiv (Cornell University)
Current static verification techniques support a wide range of programs. However, such techniques only support complete and detailed specifications, which places an undue burden on users. To solve thi...

An automatic methodology to measure drivers’ behavior in public transport

Authors: Hans Löbel, Juan Carlos Herrera, Hernan F. Catalan
Year: 2022Source: Journal of Intelligent Transportation Systems
The way in which public transport buses are driven has an influence in users’perception and satisfaction with the service. Bus driver’s behavior is usually obtained surveying passengers and/or usi...

Multi-Agent Path Finding: A New Boolean Encoding

Authors: Roberto Asin Acha, Rodrigo Lopez, Sebastian Hagedorn et al.
Year: 2022Source: Journal of Artificial Intelligence Research
Multi-agent pathfinding (MAPF) is an NP-hard problem. As such, dense maps may be very hard to solve optimally. In such scenarios, compilation-based approaches, via Boolean satisfiability (SAT) and ans...

Constant-delay enumeration for SLP-compressed documents

Authors: Cristian Riveros, Martı́n Muñoz
Year: 2022Source: arXiv (Cornell University)
We study the problem of enumerating results from a query over a compressed document. The model we use for compression are straight-line programs (SLPs), which are defined by a context-free grammar tha...

Answer-Set Programs for Repair Updates and Counterfactual Interventions

Authors: Leopoldo Bertossi
Year: 2022Source: arXiv (Cornell University)
We briefly describe -- mainly through very simple examples -- different kinds of answer-set programs with annotations that have been proposed for specifying: database repairs and consistent query answ...

Explainable neural image recommendation using Network Dissection visual concepts

Authors: Denis Parra, Hans Löbel, Antonio Ossa-Guerra
Year: 2022

Training and intrinsic evaluation of lightweight word embeddings for the clinical domain in Spanish

Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Frontiers in Artificial Intelligence
Resources for Natural Language Processing (NLP) are less numerous for languages different from English. In the clinical domain, where these resources are vital for obtaining new knowledge about human ...

A Reasonably Gradual Type Theory

Authors: Eric Tanter, Kenji Maillard, Meven Lennon-Bertrand et al.
Year: 2022Source: Proceedings of the ACM on Programming Languages
Gradualizing the Calculus of Inductive Constructions (CIC) involves dealing with subtle tensions between normalization, graduality, and conservativity with respect to CIC. Recently, GCIC has been prop...

Propositional Equality for Gradual Dependently Typed Programming

Authors: Eric Tanter, Joseph Eremondi, Ronald Garcia et al.
Year: 2022Source: Proceedings of the ACM on Programming Languages-PACMPL
Gradual dependent types can help with the incremental adoption of dependently typed code by providing a principled semantics for imprecise types and proofs, where some parts have been omitted. Current...

Faster compressed quadtrees

Authors: Gonzalo Navarro, Travis Gagie, Guillermo de Bernardo et al.
Year: 2022Source: Journal of Computer and System Sciences

Can political alignment reduce crime? Evidence from Chile

Authors: Carla Alberti, Diego Díaz Rioseco, Giancarlo Visconti
Year: 2022Source: Political Science Research and Methods
Abstract Research has shown that presidents tend to benefit local level copartisans when distributing resources, which can improve the provision of public goods, such as security. Considering that fea...

UMLS Heading Sequences in Spanish

Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
UMLS Heading Sequences in Spanish used to compute Word embeddings for the Spanish clinical language

Medical Journals in Spanish

Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Medical Journals in Spanish used to compute Word embeddings for the Spanish clinical language

Chilean waiting list corpus

Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
The chilean waiting list corpus used to compute Word embeddings for the Spanish clinical language

UMLS Heading Sequences in Spanish

Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
UMLS Heading Sequences in Spanish used to compute Word embeddings for the Spanish clinical language

Medical Journals in Spanish

Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Medical Journals in Spanish used to compute Word embeddings for the Spanish clinical language

Chilean waiting list corpus

Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
The chilean waiting list corpus used to compute Word embeddings for the Spanish clinical language

People are more engaged on Facebook as they get older, especially in politics: evidence from users in 46 countries

Authors: Sergio Toro, Juan Pablo Luna, Gabriel Vommaro et al.
Year: 2022Source: Journal of Quantitative Description Digital Media
A growing body of literature has noted an age pattern in the sharing of false news in social media, with older people sharing more often misinformation than younger users. In this article we supplemen...

Cross-Lingual and Cross-Domain Crisis Classification for Low-Resource Scenarios

Authors: Jorge Pérez, Bárbara Poblete, Hernán Sarmiento et al.
Year: 2022Source: arXiv (Cornell University)
Social media data has emerged as a useful source of timely information about real-world crisis events. One of the main tasks related to the use of social media for disaster management is the automatic...

Dynamic Data Structures for Timed Automata Acceptance

Authors: Michał Pilipczuk, Filip Mazowiecki, Gabriele Puppis et al.
Year: 2022Source: Algorithmica
We study a variant of the classical membership problem in automata theory, which consists of deciding whether a given input word is accepted by a given automaton. We do so through the lenses of parame...

A Reasonably Gradual Type Theory

Authors: Kenji Maillard, Meven Lennon-Bertrand, Nicolas Tabareau et al.
Year: 2022Source: HAL (Le Centre pour la Communication Scientifique Directe)
Gradualizing the Calculus of Inductive Constructions (CIC) involves dealing with subtle tensions between normalization, graduality, and conservativity with respect to CIC. Recently, GCIC has been prop...

Actitudes políticas y solicitudes de ayuda directa a los gobiernos locales en América Latina

Authors: Sergio Toro, Danytza González-Ceballos
Year: 2022Source: AMÉRICA LATINA HOY
Este artículo estudia la relación entre las ayudas directas de los gobiernos y autoridades locales con las actitudes políticas de la ciudadanía. Se analizan datos de la encuesta Barómetro de las ...

Interactive annotation of geometric ornamentation on painted pottery assisted by deep learning

Authors: Ivan Sipiran, Tobias Schreck, Reinhold Preiner et al.
Year: 2022Source: it - Information Technology
Abstract In Greek art, the phase from 900 to 700 BCE is referred to as the Geometric period due to the characteristically simple geometry-like ornamentations appearing on painted pottery surfaces duri...

Grammar Compression by Induced Suffix Sorting

Authors: Gonzalo Navarro, Simon Gog, Maurício Ayala-Rincón et al.
Year: 2022Source: ACM Journal of Experimental Algorithmics
A grammar compression algorithm, called GCIS, is introduced in this work. GCIS is based on the induced suffix sorting algorithm SAIS, presented by Nong et al. in 2009. The proposed solution builds on ...

Biomechanical comparison of a 3D-printed prosthetic foot with conventional feet in people with transtibial amputation: A prospective cohort study

Authors: Aidan Hogan, Ursula Trinler, Mathias Rehg et al.
Year: 2022Source: Prosthetics and Orthotics International
The method of 3D printing is increasingly gaining utilization in clinical applications and may support prosthetic fitting. The aim was to compare biomechanical outcomes of people with a transtibial am...

WIP: Exploring differences in student sense of belonging inside and outside the engineering classroom

Authors: Jorge Baier, Isabel Hilliger, Maria Javiera de los Rios et al.
Year: 2022Source: ASEE Annual Conference and Exposition, Conference Proceedings

Total mutational load and clinical features as predictors of the metastatic status in lung adenocarcinoma and squamous cell carcinoma patients

Authors: Gonzalo Navarro, Karen Oróstica, Álvaro Olivera‐Nappa et al.
Year: 2022Source: Journal of Translational Medicine
Abstract Background Recently, extensive cancer genomic studies have revealed mutational and clinical data of large cohorts of cancer patients. For example, the Pan-Lung Cancer 2016 dataset (part of Th...

Semantics and canonicalisation of SPARQL 1.1

Authors: Aidan Hogan, Jaime Salas
Year: 2022Source: Semantic Web

Changing Media Landscapes and Political Participation

Authors: Sebastián Valenzuela, Marcelo Santos
Year: 2022Source: Oxford University Press eBooks
Abstract This chapter discusses how a constantly changing media landscape affects political participation. After pointing out the affordances brought forward by digital media and communication technol...

Navigating planar topologies in near-optimal space and time

Authors: Gonzalo Navarro, José Fuentes‐Sepúlveda, Diego Seco
Year: 2022Source: Computational Geometry

Gradualizing the Calculus of Inductive Constructions

Authors: Eric Tanter, Kenji Maillard, Meven Lennon-Bertrand et al.
Year: 2022Source: ACM Transactions on Programming Languages and Systems
We investigate gradual variations on the Calculus of Inductive Construction (CIC) for swifter prototyping with imprecise types and terms. We observe, with a no-go theorem, a crucial trade-off between ...

Representing Paths in Graph Database Pattern Matching

Authors: Domagoj Vrgoč, Stijn Vansummeren, Wim Martens et al.
Year: 2022Source: arXiv (Cornell University)
Modern graph database query languages such as GQL, SQL/PGQ, and their academic predecessor G-Core promote paths to first-class citizens in the sense that paths that match regular path queries can be r...

Real-Time Heuristic Search with LTLf Goals

Authors: Jorge Baier, Jiame Middleton, Rodrigo Toro
Year: 2022Source: IJCAI International Joint Conference on Artificial Intelligence

The structure of political conflict. The oligarchs and the bourgeoisie in the Chilean Congress, 1834–1894

Authors: Naim Bro
Year: 2022Source: Theory and Society

Focal Discrepancy Search for Learned Heuristics

Authors: Jorge Baier, Matias Greco, Pablo Araneda
Year: 2022Source: Proceedings of the International Symposium on Combinatorial Search
Machine learning allows learning accurate but inadmissible heuristics for hard combinatorial puzzles like the 15-puzzle, the 24-puzzle, and Rubik's cube. In this paper, we investigate how to exploit t...

Avoiding Errors in Learned Heuristics in Bounded-Suboptimal Search

Authors: Jorge Baier, Matias Greco
Year: 2022Source: Proceedings of the International Symposium on Combinatorial Search
Despite being very effective, learned heuristics in bounded-suboptimal search can produce heuristic plateaus or move the search to zones of the state space that do not lead to a solution. In addition,...

K-Focal Search for Slow Learned Heuristics (Extended Abstract)

Authors: Jorge Baier, Matias Greco, Jorge Toro et al.
Year: 2022Source: Proceedings of the International Symposium on Combinatorial Search
Learned heuristics, though inadmissible, can provide very good guidance for bounded-suboptimal search. Given a single search state s and a learned heuristic h, evaluating h(s) is typically very slow r...

Subset Approximation of Pareto Regions with Bi-Objective A* (Extended Abstract)

Authors: Jorge Baier, Nicolás Rivera, Carlos Hernández Ulloa
Year: 2022Source: Proceedings of the International Symposium on Combinatorial Search
In bi-objective search, we are given a graph in which each directed arc is associated with a pair of non-negative weights, and the objective is to find the Pareto-optimal solution set. Unfortunately, ...

Data from the paper "Learning to clusterize urban areas: two competitive approaches and an empirical validation"

Authors: Marcelo Mendoza, Sergio Toro, Hans Löbel et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Data for urban clustering used in the paper "Learning to clusterize urban areas: two competitive approaches and an empirical validation". We release two datasets for urban clustering based on data acq...

Data from the paper "Learning to clusterize urban areas: two competitive approaches and an empirical validation"

Authors: Marcelo Mendoza, Sergio Toro, Hans Löbel et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Data for urban clustering used in the paper "Learning to clusterize urban areas: two competitive approaches and an empirical validation". We release two datasets for urban clustering based on data acq...

Hierarchical Transformers for Group-Aware Sequential Recommendation: Application in MOBA Games

Authors: Denis Parra, Vladimir Araujo, Andrés Villa et al.
Year: 2022
In recent years, several recommendation systems have been introduced to improve the user experience of players in video games. In Multiplayer Online Battle Arena (MOBA) games, a popular game genre, th...

Reflections on a Legacy: Thoughts from Scholars about Agenda-Setting Past and Future

Authors: Sebastián Valenzuela, Maxwell McCombs, Лэй Гуо et al.
Year: 2022Source: Mass Communication & Society
In response to Perloff's (this issue) essay examining the development and future of agenda setting, a series of scholars offer their own reactions to the essay and the broader issues it raises.

Real-Time Heuristic Search with LTLf Goals

Authors: Jorge Baier, Rodrigo Toro Icarte, Jaime Middleton
Year: 2022Source: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence
In Real-Time Heuristic Search (RTHS) we are given a search graph G, a heuristic, and the objective is to find a path from a given start node to a given goal node in G. As such, one does not impose any...

On Computing Probabilistic Explanations for Decision Trees

Authors: Pablo Barceló, Marcelo Arenas, Bernardo Subercaseaux et al.
Year: 2022Source: arXiv (Cornell University)
Formal XAI (explainable AI) is a growing area that focuses on computing explanations with mathematical guarantees for the decisions made by ML models. Inside formal XAI, one of the most studied cases ...

Replication Data for: Corruption and Political Knowledge Erosion. A Cautionary Tale from Latin America

Authors: Sebastián Valenzuela, Matías Bargsted, Ingrid Bachmann
Year: 2022Source: Harvard Dataverse
This study employs data from the two-wave face-to-face panel survey conducted by the authors of this study. The survey employed a probability-based sample representative of all adults (18 years or old...

Subset Approximation of Pareto Regions with Bi-objective A

Authors: Jorge Baier, Nicolás Rivera, Carlos Hernández
Year: 2022Source: Proceedings of the AAAI Conference on Artificial Intelligence
In bi-objective search, we are given a graph in which each directed arc is associated with a pair of non-negative weights, and the objective is to find the Pareto-optimal solution set. Unfortunately, ...

Replication Data for Local Government, Social Media and Management of COVID-19: The Case of Chilean Mayoral Communication

Authors: Sergio Toro, Sebastián Valenzuela, Juan Pablo Luna et al.
Year: 2022Source: Harvard Dataverse
Code and data of the analysis and result of the paper

Bots don’t Vote, but They Surely Bother!

Authors: Ricardo Baeza-Yates, Eduardo Graells-Garrido, Ricardo Baeza‐Yates
Year: 2022
Comunicació presentada a 14th ACM Web Science Conference 2022 (WebSci '22), celebrat del 26 al 29 de juny de 2022 a Barcelona, Espanya.

PromoterLCNN: A Light CNN-Based Promoter Prediction and Classification Model

Authors: Daryl Hernández, Dary Hernández, Nicolás Jara et al.
Year: 2022Source: Genes
Promoter identification is a fundamental step in understanding bacterial gene regulation mechanisms. However, accurate and fast classification of bacterial promoters continues to be challenging. New m...

Word embeddings for the Spanish clinical language

Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Word embeddings for the Spanish clinical language Corpora used to compute the embeddings: Chilean waiting list corpus - https://zenodo.org/record/7072314 Medical Journal in Spanish - https://zenodo.or...

Word embeddings for the Spanish clinical language

Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Word embeddings for the Spanish clinical language Corpora used to compute the embeddings: Chilean waiting list corpus - https://zenodo.org/record/7072314 Medical Journal in Spanish - https://zenodo.or...

Efficient Enumeration for Annotated Grammars

Authors: Cristian Riveros, Martín Muñoz, Antoine Amarilli et al.
Year: 2022
International audience

Graph Pattern Matching in GQL and SQL/PGQ

Authors: Leonid Libkin, Wim Martens, Petra Selmer et al.
Year: 2022Source: Proceedings of the 2022 International Conference on Management of Data
International audience

A Reasonably Gradual Type Theory – Artifact

Authors: Eric Tanter, Meven Lennon-Bertrand, Kenji Maillard et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Accompanying artifact to the article <em>A Reasonably Gradual Type Theory.</em> It consists of two parts: - a Coq formalization of the model described in the article, - a proof of concept using rewrit...

A Reasonably Gradual Type Theory – Artifact

Authors: Eric Tanter, Meven Lennon-Bertrand, Kenji Maillard et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Accompanying artifact to the article <em>A Reasonably Gradual Type Theory.</em> It consists of two parts: - a Coq formalization of the model described in the article, - a proof of concept using rewrit...

Improving matrix-vector multiplication via lossless grammar-compressed matrices

Authors: Gonzalo Navarro, Travis Gagie, Dominik Köppl et al.
Year: 2022Source: Proceedings of the VLDB Endowment
As nowadays Machine Learning (ML) techniques are generating huge data collections, the problem of how to efficiently engineer their storage and operations is becoming of paramount importance. In this ...

The Next Generation Virgo Cluster Survey. XXXIII. Stellar Population Gradients in the Virgo Cluster Core Globular Cluster System

Authors: Susana Eyheramendy, Andrés Jordán, Laura Ferrarese et al.
Year: 2022Source: The Astrophysical Journal
Abstract We present a study of the stellar populations of globular clusters (GCs) in the Virgo Cluster core with a homogeneous spectroscopic catalog of 692 GCs within a major-axis distance R maj = 840...

Corruption and Political Knowledge Erosion. A Cautionary Tale from Latin America

Authors: Sebastián Valenzuela, Matías Bargsted, Ingrid Bachmann
Year: 2022Source: International Journal of Public Opinion Research
Abstract Previous research has shown that corruption diminishes citizens’ level of political support and engagement. We extend this line of reasoning and evaluate whether previous levels of perceive...

Identifying and Characterizing New Expressions of Community Framing during Polarization

Authors: Bárbara Poblete, Felipe Bravo-Márquez, Eduardo Graells-Garrido et al.
Year: 2022Source: Proceedings of the International AAAI Conference on Web and Social Media
Chile experienced a series of important protests between October and December 2019. This social unrest, as it was called, was fueled by social inequity and radically affected the nation's status quo. ...

Reevaluando los diseños institucionales: El efecto del presidencialismo sobre la corrupción

Authors: Sergio Toro
Year: 2022Source: Revista Chilena de Derecho y Ciencia Política
La presente nota de investigación analiza la incidencia de las va-riables institucionales sobre el Índice de Percepción de la Corrupción (CPI). Utilizando la metodología de paressobre una base de...

Technical Perspective-No PANE, No Gain: Scaling Attributed Network Embedding in a Single Server

Authors: Aidan Hogan
Year: 2022Source: ACM SIGMOD Record
The machine learning community has traditionally been proactive in developing techniques for diverse types of data, such as text, audio, images, videos, time series, and, of course, matrices, tensors,...

Knowledge graphs

Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: ACM Computing Surveys
Tracking the historical events that lead to the interweaving of data and knowledge.

Multilayer graphs

Authors: Aidan Hogan, Renzo Angles, Domagoj Vrgoč et al.
Year: 2022
In this short position paper, we argue that there is a need for a unifying data model that can support popular graph formats such as RDF, RDF* and property graphs, while at the same time being powerfu...

Technical perspective: The compression power of the BWT

Authors: Gonzalo Navarro
Year: 2022Source: Communications of the ACM
No abstract available.

Educational Tools for Mapuzugun

Authors: Claudio Gutiérrez, Antonios Anastasopoulos, Cristian Ahumada
Year: 2022Source: arXiv (Cornell University)
Mapuzugun is the language of the Mapuche people. Due to political and historical reasons, its number of speakers has decreased and the language has been excluded from the educational system in Chile a...

Slicing of Probabilistic Programs based on Specifications

Authors: Marcelo Navarro, Federico Olmedo
Year: 2022Source: Science of Computer Programming

Probabilistic Automata of Bounded Ambiguity

Authors: Cristian Riveros, Nathanaël Fijalkow, James Worrell
Year: 2022Source: HAL (Le Centre pour la Communication Scientifique Directe)
Probabilistic automata are an extension of nondeterministic finite automata in which transitions are annotated with probabilities. Despite its simplicity, this model is very expressive and many of the...

Plausible sealing for gradual parametricity

Authors: Elizabeth Labrada, Matías Toro, Eric Tanter et al.
Year: 2022Source: Proceedings of the ACM on Programming Languages-PACMPL
Graduality and parametricity have proven to be extremely challenging notions to bring together. Intuitively, enforcing parametricity gradually requires possibly sealing values in order to detect viola...

LSCDiscovery: A shared task on semantic change discovery and detection in Spanish

Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2022Source: arXiv (Cornell University)
We present the first shared task on semantic change discovery and detection in Spanish and create the first dataset of Spanish words manually annotated for semantic change using the DURel framework (S...

CORE: a Complex Event Recognition Engine

Authors: Marco Bucchi, Alejandro Grez, Andrés Quintana et al.
Year: 2022Source: VLDB 2022
Complex Event Recognition (CER) systems are a prominent technology for finding user-defined query patterns over large data streams in real time. CER query evaluation is known to be computationally cha...

Towards Effective Blended Learning Through the Eyes of Students: A Survey Study in Transition into Face-to-Face Education

Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo et al.
Year: 2022Source: Lecture notes in computer science

Slicing of Probabilistic Programs based on Specifications

Authors: Federico Olmedo, Marcelo Navarro
Year: 2022Source: arXiv (Cornell University)
This paper presents the first slicing approach for probabilistic programs based on specifications. We show that when probabilistic programs are accompanied by their specifications in the form of pre- ...

Squeeze: Efficient compact fractals for tensor core GPUs

Authors: Benjamín Bustos, Felipe A. Quezada, Cristóbal A. Navarro et al.
Year: 2022Source: Future Generation Computer Systems

Propositional Equality for Gradual Dependently Typed Programming

Authors: Eric Tanter, Joseph Eremondi, Ronald G. García
Year: 2022Source: arXiv (Cornell University)
Gradual dependent types can help with the incremental adoption of dependently typed code by providing a principled semantics for imprecise types and proofs, where some parts have been omitted. Current...

Time- and Space-Efficient Regular Path Queries

Authors: Aidan Hogan, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2022Source: 2022 IEEE 38th International Conference on Data Engineering (ICDE)
We introduce a time- and space-efficient technique to solve regular path queries over labeled (RDF) graphs. We combine a bit-parallel simulation of the Glushkov automaton of the regular expression wit...

Exploration of Knowledge Graphs via Online Aggregation

Authors: Aidan Hogan, Benny Kimelfeld, Oren Kalinsky et al.
Year: 2022Source: 2022 IEEE 38th International Conference on Data Engineering (ICDE)
Exploration systems over large-scale RDF knowl-edge graphs often rely on aggregate count queries to indicate how many results the user can expect for the possible next steps of exploration. Such syste...

Temporal Regular Path Queries

Authors: Marcelo Arenas, Julia Stoyanovich, Pedro Bahamondes et al.
Year: 2022Source: 2022 IEEE 38th International Conference on Data Engineering (ICDE)
In the last decade, substantial progress has been made towards standardizing the syntax of graph query languages, and towards understanding their semantics and complexity of evaluation. In this paper,...

12th Temporal Web Analytics Workshop (TempWeb) Overview

Authors: Ricardo Baeza-Yates, Marc Spaniol, Ómar Alonso
Year: 2022Source: Companion Proceedings of the The Web Conference 2018
TempWeb focuses on investigating infrastructures, scalable methods, and innovative software for aggregating, querying, and analyzing heterogeneous data at Web scale. Emphasis is given to data analysis...

Evaluating regular path queries under the all-shortest paths semantics

Authors: Domagoj Vrgoč
Year: 2022Source: arXiv (Cornell University)
The purpose of this report is to explain how the textbook breadth-first search algorithm (BFS) can be modified in order to also create a compact representation of all shortest paths connecting a singl...

ALBETO and DistilBETO: Lightweight Spanish Language Models

Authors: Felipe Bravo-Márquez, Andrés Carvallo, Vladimir Araujo et al.
Year: 2022Source: arXiv (Cornell University)
In recent years there have been considerable advances in pre-trained language models, where non-English language versions have also been made available. Due to their increasing use, many lightweight v...

Correction to: Graph Compression for Adjacency-Matrix Multiplication

Authors: Gonzalo Navarro, Travis Gagie, Dominik Köppl et al.
Year: 2022Source: SN Computer Science

Evaluation Benchmarks for Spanish Sentence Representations

Authors: Marcelo Mendoza, Felipe Bravo-Márquez, Álvaro Soto et al.
Year: 2022Source: arXiv (Cornell University)
Due to the success of pre-trained language models, versions of languages other than English have been released in recent years. This fact implies the need for resources to evaluate these models. In th...

Gobernanza Criminal y la Crisis de los Estados Latinoamericanos Contemporáneos

Authors: Juan Pablo Luna, Andreas Feldmann
Year: 2022Source: Annual Review of Sociology
Crecientemente las sociedades latinoamericanas enfrentan el surgimiento de nuevos órdenes en que los funcionarios estatales y las autoridades políticas comparten el poder con organizaciones criminal...

Efficient Construction of the BWT for Repetitive Text Using String Compression

Authors: Gonzalo Navarro, Diego Díaz-Domínguez
Year: 2022Source: arXiv (Cornell University)
We present a new semi-external algorithm that builds the Burrows--Wheeler transform variant of Bauer et al. (a.k.a., BCR BWT) in linear expected time. Our method uses compression techniques to reduce ...

Expressiveness and Approximation Properties of Graph Neural Networks

Authors: Juan Reutter, Floris Geerts, Juan L. Reutter
Year: 2022Source: arXiv (Cornell University)
Characterizing the separation power of graph neural networks (GNNs) provides an understanding of their limitations for graph learning tasks. Results regarding separation power are, however, usually ge...

DWUG ES: Diachronic Word Usage Graphs for Spanish

Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
This data collection contains diachronic Word Usage Graphs (WUGs) for Spanish. Find a description of the data format, code to process the data and further datasets on the WUGsite. Please find more inf...

DWUG ES: Diachronic Word Usage Graphs for Spanish

Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
This data collection contains diachronic Word Usage Graphs (WUGs) for Spanish. Find a description of the data format, code to process the data and further datasets on the WUGsite. Please find more inf...

DWUG ES: Diachronic Word Usage Graphs for Spanish

Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
This data collection contains diachronic Word Usage Graphs (WUGs) for Spanish. Find a description of the data format, code to process the data and further datasets on the WUGsite. Please find more inf...

DWUG ES: Diachronic Word Usage Graphs for Spanish

Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
This data collection contains diachronic Word Usage Graphs (WUGs) for Spanish. Find a description of the data format, code to process the data and further datasets on the WUGsite. Please find more inf...

DWUG ES: Diachronic Word Usage Graphs for Spanish

Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
This data collection contains diachronic Word Usage Graphs (WUGs) for Spanish. Find a description of the data format, code to process the data and further datasets on the WUGsite. Please find more inf...

Automatic Extraction of Nested Entities in Clinical Referrals in Spanish

Authors: Felipe Bravo-Márquez, Jocelyn Dunstan, Fabián Villena et al.
Year: 2022Source: ACM Transactions on Computing for Healthcare
Here we describe a new clinical corpus rich in nested entities and a series of neural models to identify them. The corpus comprises de-identified referrals from the waiting list in Chilean public hosp...

Criminal Governance and the Crisis of Contemporary Latin American States

Authors: Juan Pablo Luna, Andreas Feldmann
Year: 2022Source: Annual Review of Sociology
Across Latin America, societies are confronting the rise of novel orders in which state officials and political authorities share power with criminal organizations. Criminal governance (i.e., the crea...

A Novel First-Order Autoregressive Moving Average Model to Analyze Discrete-Time Series Irregularly Observed

Authors: Susana Eyheramendy, Wilfredo Palma, César Ojeda et al.
Year: 2022Source: arXiv (Cornell University)
A novel first-order autoregressive moving average model for analyzing discrete-time series observed at irregularly spaced times is introduced. Under Gaussianity, it is established that the model is st...

Social Media and Belief in Misinformation in Mexico: A Case of Maximal Panic, Minimal Effects?

Authors: Sebastián Valenzuela, Marcelo Santos, Carlos Múñiz
Year: 2022Source: The International Journal of Press/Politics
Contrary to popular narratives, it is not clear whether using social media for news increases belief in political misinformation. Several of the most methodologically sound studies find small to nonex...

Similarity-Based Explanations meet Matrix Factorization via Structure-Preserving Embeddings

Authors: Denis Parra, Leandro Balby Marinho, Rodrygo L. T. Santos et al.
Year: 2022
Embeddings are core components of modern model-based Collaborative Filtering (CF) methods, such as Matrix Factorization (MF) and Deep Learning variations. In essence, embeddings are mappings of the or...

Graph Compression for Adjacency-Matrix Multiplication

Authors: Gonzalo Navarro, Travis Gagie, Dominik Köppl et al.
Year: 2022Source: SN Computer Science
Abstract Computing the product of the (binary) adjacency matrix of a large graph with a real-valued vector is an important operation that lies at the heart of various graph analysis tasks, such as com...

DWUG ES: Diachronic Word Usage Graphs for Spanish

Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
This data collection contains diachronic Word Usage Graphs (WUGs) for Spanish. Find a description of the data format, code to process the data and further datasets on the WUGsite. Please find more inf...

HOLZ: High-Order Entropy Encoding of Lempel-Ziv Factor Distances

Authors: Gonzalo Navarro, Dominik Köppl, Nicola Prezza
Year: 2022
We propose a new representation of the offsets of the Lempel-Ziv (LZ) factorization based on the co-lexicographic order of the text's prefixes. The selected offsets tend to approach the k-th order emp...

Answer-Set Programs for Reasoning about Counterfactual Interventions and Responsibility Scores for Classification

Authors: Leopoldo Bertossi, Gabriela Reyes, Gabriela de los Ángeles Díaz Reyes
Year: 2022Source: Inductive Logic Programming (ILP 2021)

Optimal Joins Using Compressed Quadtrees

Authors: Juan Reutter, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2022Source: ACM Transactions on Database Systems
Worst-case optimal join algorithms have gained a lot of attention in the database literature. We now count several algorithms that are optimal in the worst case, and many of them have been implemented...

Language Modeling on Location-Based Social Networks

Authors: Bárbara Poblete, Felipe Bravo-Márquez, Juglar Diaz
Year: 2022Source: ISPRS International Journal of Geo-Information
The popularity of mobile devices with GPS capabilities, along with the worldwide adoption of social media, have created a rich source of text data combined with spatio-temporal information. Text data ...

Ethical Challenges in AI

Authors: Ricardo Baeza-Yates, Ricardo Baeza‐Yates
Year: 2022Source: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining
In the first part we address four current specific challenges through examples: (1) discrimination (e.g., facial recognition, justice, sharing economy, language models); (2) stupid models (e.g., lack ...

Natural Language Processing Of Helpline Chat Data Before And During The Pandemic Revealed Significant Decrease In Self-image Appreciation And Changes In Other Traits

Authors: Susana Eyheramendy, Fernanda Barriga, María P. Raveau et al.
Year: 2022Source: Preprints.org
During the last two years the COVID-19 pandemic has affected the world population in several ways. An important increase in mental health problems is a consequence of this pandemic that is ubiquitous ...

A Survey on Deep Learning and Explainability for Automatic Report Generation from Medical Images

Authors: Denis Parra, Pablo Messina, Álvaro Soto et al.
Year: 2022Source: ACM Computing Surveys
Every year physicians face an increasing demand of image-based diagnosis from patients, a problem that can be addressed with recent artificial intelligence methods. In this context, we survey works in...

For better and for worse: A panel survey of how mobile-only and hybrid Internet use affects digital skills over time

Authors: Sebastián Valenzuela, Teresa Correa, Isabel Pavez
Year: 2022Source: New Media & Society
Public policies across the world are tackling Internet access inequality through mobile connections, which has led to an increase in mobile-only use. However, digital skills remain as a stumbling bloc...

Spanish SciELO Crawled Biomedical Corpus

Authors: Jocelyn Dunstan, Fabián Villena, Carolina Chiu
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
We present a corpus of Spanish medical articles extracted from the SciELO website (https://scielo.cl/). The corpus was constructed using web scraping extraction techniques and consists of 5694 article...

Spanish SciELO Crawled Biomedical Corpus

Authors: Jocelyn Dunstan, Fabián Villena, Carolina Chiu
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
We present a corpus of Spanish medical articles extracted from the SciELO website (https://scielo.cl/). The corpus was constructed using web scraping extraction techniques and consists of 5694 article...

Semantics and canonicalisation of SPARQL 1.1

Authors: Aidan Hogan, Jaime Salas
Year: 2022Source: Semantic Web
We define a procedure for canonicalising SPARQL 1.1 queries. Specifically, given two input queries that return the same solutions modulo variable names over any RDF graph (which we call congruent quer...

Cultural, scientific and technical antecedents of the Cybersyn project in Chile

Authors: Claudio Gutiérrez, Juan David Ortega-Alvarez
Year: 2022Source: AI & Society

Morbimortality assessment in abdominal surgery: are we predicting or overreacting?

Authors: Marco Vanegas, Laura Niño Torres, Felipe Girón et al.
Year: 2022Source: BMC Surgery
Abstract Background High-risk surgical procedures represent a fundamental part of general surgery practice due to its significant rates of morbidity and mortality. Different predictive tools have been...

A comprehensive review of the video-to-text problem

Authors: Jorge Pérez, Benjamín Bustos, Ivan Sipiran et al.
Year: 2022Source: Artificial Intelligence Review

CLNews19-20: A new dataset for rumor detection in Spanish

Authors: Marcelo Mendoza, Eliana Providel, Daniel Toro-González et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
We create CLNews, a dataset for rumor detection in Spanish. Based on fact-checking agencies' data, we mapped related tweets to verify news into four categories: non-rumor, true rumor, false rumor, and...

CLNews19-20: A new dataset for rumor detection in Spanish

Authors: Marcelo Mendoza, Eliana Providel, Daniel Toro-González et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
We create CLNews, a dataset for rumor detection in Spanish. Based on fact-checking agencies' data, we mapped related tweets to verify news into four categories: non-rumor, true rumor, false rumor, and...

Efficient and compact representations of some non-canonical prefix-free codes

Authors: Gonzalo Navarro, Travis Gagie, Antonio Fariña et al.
Year: 2022Source: Theoretical Computer Science

Reactive and Asymmetric Communication Flows: Social Media Discourse and Partisan News Framing in the Wake of Mass Shootings

Authors: Sebastián Valenzuela, Dhavan V. Shah, Jon Pevehouse et al.
Year: 2022Source: The International Journal of Press/Politics
Marked by both deep interconnectedness and polarization, the contemporary media system in the United States features news outlets and social media that are bound together, yet deeply divided along par...

Universal coding and prediction on ergodic random points

Authors: Lukasz Debowski, Tomasz Steifer
Year: 2022Source: Bulletin of Symbolic Logic

Efficient Enumeration Algorithms for Annotated Grammars

Authors: Antoine Amarilli, Cristian Riveros, Martı́n Muñoz et al.
Year: 2022Source: arXiv (Cornell University)
We introduce annotated grammars, an extension of context-free grammars which allows annotations on terminals. Our model extends the standard notion of regular spanners, and is more expressive than the...

A Universal Screening Tool for Dyslexia by a Web-Game and Machine Learning

Authors: Ricardo Baeza-Yates, Luz Rello, Maria Rauschenberger et al.
Year: 2022Source: Frontiers in Computer Science
Children with dyslexia have difficulties learning how to read and write. They are often diagnosed after they fail school even if dyslexia is not related to general intelligence. Early screening of dys...

Knowledge-based programs as building blocks for planning

Authors: Jorge Baier, Sheila A. McIlraith
Year: 2022Source: Artificial Intelligence

Score-Based Explanations in Data Management and Machine Learning: An Answer-Set Programming Approach to Counterfactual Analysis

Authors: Leopoldo Bertossi
Year: 2022Source: Reasoning Web. Declarative Artificial Intelligence. Reasoning Web 2021

3 Overview of Talks 3.1 SHAP Explanations with Booleans Circuit Classifiers

Authors: Leopoldo Bertossi
Year: 2022Source: Dagstuhl Reports

Deductive Knowledge

Authors: Sabrina Kirrane, Axel-Cyrille Ngonga Ngomo, Axel Polleres et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
As humans, we can deduce more from the data graph of Figure 2.1 than what the edges explicitly indicate. We may deduce, for example, that the $${\rm{\tilde N}}$$ am festival ((eidis)) will be located ...

Schema, Identity, and Context

Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
In this chapter we describe extensions of the data graph–relating to schema, identity, and context–that provide additional structures for accumulating knowledge. Henceforth, we refer to a data gra...

Creation and Enrichment

Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
In this chapter, we discuss the principal techniques by which knowledge graphs can be created and subsequently enriched from diverse sources of legacy data that range from plain text to structured for...

Inductive Knowledge

Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
While deductive knowledge is characterized by precise logical consequences, inductively acquiring knowledge involves generalizing patterns from a given set of input observations, which can then be use...

Refinement

Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
Beyond assessing the quality of a knowledge graph, there exist techniques to refine the knowledge graph, in particular to (semi-)automatically complete and correct the knowledge graph [Paul-heim, 2017...

DockerPedia: A Knowledge Graph of Software Images and Their Metadata

Authors: Carlos Buil-Aranda, Daniel Garijo, Maximiliano Osorio et al.
Year: 2022Source: International Journal of Software Engineering and Knowledge Engineering
An increasing amount of researchers use software images to capture the requirements and code dependencies needed to carry out computational experiments. Software images preserve the computational envi...

Knowledge Graph Compression for Big Semantic Data

Authors: Claudio Gutiérrez, Miguel A. Martínez‐Prieto, Javier D. Fernández et al.
Year: 2022Source: Encyclopedia of Big Data Technologies

Educational Tools for Mapuzugun

Authors: Claudio Gutiérrez, Antonios Anastasopoulos, Cristian Ahumada
Year: 2022
Mapuzugun is the language of the Mapuche people. Due to political and historical reasons, its number of speakers has decreased and the language has been excluded from the educational system in Chile a...

Knowledge Graphs

Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
This book provides a comprehensive and accessible introduction to knowledge graphs, which have recently garnered notable attention from both industry and academia. Knowledge graphs are founded on the

WDBench: A Wikidata Graph Query Benchmark

Authors: Aidan Hogan, Carlos Buil-Aranda, Renzo Angles et al.
Year: 2022Source: Lecture notes in computer science

The Semantic Web – ISWC 2022

Authors: Aidan Hogan, Claudia d’Amato, Giuseppe Pirró et al.
Year: 2022Source: Lecture notes in computer science
The ISWC 2022 proceedings details advances in research, technology, and applications of the semantic web, linked data, and knowledge graphs on the web.

truthy_direct_properties.nt.bz2

Authors: Aidan Hogan, Renzo Angles, Domagoj Vrgoč et al.
Year: 2022Source: Figshare
Wikidata truthy direct properties

Quality Assessment

Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
Independent of the (kinds of) source(s) from which a knowledge graph is created, the resulting initial knowledge graph will usually be incomplete, and will often contain duplicate, contradictory or ev...

Squeeze: Efficient Compact Fractals for Tensor Core Gpus

Authors: Benjamín Bustos, Felipe A. Quezada, Cristóbal A. Navarro et al.
Year: 2022Source: SSRN Electronic Journal
This work presents Squeeze, an efficient compact fractal processing scheme for tensor core GPUs. By combining discrete-space transformations between compact and expanded forms, one can do data-paralle...

A Scalable and Energy Efficient GPU Thread Map for m-Simplex Domains

Authors: Benjamín Bustos, Felipe A. Quezada, Cristóbal A. Navarro et al.
Year: 2022Source: arXiv (Cornell University)
This work proposes a new GPU thread map for $m$-simplex domains, that scales its speedup with dimension and is energy efficient compared to other state of the art approaches. The main contributions of...

Squeeze: Efficient Compact Fractals for Tensor Core GPUs

Authors: Benjamín Bustos, Felipe A. Quezada, Cristóbal A. Navarro et al.
Year: 2022Source: arXiv (Cornell University)
This work presents Squeeze, an efficient compact fractal processing scheme for tensor core GPUs. By combining discrete-space transformations between compact and expanded forms, one can do data-paralle...

Resources for Multilingual Hate Speech Detection

Authors: Jorge Pérez, Bárbara Poblete, Magdalena Saldaña et al.
Year: 2022
Most of the published approaches and resources for hate speech detection are tailored for the English language. In consequence, cross-lingual and cross-cultural perspectives lack some essential resour...

Knowledge Graphs in Practice

Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
In this chapter, we discuss some of the most prominent knowledge graphs that have emerged in the past years. We begin by discussing open knowledge graphs, most of which have been published on the Web ...

Conclusions

Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
We have provided a comprehensive introduction to knowledge graphs, which have been receiving more and more attention in recent years. Under the definition of a knowledge graph as a graph ofdata intend...

Data Graphs

Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
At the foundation of any knowledge graph is the principle of first applying a graph abstraction to data, resulting in an initial data graph. We now discuss a selection of graph-structured data models ...

Introduction

Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
Though the phrase "knowledge graph" has been used in the literature since at least 1972 [Schneider, 1973], the modern incarnation of the phrase stems from the 2012 announcement of the Google Knowledge...

Publication

Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
While it may not always be desirable to publish knowledge graphs (for example, those that offer a competitive advantage to a company [Noy et al., 2019]), it maybe desirable or even required to publish...

LSCDiscovery: A shared task on semantic change discovery and detection in Spanish

Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2022
We present the first shared task on semantic change discovery and detection in Spanish and create the first dataset of Spanish words manually annotated for semantic change using the DURel framework Th...

Balancing Run-Length Straight-Line Programs*

Authors: Gonzalo Navarro, Francisco Javier Vidal Olivares, C. Urbina
Year: 2022Source: arXiv (Cornell University)
It was recently proved that any SLP generating a given string $w$ can be transformed in linear time into an equivalent balanced SLP of the same asymptotic size. We show that this result also holds for...

Computing MEMs and Relatives on Repetitive Text Collections

Authors: Gonzalo Navarro
Year: 2022Source: arXiv (Cornell University)
We consider the problem of computing the Maximal Exact Matches (MEMs) of a given pattern $P[1 .. m]$ on a large repetitive text collection $T[1 .. n]$, which is represented as a (hopefully much smalle...

Near-Optimal Search Time in $δ$-Optimal Space, and Vice Versa

Authors: Tomasz Kociumaka, Francisco Javier Vidal Olivares, Gonzalo Navarro
Year: 2022Source: arXiv (Cornell University)
Two recent lower bounds on the compressibility of repetitive sequences, $\delta \le \gamma$, have received much attention. It has been shown that a length-$n$ string $S$ over an alphabet of size $\sig...

A Critical Analysis Of Nlp and Clinical Correctness Metrics to Measure Progress on X-Ray Report Generation

Authors: Denis Parra, Jocelyn Dunstan, Cecilia Besa et al.
Year: 2022Source: SSRN Electronic Journal
Background: Radiologists face an increasing demand for image-based diagnosis from patients every year,and computer-aided diagnosis systems seem like a promising way to alleviate their workload. Many a...

Near-Optimal Search Time in $$\delta $$-Optimal Space

Authors: Gonzalo Navarro, Tomasz Kociumaka, Francisco Javier Vidal Olivares
Year: 2022Source: Lecture notes in computer science

Balancing Run-Length Straight-Line Programs

Authors: Gonzalo Navarro, Francisco Javier Vidal Olivares, C. Urbina
Year: 2022Source: Lecture notes in computer science

Replication Data for: Corruption and Political Knowledge Erosion. A Cautionary Tale from Latin America

Authors: Sebastián Valenzuela, Matías Bargsted, Ingrid Bachmann
Year: 2022Source: Harvard Dataverse
This study employs a two-wave face-to-face panel survey data conducted by the authors of this study in Santiago, Chile. The survey employed a probability-based sample and is representative of all adul...

An Empirical Evaluation of k-Means Coresets

Authors: Gonzalo Navarro, Eva Rotenberg, Grzegorz Herman et al.
Year: 2022Source: Research Portal Denmark
Coresets are among the most popular paradigms for summarizing data. In particular, there exist many high performance coresets for clustering problems such as k-means in both theory and practice. Curio...

A Local Search Algorithm for Large Maximum Weight Independent Set Problems

Authors: Gonzalo Navarro, Nikos Parotsidis, Yuanyuan Dong et al.
Year: 2022Source: Research Portal Denmark
Motivated by a real-world vehicle routing application, we consider the maximum-weight independent set problem: Given a node-weighted graph, find a set of independent (mutually nonadjacent) nodes whose...

Streaming Enumeration on Nested Documents

Authors: Cristian Riveros, Martı́n Muñoz
Year: 2022Source: arXiv (Cornell University)
Some of the most relevant document schemas used online, such as XML and JSON, have a nested format. In the last decade, the task of extracting data from nested documents over streams has become especi...

DockerPedia: A Knowledge Graph of Software Images and Their Metadata

Authors: Maximiliano Osorio, Carlos Buil-Aranda, Idafen Santana-Perez et al.
Year: 2022Source: INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING

Empirical Evaluation of Machine Learning Ensembles for Rumor Detection

Authors: Eliana Providel, Andrés Zapata, Marcelo Mendoza
Year: 2022Source: Lecture notes in computer science

An Algebra for Path Manipulation in Graph Databases

Authors: Renzo Angles, Roberto García
Year: 2022Source: Lecture notes in computer science

Lenguajes y modelos subyacentes a los grafos de conocimiento

Authors: Renzo Angles
Year: 2022Source: Actas del Congreso Internacional de Ingeniería de Sistemas
Un grafo de conocimiento es una gran base de datos que integra información desde distintas fuentes de datos, con el objetivo de poder extraer conocimiento y transformarlo en valor para los usuarios. ...

Bots don't Vote, but They Surely Bother! A Study of Anomalous Accounts in a National Referendum

Authors: Ricardo Baeza-Yates, Eduardo Graells-Garrido
Year: 2022Source: arXiv (Cornell University)
The Web contains several social media platforms for discussion, exchange of ideas, and content publishing. These platforms are used by people, but also by distributed agents known as bots. Although bo...

Space-efficient conversions from SLPs

Authors: Gonzalo Navarro, Travis Gagie
Year: 2022Source: arXiv (Cornell University)
We give algorithms that, given a straight-line program (SLP) with $g$ rules that generates (only) a text $T [1..n]$, builds within $O(g)$ space the Lempel-Ziv (LZ) parse of $T$ (of $z$ phrases) in tim...

L-systems for Measuring Repetitiveness*

Authors: Gonzalo Navarro, C. Urbina
Year: 2022Source: arXiv (Cornell University)
An L-system (for lossless compression) is a CPD0L-system extended with two parameters $d$ and $n$, which determines unambiguously a string $w = \tau(\varphi^d(s))[1:n]$, where $\varphi$ is the morphis...

Compact data structures to represent spatial hierarchical structures

Authors: Gonzalo Navarro, Diego Seco, M. Andrea Rodríguez et al.
Year: 2022Source: Figshare
Implementation of three compact data structures to represent spatial hierarchical structures, with applications on the topological model.<br>Datasets are also included

Compact data structures to represent spatial hierarchical structures

Authors: Gonzalo Navarro, Diego Seco, M. Andrea Rodríguez et al.
Year: 2022Source: Figshare
Implementation of three compact data structures to represent spatial hierarchical structures, with applications on the topological model.<br>For replication of the experiments, please, check the file ...

Compact data structures to represent spatial hierarchical structures

Authors: Gonzalo Navarro, Diego Seco, M. Andrea Rodríguez et al.
Year: 2022Source: Figshare
Implementation of three compact data structures to represent spatial hierarchical structures, with applications on the topological model.<br>For replication of the experiments, please, check the file ...

Clinical Flair: A Pre-Trained Language Model for Spanish Clinical Natural Language Processing

Authors: Jocelyn Dunstan, Fabián Villena, Matías Rojas
Year: 2022
Word embeddings have been widely used in Natural Language Processing (NLP) tasks. Although these representations can capture the semantic information of words, they cannot learn the sequence-level sem...

Divide and Conquer: An Extreme Multi-Label Classification Approach for Coding Diseases and Procedures in Spanish

Authors: Jocelyn Dunstan, Andrés Abeliuk, Matías Rojas et al.
Year: 2022
Clinical coding is the task of transforming medical documents into structured codes following a standard ontology. Since these terminologies are composed of hundreds of codes, this problem can be cons...

A Knowledge-Graph-Based Intrinsic Test for Benchmarking Medical Concept Embeddings and Pretrained Language Models

Authors: Jocelyn Dunstan, Fabián Villena, Matías Rojas et al.
Year: 2022
Using language models created from large data sources has improved the performance of several deep learning-based architectures, obtaining state-of-the-art results in several NLP extrinsic tasks. Howe...

Assessing the Limits of Straightforward Models for Nested Named Entity Recognition in Spanish Clinical Narratives

Authors: Jocelyn Dunstan, Matías Rojas, Aitor Gonzalez‐Agirre et al.
Year: 2022
Nested Named Entity Recognition (NER) is an information extraction task that aims to identify entities that may be nested within other entity mentions. Despite the availability of several corpora with...

The Impact of United States Engagement with Chile: 2000-2020

Authors: Juan Pablo Luna, Bruna Fonseca de Barros
Year: 2022Source: SSRN Electronic Journal
This study examines the diversity, scale, and impacts of efforts undertaken by the U.S. government and civil society to boost prosperity in Chile. It provides quantitative assessments of resource flow...

Improving Matrix-vector Multiplication via Lossless Grammar-Compressed Matrices

Authors: Gonzalo Navarro, Travis Gagie, Dominik Köppl et al.
Year: 2022Source: arXiv (Cornell University)
As nowadays Machine Learning (ML) techniques are generating huge data collections, the problem of how to efficiently engineer their storage and operations is becoming of paramount importance. In this ...

Scaling up ML-based Black-box Planning with Partial STRIPS Models

Authors: Jorge Baier, Matias Greco, Hector H. Palacios et al.
Year: 2022Source: arXiv (Cornell University)
A popular approach for sequential decision-making is to perform simulator-based search guided with Machine Learning (ML) methods like policy learning. On the other hand, model-relaxation heuristics ca...

Amplifying Counter-Public Spheres on Social Media: News Sharing of Alternative Versus Traditional Media After the 2019 Chilean Uprising

Authors: Sergio Toro, Sebastián Valenzuela, Juan Pablo Luna
Year: 2022Source: Social Media + Society
While much research exists on the role of digital media use in protest movements, few studies compare the long-term impact of protests on online use of alternative and mainstream digital media. This h...

Graph Path Navigation

Authors: Pablo Barceló, Marcelo Arenas, Leonid Libkin
Year: 2022Source: Encyclopedia of Big Data Technologies

Error loading publications: cURL error 28: Operation timed out after 30001 milliseconds with 0 bytes received

Publications for 2020

Displaying 324 publication(s) for 2020

Optimal-Time Dictionary-Compressed Indexes

Authors: Gonzalo Navarro, Nicola Prezza, Mikko Berggren Ettienne et al.
Year: 2020Source: ACM Transactions on Algorithms
We describe the first self-indexes able to count and locate pattern occurrences in optimal time within a space bounded by the size of the most popular dictionary compressors. To achieve this result, w...

Grammar-compressed indexes with logarithmic search time

Authors: Gonzalo Navarro, Francisco Claude, Alejandro Pacheco
Year: 2020Source: Journal of Computer and System Sciences

Cambio, despedida y bienvenida

Authors: Sebastián Valenzuela, Daniela Grassau
Year: 2020

Inspecting state of the art performance and NLP metrics in image-based medical report generation

Authors: Denis Parra, Cecilia Besa, Pablo Pino et al.
Year: 2020
Several deep learning architectures have been proposed over the last years to deal with the task of generating a written report given an imaging exam as input. Most works evaluate the generated report...

Neural language models for text classification in evidence-based medicine

Authors: Denis Parra, Andrés Carvallo, Juan Vasquez et al.
Year: 2020
COVID-19 has brought about a significant challenge to the whole of humanity, but mainly to the medical community. Clinicians must keep updated continuously about symptoms, diagnoses, and effectiveness...

The Future is Big Graphs! A Community View on Graph Processing Systems

Authors: Marcelo Arenas, Renzo Angles
Year: 2020

The Future is Big Graphs! A Community View on Graph Processing Systems

Authors: Petra Selmer, Semih Salihoğlu, Katja Hose et al.
Year: 2020Source: Repository for Publications and Research Data (ETH Zurich)
Graphs are by nature unifying abstractions that can leverage interconnectedness to represent, explore, predict, and explain real- and digital-world phenomena. Although real users and consumers of grap...

The Future is Big Graphs! A Community View on Graph Processing Systems

Authors: Marcelo Arenas, Renzo Angles, Juan Sequeda et al.
Year: 2020Source: Repository for Publications and Research Data (ETH Zurich)
Graphs are by nature unifying abstractions that can leverage interconnectedness to represent, explore, predict, and explain real- and digital-world phenomena. Although real users and consumers of grap...

Adaptive Community Search in Dynamic Networks

Authors: Ricardo Baeza-Yates, Francesco Bonchi, Ioanna Tsalouchidou
Year: 2020Source: 2021 IEEE International Conference on Big Data (Big Data)
Community search is a well-studied problem which, given a static graph and a query set of vertices, requires to find a cohesive (or dense) subgraph containing the query vertices. In this paper we stud...

The Expressive Power of Graph Neural Networks as a Query Language

Authors: Pablo Barceló, Jorge Pérez, Juan Reutter et al.
Year: 2020Source: ACM SIGMOD Record
In this paper we survey our recent results characterizing various graph neural network (GNN) architectures in terms of their ability to classify nodes over graphs, for classifiers based on unary logic...

On the Approximation Ratio of Ordered Parsings

Authors: Gonzalo Navarro, Nicola Prezza, Carlos Ochoa
Year: 2020Source: IEEE Transactions on Information Theory
Shannon's entropy is a clear lower bound for statistical compression. The situation is not so well understood for dictionary-based compression. A plausible lower bound is <inline-formula xmlns:mml="ht...

Learning to combine classifiers outputs with the transformer for text classification

Authors: Marcelo Mendoza, Margarita Bugueño
Year: 2020

Learning to combine classifiers outputs with the transformer for text classification

Authors: Marcelo Mendoza, Margarita Bugueño
Year: 2020Source: Intelligent Data Analysis
Text classification is a fairly explored task that has allowed dealing with a considerable amount of problems. However, one of its main difficulties is to conduct a learning process in data with class...

Predicting risk of dyslexia with an online gamified test

Authors: Ricardo Baeza-Yates, Luz Rello, Jeffrey P. Bigham et al.
Year: 2020Source: PLoS ONE
Dyslexia is a specific learning disorder related to school failure. Detection is both crucial and challenging, especially in languages with transparent orthographies, such as Spanish. To make detectin...

Algorithmic and HCI Aspects for Explaining Recommendations of Artistic Images

Authors: Pablo Messina, Vicente Domínguez, Denis Parra et al.
Year: 2020Source: ACM Transactions on Interactive Intelligent Systems
Explaining suggestions made by recommendation systems is key to make users trust and accept these systems. This is specially critical in areas such as art image recommendation. Traditionally, artworks...

Neural language models for text classification in evidence-based medicine

Authors: Denis Parra, Gabriel Rada, Andrés Carvallo et al.
Year: 2020

Neural language models for text classification in evidence-based medicine

Authors: Denis Parra, Andrés Carvallo, Gabriel Rada et al.
Year: 2020Source: arXiv (Cornell University)
The COVID-19 has brought about a significant challenge to the whole of humanity, but with a special burden upon the medical community. Clinicians must keep updated continuously about symptoms, diagnos...

Automatic document screening of medical literature using word and text embeddings in an active learning setting

Authors: Denis Parra, Álvaro Soto, Hans Löbel et al.
Year: 2020Source: Scientometrics

Fine-Grained Entity Linking

Authors: Aidan Hogan, Bárbara Poblete, Henry Rosales-Méndez
Year: 2020Source: Journal of Web Semantics
The Entity Linking (EL) task involves linking mentions of entities in a text with their identifier in a Knowledge Base (KB) such as Wikipedia, BabelNet, DBpedia, Freebase, Wikidata, YAGO, etc. Numerou...

How to Handle Health-Related Small Imbalanced Data in Machine Learning?

Authors: Ricardo Baeza-Yates, Maria Rauschenberger, Ricardo Baeza‐Yates
Year: 2020Source: i-com
Abstract When discussing interpretable machine learning results, researchers need to compare them and check for reliability, especially for health-related data. The reason is the negative impact of wr...

Special Issue on Database Theory

Authors: Pablo Barceló, Marco Calautti
Year: 2020Source: Theory of Computing Systems

The Chilean Waiting List Corpus

Authors: Felipe Bravo-Márquez, Jocelyn Dunstan, Fabián Villena et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
In this work we describe the Waiting List Corpus consisting of de-identified referrals for several specialty consultations from the waiting list in Chilean public hospitals. A subset of 3000 referrals...

The Chilean Waiting List Corpus

Authors: Felipe Bravo-Márquez, Jocelyn Dunstan, Fabián Villena et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
Here we describe a new clinical corpus rich in nested entities and a series of neural models to identify them. The corpus comprises de-identified referrals from the waiting list in Chilean public hosp...

The Chilean Waiting List Corpus

Authors: Felipe Bravo-Márquez, Jocelyn Dunstan, Fabián Villena et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
Here we describe a new clinical corpus rich in nested entities and a series of neural models to identify them. The corpus comprises de-identified referrals from the waiting list in Chilean public hosp...

The Chilean Waiting List Corpus

Authors: Felipe Bravo-Márquez, Jocelyn Dunstan, Fabián Villena et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
Here we describe a new clinical corpus rich in nested entities and a series of neural models to identify them. The corpus comprises de-identified referrals from the waiting list in Chilean public hosp...

The Chilean Waiting List Corpus

Authors: Felipe Bravo-Márquez, Jocelyn Dunstan, Fabián Villena et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
Here we describe a new clinical corpus rich in nested entities and a series of neural models to identify them. The corpus comprises de-identified referrals from the waiting list in Chilean public hosp...

Characterization of Anorexia Nervosa on Social Media: Textual, Visual, Relational, Behavioral, and Demographical Analysis (Preprint)

Authors: Ricardo Baeza-Yates, Nadia Sanz Lamora, Diego Alejandro Velazquez et al.
Year: 2020
<sec> <title>BACKGROUND</title> Eating disorders are psychological conditions characterized by unhealthy eating habits. Anorexia nervosa (AN) is defined as the belief of being overweight despite being...

Gradualizing the Calculus of Inductive Constructions

Authors: Nicolas Tabareau, Meven Lennon-Bertrand, Éric Tanter et al.
Year: 2020Source: ACM Transactions on Programming Languages and Systems
We investigate gradual variations on the Calculus of Inductive Construction (CIC) for swifter prototyping with imprecise types and terms. We observe, with a no-go theorem, a crucial trade-off between ...

Probabilistic automata of bounded ambiguity

Authors: Cristian Riveros, Nathanaël Fijalkow, James Worrell
Year: 2020Source: Information and Computation

Setting the agenda: The news media and public opinion, 3rd edition

Authors: Sebastián Valenzuela, Maxwell McCombs
Year: 2020

Supporting the Classification of Patients in Public Hospitals in Chile by Designing, Deploying and Validating a System Based on Natural Language Processing

Authors: Jocelyn Dunstan, Fabián Villena, René Lagos et al.
Year: 2020Source: Research Square (Research Square)
<title>Abstract</title> BackgroundIn Chile, a patient needing a specialty consultation or surgery has to first be referred by a general practitioner, then placed on a waiting list. The Explicit Health...

Block trees

Authors: Gonzalo Navarro, Travis Gagie, Djamal Belazzougui et al.
Year: 2020Source: Journal of Computer and System Sciences

The ALeRCE Light Curve Classifier: labeled set, features, and classifications

Authors: L. Sabatini-Gacitúa, A. Moya, E. Castillo-Navarrete et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
Labeled set, features, and classifications of the ZTF alert stream (up to 2020/06/09) presented in the article "Alert Classification for the ALeRCE Broker System: The Light Curve Classifier", Sánchez...

Inspecting state of the art performance and NLP metrics in image-based medical report generation

Authors: Denis Parra, Pablo Messina, Cecilia Besa et al.
Year: 2020Source: arXiv (Cornell University)
Several deep learning architectures have been proposed over the last years to deal with the problem of generating a written report given an imaging exam as input. Most works evaluate the generated rep...

Inspecting state of the art performance and NLP metrics in image-based medical report generation

Authors: Pablo Messina, Denis Parra, Pablo Pino et al.
Year: 2020

First-Order Rewritability of Frontier-Guarded Ontology-Mediated Queries

Authors: Pablo Barceló, Andréas Pieris, Gérald Berger et al.
Year: 2020Source: arXiv (Cornell University)
We focus on ontology-mediated queries (OMQs) based on (frontier-)guarded existential rules and (unions of) conjunctive queries, and we investigate the problem of FO-rewritability, i.e., whether an OMQ...

The ALeRCE Light Curve Classifier: labeled set, features, and classifications

Authors: Susana Eyheramendy, M. Catelan, J. Borissova et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
Labeled set, features, and classifications of the ZTF alert stream (up to 2020/06/09) presented in the article "Alert Classification for the ALeRCE Broker System: The Light Curve Classifier", Sánchez...

New initialization for algorithms to solve Median String Problem

Authors: Diego Seco, Pedro Mirabal, José Abreu et al.
Year: 2020

A Hybrid Compressed Data Structure Supporting Rank and Select on Bit Sequences

Authors: Diego Arroyuelo, Manuel Weitzman
Year: 2020
We introduce a practical data structure for supporting the fundamental operations rank, select, and member on integer sets (and their corresponding characteristic bit vector). Our data structure uses ...

FAIS: A System for Effectively Learning Students Names and Faces in Massive Courses

Authors: Jorge Baier, Jorge Muñoz-Gama, Raul Alvarez-Esteban et al.
Year: 2020
Low classroom engagement and distractions are important challenges of massive courses. The literature shows that those problems could decrease when the teacher addresses the students by their names. L...

SiGeCo: Customizable peer assessment management system for teaching-learning process

Authors: Sebastián Valenzuela, Luís Silvestre
Year: 2020
Teamwork activities can create difficulties for teachers to evaluate the performance of the members. To address this challenge a peer evaluation, also known as peer assessment, is usually applied. In ...

Declarative Approaches to Counterfactual Explanations for Classification

Authors: Leopoldo Bertossi
Year: 2020

Declarative Approaches to Counterfactual Explanations for Classification

Authors: Leopoldo Bertossi
Year: 2020Source: arXiv (Cornell University)
We propose answer-set programs that specify and compute counterfactual interventions on entities that are input on a classification model. In relation to the outcome of the model, the resulting counte...

I Don’t Want You to Be My President! Incivility and Media Bias During the Presidential Election in Chile

Authors: Magdalena Saldaña, Andrés Rosenberg
Year: 2020Source: Social Media + Society
This study observes two relevant issues in today’s media ecosystem: incivility in online news comments and media bias during election periods. By analyzing 84 stories and 4670 comments published dur...

Gradual verification of recursive heap data structures

Authors: Eric Tanter, Jenna Wise, Johannes Bader et al.
Year: 2020Source: Proceedings of the ACM on Programming Languages
Current static verification techniques do not provide good support for incrementality, making it difficult for developers to focus on specifying and verifying the properties and components that are mo...

Querying the Semantic Web via Rules

Authors: Marcelo Arenas, Georg Gottlob, Andréas Pieris
Year: 2020Source: Studies on the semantic web
The problem of querying RDF data is a central issue for the development of the Semantic Web. The query language SPARQL has become the standard language for querying RDF since its W3C standardization i...

Semantic Optimization of Conjunctive Queries

Authors: Pablo Barceló, Andreas Pieris, Diego Figueira et al.
Year: 2020Source: Journal of the ACM
This work deals with the problem of semantic optimization of the central class of conjunctive queries (CQs). Since CQ evaluation is NP-complete, a long line of research has focussed on identifying fra...

Genome-wide association study identifies eight loci associated with blood pressure

Authors: Susana Eyheramendy, John C. Chambers, Christopher Newton‐Cheh et al.
Year: 2020Source: Carolina Digital Repository (University of North Carolina at Chapel Hill)
Christopher Newton-Cheh and colleagues report a genome-wide association study for blood pressure traits as part of the Global BPgen consortium. They report eight loci with replicated association to sy...

Specifying and computing causes for query answers in databases via database repairs and repair-programs

Authors: Leopoldo Bertossi
Year: 2020Source: Knowledge and Information Systems

An index for moving objects with constant-time access to their compressed trajectories

Authors: Nieves Brisaboa, Travis Gagie, Gonzalo Navarro et al.
Year: 2020Source: International Journal of Geographical Information Science
As the number of vehicles and devices equipped with GPS technology has grown explosively, an urgent need has arisen for time- and space-efficient data structures to represent their trajectories. The m...

In-Database Graph Analytics with Recursive SPARQL

Authors: Aidan Hogan, Juan Reutter, Adrián Soto et al.
Year: 2020Source: Lecture notes in computer science
Works on knowledge graphs and graph-based data management often focus either on graph query languages or on frameworks for graph analytics, where there has been little work in trying to combine both a...

Minding the AI Gap in LATAM

Authors: Jorge Pérez, Bárbara Poblete
Year: 2020Source: Communications of the ACM
Comision Nacional de Investigacion Cientifica y Tecnologica (CONICYT) \n \nCONICYT FONDECYT \n \n1191604 \n1200967

Three Success Stories About Compact Data Structures

Authors: Diego Seco, Diego Arroyuelo, José Fuentes-Sepúlveda et al.
Year: 2020Source: Communications of the ACM
No abstract available.

Gobernanza en ciudades portuarias. Aprendizajes desde el Área Metropolitana de Concepción

Authors: Sergio Toro-Maureira, Mabel Alarcón, Violeta Montero et al.
Year: 2020

Extending SPARQL with Similarity Joins

Authors: Aidan Hogan, Sebastián Ferrada, Benjamín Bustos
Year: 2020Source: Lecture notes in computer science

GENE: Graph generation conditioned on named entities for polarity and controversy detection in social media

Authors: Denis Parra, Marcelo Mendoza, Álvaro Soto
Year: 2020Source: Information Processing & Management

Chile's new interdisciplinary institute for foundational research on data

Authors: Pablo Barceló, Marcelo Arenas
Year: 2020Source: Communications of the ACM
research-article Share on Chile's new interdisciplinary institute for foundational research on data Authors: Marcelo Arenas Universidad Católica and IMFD in Santiago, Chile Universidad Católica and ...

Differential Privacy and SPARQL

Authors: Federico Olmedo, Carlos Buil-Aranda, Jorge Lobo
Year: 2020

Querying APIs with SPARQL: Language and Worst-Case Optimal Algorithm

Authors: Juan Reutter, Domagoj Vrgoč, Matthieu Mosser et al.
Year: 2020Source: Lecture notes in computer science

Suggesting Citations for Wikidata Claims basedon Wikipedia’s External References

Authors: Aidan Hogan, Pablo Curotto
Year: 2020

Versioned Queries over RDF Archives: All You Need is SPARQL?

Authors: Aidan Hogan, Ignacio Cuevas
Year: 2020

Global Vertex Similarity for Large-Scale Knowledge Graphs

Authors: Aidan Hogan, Marco Caballero
Year: 2020

Model Interpretability through the Lens of Computational Complexity

Authors: Pablo Barceló, Jorge Pérez
Year: 2020

Model Interpretability through the Lens of Computational Complexity

Authors: Pablo Barceló, Mikaël Monet, Bernardo Subercaseaux et al.
Year: 2020Source: arXiv (Cornell University)
In spite of several claims stating that some models are more interpretable than others -- e.g., "linear models are more interpretable than deep neural networks" -- we still lack a principled notion of...

Welcome

Authors: Gonzalo Navarro, Virgı́lio Almeida, Sergio Rajsbaum
Year: 2020Source: Communications of the ACM
introduction Share on Welcome Authors: Virgilio Almeida Harvard University, Cambridge, MA Harvard University, Cambridge, MAView Profile , Gonzalo Navarro University of Chile, in Santiago University of...

Contextual Linear Types for Differential Privacy

Authors: Matías Toro, Eric Tanter, Federico Olmedo et al.
Year: 2020

Offering an Entrepreneurship Course to All Engineering Students: Self-efficacy Gains and Learning Benefits

Authors: Jorge Baier, Isabel Hilliger, Mar Pérez‐Sanagustín et al.
Year: 2020Source: 2021 IEEE Frontiers in Education Conference (FIE)
In order to develop an entrepreneurial mindset in future engineers, entrepreneurial training has become a key aspect of engineering education. Following this trend, a large and prestigious engineering...

Contextual Linear Types for Differential Privacy

Authors: Matías Toro, Eric Tanter, Federico Olmedo et al.
Year: 2020Source: arXiv (Cornell University)
Language support for differentially-private programming is both crucial and delicate. While elaborate program logics can be very expressive, type-system based approaches using linear types tend to be ...

A Survey on Deep Learning and Explainability for Automatic Image-based Medical Report Generation

Authors: Pablo Messina, Denis Parra, Álvaro Soto et al.
Year: 2020

A Survey on Deep Learning and Explainability for Automatic Report Generation from Medical Images

Authors: Denis Parra, Pablo Messina, Álvaro Soto et al.
Year: 2020Source: arXiv (Cornell University)
Every year physicians face an increasing demand of image-based diagnosis from patients, a problem that can be addressed with recent artificial intelligence methods. In this context, we survey works in...

Recursion in SPARQL

Authors: Juan Reutter, Domagoj Vrgoč, Adrián Soto et al.
Year: 2020Source: Semantic Web
The need for recursive queries in the Semantic Web setting is becoming more and more apparent with the emergence of datasets where different pieces of information are connected by complicated patterns...

Laconic Image Classification: Human vs. Machine Performance

Authors: Javier Carrasco, Aidan Hogan, Jorge Pérez
Year: 2020
We propose laconic classification as a novel way to understand and compare the performance of diverse image classifiers. The goal in this setting is to minimise the amount of information (aka. entropy...

Knowledge Graphs: A Tutorial on the History of Knowledge Graph's Main Ideas

Authors: Claudio Gutiérrez, Juan Sequeda
Year: 2020
Knowledge Graphs can be considered as fulfilling an early vision in Computer Science of creating intelligent systems that integrate knowledge and data at large scale. Stemming from scientific advancem...

Knowledge Graphs: Research Directions

Authors: Aidan Hogan
Year: 2020Source: Lecture notes in computer science
In these lecture notes, we provide an overview of some of the high-level research directions and open questions relating to knowledge graphs. We discuss six high-level concepts relating to knowledge g...

Bots in Social and Interaction Networks: Detection and Impact Estimation

Authors: Marcelo Mendoza, Maurizio Tesconi, Stefano Cresci
Year: 2020Source: ACM transactions on office information systems
The rise of bots and their influence on social networks is a hot topic that has aroused the interest of many researchers. Despite the efforts to detect social bots, it is still difficult to distinguis...

Ranked Enumeration of MSO Logic on Words

Authors: Cristian Riveros, Alejandro Grez, Pierre Bourhis et al.
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
In the last years, enumeration algorithms with bounded delay have attracted a lot of attention for several data management tasks. Given a query and the data, the task is to preprocess the data and the...

Ranked enumeration of MSO logic on words

Authors: Cristian Riveros, Alejandro Grez, Pierre Bourhis et al.
Year: 2020Source: arXiv (Cornell University)
In the last years, enumeration algorithms with bounded delay have attracted a lot of attention for several data management tasks. Given a query and the data, the task is to preprocess the data and the...

Gradual Verification of Recursive Heap Data Structures

Authors: Eric Tanter, Joshua Sunshine, Cameron Wong et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
<pre>Current static verification techniques do not provide good support for incrementality, making it difficult for developers to focus on specifying and verifying the properties and components that a...

AMW 2019 Special Issue

Authors: Aidan Hogan, Tova Milo
Year: 2020Source: Information Systems

Gradual Verification of Recursive Heap Data Structures

Authors: Eric Tanter, Joshua Sunshine, Cameron Wong et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
<pre>Current static verification techniques do not provide good support for incrementality, making it difficult for developers to focus on specifying and verifying the properties and components that a...

Analyzing the effect of the topology on succinct tree encodings

Authors: Diego Seco, José Fuentes Sepúlveda, Alexander Irribarra
Year: 2020

Streaming Enumeration on Nested Documents

Authors: Martı́n Muñoz, Cristian Riveros
Year: 2020Source: arXiv (Cornell University)
Some of the most relevant document schemas used online, such as XML and JSON, have a nested format. In the last decade, the task of extracting data from nested documents over streams has become especi...

Constant-delay enumeration algorithms for document spanners over nested documents

Authors: Cristian Riveros, Martín Muñoz
Year: 2020

Streaming enumeration on nested documents

Authors: Cristian Riveros, Martı́n Muñoz
Year: 2020Source: arXiv (Cornell University)
Some of the most relevant document schemas used online, such as XML and JSON, have a nested format. In the last decade, the task of extracting data from nested documents over streams has become especi...

Peripheral elaboration model: The impact of incidental news exposure on political participation

Authors: Magdalena Saldaña, Saif Shahin, Homero Gil de Zúñiga
Year: 2020Source: Journal of Information Technology & Politics
This study places the “cognitive elaboration model” on news gathering and political behavior within the dual-processing “elaboration likelihood model” to derive hypotheses about the effects of...

Predecessor Search

Authors: Gonzalo Navarro, Javiel Rojas-Ledesma
Year: 2020Source: ACM Computing Surveys
The predecessor problem is a key component of the fundamental sorting-and-searching core of algorithmic problems. While binary search is the optimal solution in the comparison model, more realistic ma...

A MOOC-based flipped experience: Scaffolding SRL strategies improves learners' time management and engagement

Authors: Jorge Baier, Isabel Hilliger, Mar Pérez‐Sanagustín et al.
Year: 2020Source: Computer Applications in Engineering Education
Abstract Higher education institutions are increasingly considering the use of a form of blended learning, commonly named as flipped classroom (FC), in which students watch video lectures drawn from a...

Scalable recommendation of wikipedia articles to editors using representation learning

Authors: Denis Parra, Oleksii Moskalenko, Diego Saez-Trumper
Year: 2020

CuratorNet: Visually-aware recommendation of art images

Authors: Felipe del Rio, Pablo Messina, Denis Parra et al.
Year: 2020

Scalable Recommendation of Wikipedia Articles to Editors Using Representation Learning

Authors: Denis Parra, Diego Sáez-Trumper, Oleksii Moskalenko
Year: 2020Source: arXiv (Cornell University)
Wikipedia is edited by volunteer editors around the world. Considering the large amount of existing content (e.g. over 5M articles in English Wikipedia), deciding what to edit next can be difficult, b...

Improving query expansion strategies with word embeddings

Authors: Marcelo Mendoza, Alfredo Silva
Year: 2020
Representation learning has been a fruitful area in recent years, driven by the growing interest in deep learning methods. In particular, word representation learning, a.k.a. word embeddings has trigg...

Score-Based Explanations in Data Management and Machine Learning: An Answer-Set Programming Approach to Counterfactual Analysis

Authors: Leopoldo Bertossi
Year: 2020Source: Lecture notes in computer science

Interpretable Contextual Team-aware Item Recommendation: Application in Multiplayer Online Battle Arena Games

Authors: Denis Parra, Vladimir Araujo, Andrés Villa et al.
Year: 2020
The video game industry has adopted recommendation systems to boost users interest with a focus on game sales. Other exciting applications within video games are those that help the player make decisi...

Storage, Indexing, Query Processing, and Benchmarking in Centralized and Distributed RDF Engines: A Survey

Authors: Aidan Hogan, Axel-Cyrille Ngonga Ngomo, Bin Yao et al.
Year: 2020Source: arXiv (Cornell University)
The recent advancements of the Semantic Web and Linked Data have changed the working of the traditional web. There is significant adoption of the Resource Description Framework (RDF) format for saving...

Space/time-efficient RDF stores based on circular suffix sorting

Authors: Gonzalo Navarro, Guillermo de Bernardo, Nieves R. Brisaboa et al.
Year: 2020Source: arXiv (Cornell University)
In recent years, RDF has gained popularity as a format for the standardized publication and exchange of information in the Web of Data. In this paper we introduce RDFCSA, a data structure that is able...

Bias in Search and Recommender Systems

Authors: Ricardo Baeza-Yates, Ricardo Baeza‐Yates
Year: 2020
We explore the vicious cycle of bias on the Web related to search and recommender systems. The first bias is activity bias [1], called by Nielsen participation inequality in Internet [3]. This means t...

Compact structure for sparse undirected graphs based on a clique graph partition

Authors: Susana Ladra, Felipe Glaria, Gonzalo Navarro et al.
Year: 2020Source: Information Sciences

Author Correction: Reclassifying neurodegenerative diseases

Authors: Ricardo Baeza-Yates, Pablo Villoslada, Joseph C. Masdeu
Year: 2020Source: Nature Biomedical Engineering

"DCC-Uchile at SemEval-2020 Task 1: Temporal Referencing Word Embeddings"

Authors: Felipe Bravo-Márquez, Frank Zamora
Year: 2020

Tree Path Majority Data Structures

Authors: Travis Gagie, Meng He, Gonzalo Navarro et al.
Year: 2020Source: Theoretical Computer Science

3D Shape Matching for Retrieval and Recognition

Authors: Benjamín Bustos, Ivan Sipiran
Year: 2020Source: Springer eBooks

Work in Progress: Engaging Engineering Teaching Staff in Continuous Improvement Process

Authors: Jorge Baier, Isabel Hilliger, Mar Pérez‐Sanagustín et al.
Year: 2020
Abstract External influences to schools of engineering have resulted in important curriculum changes. Over the two decades, the influence of the Accreditation Board of Engineering and Technology (ABET...

CuratorNet: Visually-aware Recommendation of Art Images

Authors: Denis Parra, Pablo Messina, Manuel Cartagena et al.
Year: 2020Source: arXiv (Cornell University)
Although there are several visually-aware recommendation models in domains like fashion or even movies, the art domain lacks thesame level of research attention, despite the recent growth of the onlin...

Resource Description Framework

Authors: Aidan Hogan
Year: 2020Source: Springer eBooks

SPARQL Query Language

Authors: Aidan Hogan
Year: 2020Source: Springer eBooks

Shape Constraints and Expressions

Authors: Aidan Hogan
Year: 2020Source: Springer eBooks

RDF Schema and Semantics

Authors: Aidan Hogan
Year: 2020Source: Springer eBooks

Web Ontology Language

Authors: Aidan Hogan
Year: 2020Source: Springer eBooks

The Web of Data

Authors: Aidan Hogan
Year: 2020Source: Springer eBooks

Linked Data

Authors: Aidan Hogan
Year: 2020Source: Springer eBooks

What is Care in Engineering Teaching?

Authors: Jorge Baier, Isabel Hilliger, Constanza Melian et al.
Year: 2020Source: 2020 ASEE Virtual Annual Conference Content Access Proceedings
His research focuses on areas of automated reasoning in Artificial Intelligence; specifically, automated planning, search and knowledge representation. Currently his research focuses on understanding ...

Work in Progress: What Makes Courses Demanding in Engineering Education? A Combination of Mixed Methods and Grounded Theory Research

Authors: Jorge Baier, Isabel Hilliger, Constanza Melian et al.
Year: 2020Source: 2020 ASEE Virtual Annual Conference Content Access Proceedings
Abstract Due to external influences, such as internationalization and technological changes, engineering curricula have incorporated an increasing number of contents and competencies. Future engineers...

Efficient Logspace Classes for Enumeration, Counting, and Uniform Generation

Authors: Cristian Riveros, Marcelo Arenas, Luis Alberto Croquevielle et al.
Year: 2020Source: ACM SIGMOD Record
We study two simple yet general complexity classes, which provide a unifying framework for efficient query evaluation in areas like graph databases and information extraction, among others. We investi...

Explaining VQA predictions using visual grounding and a knowledge base

Authors: Álvaro Soto, Juan Carlos Niebles, Yundong Zhang et al.
Year: 2020

3D-printed foot prosthesis during gait: Comparison with two standard prostheses

Authors: Aidan Hogan, Ursula Trinler, Mathias Rehg et al.
Year: 2020Source: Gait & Posture

Storage, Indexing, Query Processing, and Benchmarking in Centralized and Distributed RDF Engines: A Survey

Authors: Aidan Hogan, Axel-Cyrille Ngonga Ngomo, Waqas Ali et al.
Year: 2020Source: Preprints.org
The recent advancements of the Semantic Web and Linked Data have changed the working of the traditional web. There is significant adoption of the Resource Description Framework (RDF) format for saving...

External References of English Wikipedia (ref-wiki-en)

Authors: Aidan Hogan, Paolo Curotto
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
<strong>External References of English Wikipedia </strong>(<strong>ref-wiki-en</strong>) is a corpus of the plain-text content of 2,475,461 external webpages linked from the reference section of artic...

External References of English Wikipedia (ref-wiki-en)

Authors: Aidan Hogan, Paolo Curotto
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
<strong>External References of English Wikipedia </strong>(<strong>ref-wiki-en</strong>) is a corpus of the plain-text content of 2,475,461 external webpages linked from the reference section of artic...

An ASP-Based Approach to Counterfactual Explanations for Classification

Authors: Leopoldo Bertossi
Year: 2020Source: Lecture notes in computer science

Reclassifying neurodegenerative diseases

Authors: Ricardo Baeza-Yates, Pablo Villoslada, Joseph C. Masdeu et al.
Year: 2020Source: Nature Biomedical Engineering

SHREC 2020: Retrieval of digital surfaces with similar geometric reliefs

Authors: Claudio Tortorici, Naoufel Werghi, Ahmad Shaker Obeid et al.
Year: 2020Source: Computers & Graphics
This paper presents the methods that have participated in the SHREC’20 contest on retrieval of surface patches with similar geometric reliefs and the analysis of their performance over the benchmark...

Fast and Compact Planar Embeddings

Authors: Travis Gagie, Meng He, Gonzalo Navarro et al.
Year: 2020Source: Computational Geometry
There are many representations of planar graphs, but few are as elegant as Turan's (1984): it is simple and practical, uses only 4 bits per edge, can handle self-loops and multiedges, and can store an...

A Multi-resolution Approximation for Time Series

Authors: Benjamín Bustos, Heider Sanchez
Year: 2020Source: Neural Processing Letters

Postadmixture Selection on Chileans Targets Haplotype Involved in Pigmentation, Thermogenesis and Immune Defense against Pathogens

Authors: Lucas Vicuña, Susana Eyheramendy, Tomás Norambuena et al.
Year: 2020Source: Genome Biology and Evolution
Abstract Detection of positive selection signatures in populations around the world is helping to uncover recent human evolutionary history as well as the genetic basis of diseases. Most human evoluti...

Arabic dialect sentiment analysis with ZERO effort. Case study: Algerian dialect

Authors: Marcelo Mendoza, Imane Guellil, Faical Azoauau et al.
Year: 2020Source: INTELIGENCIA ARTIFICIAL
This paper presents an analytic study showing that it is entirely possible to analyze the sentiment of an Arabic dialect without constructing any resources. The idea of this work is to use the resourc...

The Tractability of SHAP-Score-Based Explanations over Deterministic and Decomposable Boolean Circuits

Authors: Pablo Barceló, Marcelo Arenas, Mikaël Monet
Year: 2020Source: arXiv (Cornell University)
Scores based on Shapley values are widely used for providing explanations to\nclassification results over machine learning models. A prime example of this is\nthe influential SHAP-score, a version of ...

Nuevos desafíos, enfoques y perspectivas para estudiar élites políticas

Authors: Sergio Toro, Sergio Toro-Maureira, Alejandro Olivares et al.
Year: 2020Source: DOAJ (DOAJ: Directory of Open Access Journals)
Esta seccion analiza los nuevos desafios, enfoques analiticos y metodologias para el estudio de diferentes elites politicas. En base a la literatura existente y la revision de casos de America Latina,...

Correcting for differential recruitment in respondent-driven sampling data using ego-network information

Authors: Isabelle Beaudry, Krista J. Gile
Year: 2020

R3MAT: A Rapid and Robust Graph Generator

Authors: Renzo Angles, Roberto García, Rodrigo Paredes
Year: 2020

Personalization, Bias and Privacy

Authors: Ricardo Baeza-Yates, Ricardo Baeza‐Yates
Year: 2020
Personalization can be seen as a positive bias towards each user. However, it also has negative consequences such as privacy loss as well as the filter bubble or echo chamber effect due to the feedbac...

GENE: Graph generation conditioned on named entities

Authors: Denis Parra, Marcelo Mendoza, Álvaro Soto
Year: 2020
This dataset consiste on a collection of news and their comments labeled according to the level of controversy that the comments produced in an online newspaper. The dataset comprises 143340 news with...

Using Deep Learning to Detect Rumors in Twitter

Authors: Marcelo Mendoza, Eliana Providel
Year: 2020Source: Lecture notes in computer science

Efficient GPU thread mapping on embedded 2D fractals

Authors: Raimundo Vega, Felipe A. Quezada, Cristóbal A. Navarro et al.
Year: 2020Source: Future Generation Computer Systems

“Fake News is Anything They Say!”–Conceptualization and Weaponization of Fake News Among the American Public

Authors: Sebastián Valenzuela, Hernando Rojas, Chau Tong et al.
Year: 2020Source: Mass Communication & Society
This study examines the articulation of public opinion about so-called fake news using a national survey (N = 510) of U.S. adults conducted in 2018. We coded respondents' open-ended answers about what...

A User Interface for Exploring and Querying Knowledge Graphs (Extended Abstract)

Authors: Aidan Hogan, Carlos Buil-Aranda, Hernán Vargas et al.
Year: 2020
As the adoption of knowledge graphs grows, more and more non-experts users need to be able to explore and query such graphs. These users are not typically familiar with graph query languages such as S...

Solving a Special Case of the Intensional vs Extensional Conjecture in Probabilistic Databases

Authors: Mikael Monet
Year: 2020

Adversarial Evaluation of BERT for Biomedical Named Entity Recognition

Authors: Denis Parra, Vladimir Araujo, Andrés Carvallo
Year: 2020

Long-term constitutive equations (aging factor approach)

Authors: Gonzalo Navarro, Borja Regúlez, David Fernández‐Ordóñez et al.
Year: 2020Source: Bulletin - FIB

Phenomenological study of restraint moments over the piers

Authors: Gonzalo Navarro, David Fernández‐Ordóñez, Pieter van der Zee et al.
Year: 2020Source: Bulletin - FIB

Estimation of the continuity forces

Authors: Gonzalo Navarro, Borja Regúlez, David Fernández‐Ordóñez et al.
Year: 2020Source: Bulletin - FIB

Introduction

Authors: Gonzalo Navarro, David Fernández‐Ordóñez, Pieter van der Zee et al.
Year: 2020Source: Bulletin - FIB

Bibliography

Authors: Gonzalo Navarro, Hugo Corres Peiretti, Maher K. Tadros et al.
Year: 2020Source: Bulletin - FIB

Parametric study

Authors: Gonzalo Navarro, David Fernández‐Ordóñez, Pieter van der Zee et al.
Year: 2020Source: Bulletin - FIB

References

Authors: Gonzalo Navarro, Hugo Corres Peiretti, Maher K. Tadros et al.
Year: 2020Source: Bulletin - FIB

Notation

Authors: Gonzalo Navarro, David Fernández‐Ordóñez, Pieter van der Zee et al.
Year: 2020Source: Bulletin - FIB

Scope

Authors: Gonzalo Navarro, David Fernández‐Ordóñez, Pieter van der Zee et al.
Year: 2020Source: Bulletin - FIB

An efficient algorithm for approximated self-similarity joins in metric spaces.

Authors: Sebastián Ferrada, Benjamín Bustos, Nora Reyes et al.
Year: 2020Source: Information Systems

The Chilean Waiting List Corpus

Authors: Jocelyn Dunstan, Fabián Villena, Matías Rojas et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
Referrals from the waiting list for several specialty consultations in Chilean public hospitals were used to create a de-identified clinical corpus. A subset of 900 referrals was manually annotated wi...

Chilean Waiting List Corpus Embeddings

Authors: Jocelyn Dunstan, Fabián Villena
Year: 2020Source: Figshare
The Chilean Waiting List Corpus Embeddings is a Word2Vec word embedding trained over 11 million unstructured free text diagnostics obtained from the Chilean Waiting List through Transparency Law. The ...

WEFE: The Word Embeddings Fairness Evaluation Framework

Authors: Jorge Pérez, Felipe Bravo-Márquez, Pablo Valdés-Badilla
Year: 2020
Word embeddings are known to exhibit stereotypical biases towards gender, race, religion, among other criteria. Severa fairness metrics have been proposed in order to automatically quantify these bias...

Hate speech detection is not as easy as you may think: A closer look at model validation (extended version)

Authors: Jorge Pérez, Bárbara Poblete, Aymé Arango
Year: 2020Source: Information Systems

Crisis de la representación política en América Latina y los ciclos pendulares de coaliciones electorales oligárquicas y antisistema

Authors: Juan Pablo Luna
Year: 2020

Every Colour You Are: Stance Prediction and Turnaround in Controversial Issues

Authors: Ricardo Baeza-Yates, Eduardo Graells-Garrido, Mounia Lalmas
Year: 2020
Web platforms have allowed political manifestation and debate for decades. Technology changes have brought new opportunities for expression, and the availability of longitudinal data of these debates ...

Ties, Likes, and Tweets: Using Strong and Weak Ties to Explain Differences in Protest Participation Across Facebook and Twitter Use

Authors: Sebastián Valenzuela, Homero Gil de Zúñiga, Teresa Correa
Year: 2020Source: Routledge eBooks
Based on the theoretical concepts of social networks and technology affordances, this article argues that different social media platforms influence political participation through unique, yet complem...

Stronger and Safer Together: Motivations for and Challenges of (Trans)National Collaboration in Investigative Reporting in Latin America

Authors: Magdalena Saldaña, Lourdes M. Cueva Chacón
Year: 2020Source: Digital Journalism
Despite the growing scholarship on investigative journalism in Latin America, very few studies have addressed collaboration across newsrooms in the region. By analyzing the responses of 251 journalist...

Causality-based Explanation of Classification Outcomes

Authors: Leopoldo Bertossi, Jordan Li, Maximilian Schleich et al.
Year: 2020
We propose a simple definition of an explanation for the outcome of a classifier based on concepts from causality. We compare it with previously proposed notions of explanation, and study their comple...

The Limits of Efficiency for Open- and Closed-World Query Evaluation under Guarded TGDs

Authors: Pablo Barceló, Carsten Lutz, Victor Dalmau et al.
Year: 2020
Ontology-mediated querying and querying in the presence of constraints are two key database problems where tuple-generating dependencies (TGDs) play a central role. In ontology-mediated querying, TGDs...

Differentiable adaptive computation time for visual reasoning

Authors: Álvaro Soto, Cristóbal Eyzaguirre
Year: 2020

Abstracting gradual references

Authors: Matías Toro, Eric Tanter, Éric Tanter
Year: 2020Source: Science of Computer Programming

A Simple and Fast Bi-Objective Search Algorithm

Authors: Jorge Baier, Carlos Hernández-Ulloa, William Yeoh et al.
Year: 2020

Uruguay 2019: Party system restructuring and the end of the progressive cycle

Authors: Rafael Piñeiro Rodríguez, Fernando Rosenblatt, Lihuen Nocetto
Year: 2020

A More General Theory of Static Approximations for Conjunctive Queries

Authors: Pablo Barceló, Miguel Romero, Thomas Zeume
Year: 2020Source: Theory of Computing Systems

A Simple and Fast Bi-Objective Search Algorithm

Authors: Jorge Baier, Sven Koenig, William Yeoh et al.
Year: 2020Source: Proceedings of the International Conference on Automated Planning and Scheduling
Many interesting search problems can be formulated as bi-objective search problems, that is, search problems where two kinds of costs have to be minimized, for example, travel distance and time for tr...

Counting Problems over Incomplete Databases

Authors: Pablo Barceló, Marcelo Arenas, Mikael Monet et al.
Year: 2020
We study the complexity of various fundamental counting problems that arise in the context of incomplete databases, i.e., relational databases that can contain unknown values in the form of labeled nu...

WEFE: The Word Embeddings Fairness Evaluation Framework.

Authors: Jorge Pérez, Felipe Bravo-Márquez, Rajesh Jayaram
Year: 2020

Evaluating the 2014 Sugar-Sweetened Beverage Tax in Chile: Observational Evidence from Urban Areas

Authors: Jocelyn Dunstan, Andrew J. Mirelman, Ryota Nakamura et al.
Year: 2020Source: World Scientific series in global healthcare economics and public policy
An already large and growing number of countries — both rich and poor — are facing an enormous challenge to curb rising rates of obesity and diet-related ill-health, much of which affects lower so...

Lempel–Ziv-Like Parsing in Small Space

Authors: Gonzalo Navarro, Simon J. Puglisi, Daniel Valenzuela et al.
Year: 2020Source: Algorithmica

cBiK: A Space-Efficient Data Structure for Spatial Keyword Queries

Authors: Diego Seco, Miguel A. Martínez-Prieto, Carlos E. Sanjuan-Contreras et al.
Year: 2020

Semantic Search of Memes on Twitter

Authors: Benjamín Bustos, Magdalena Saldaña, Jesús Pérez-Martín
Year: 2020

Multipath Adaptive A*: Factors That Influence Performance in Goal-Directed Navigation in Unknown Terrain

Authors: Roberto Asin Acha, Jorge Baier, Carlos Hernández-Ulloa
Year: 2020

Los gobiernos socialdemócratas en Chile

Authors: Sergio Toro-Maureira, Ana Farías Antognini
Year: 2020

Political parties, diminished subtypes, and democracy

Authors: Rafael Piñeiro Rodríguez, Fernando Rosenblatt, Juan Pablo Luna et al.
Year: 2020Source: Party Politics
There is a resurgence of interest in political parties. This resurgent interest embraces a minimalist definition of political parties, according to which any group that competes in elections and recei...

PGO: Describing Property Graphs in RDF

Authors: Renzo Angles, Harsh Thakkar, Dominik Tomaszuk
Year: 2020

Kevin LaGrandeur, James J. Hughes (eds) (2017) Surviving the Machine Age. Intelligent Technology and the Transformation of Human Work. Cham: Palgrave Macmillan. 166 pages. ISBN: 978-3-319-84584-5

Authors: Claudio Gutiérrez
Year: 2020Source: Science & Technology Studies

Mapping RDF Databases to Property Graph Databases

Authors: Renzo Angles, Harsh Thakkar, Dominik Tomaszuk
Year: 2020

Data Quality and Explainable AI

Authors: Leopoldo Bertossi, Floris Geerts
Year: 2020Source: Journal of Data and Information Quality
In this work, we provide some insights and develop some ideas, with few technical details, about the role of explanations in Data Quality in the context of data-based machine learning models (ML). In ...

Detection of Suicidal Ideation on Social Media: Multimodal, Relational, and Behavioral Analysis

Authors: Ricardo Baeza-Yates, Diego Alejandro Velazquez, Josep M. Gonfaus et al.
Year: 2020Source: Journal of Medical Internet Research
Background Suicide risk assessment usually involves an interaction between doctors and patients. However, a significant number of people with mental disorders receive no treatment for their condition ...

Computing coverage kernels under restricted settings

Authors: Javiel Rojas-Ledesma, Jérémy Barbay, Pablo Pérez-Lantero
Year: 2020

Studying incidental news: Antecedents, dynamics and implications

Authors: Sebastián Valenzuela, Neta Kligler-Vilenchik, Alfred Hermida et al.
Year: 2020Source: Journalism
In light of concerns about decreasing news use, a decline in interest in political news or even active avoidance or resistance of news in general, the idea of ‘incidental news’ has been seen as a ...

A Survey on Frameworks Used for Robustness Analysis on Interdependent Networks

Authors: Benjamín Bustos, Ivana Bachmann, Javier Bustos-Jimenez et al.
Year: 2020Source: Complexity
The analysis of network robustness tackles the problem of studying how a complex network behaves under adverse scenarios, such as failures or attacks. In particular, the analysis of interdependent net...

On Adversarial Examples for Biomedical NLP Tasks

Authors: Denis Parra, Vladimir Araujo, Carlos Aspillaga et al.
Year: 2020

On Adversarial Examples for Biomedical NLP Tasks

Authors: Denis Parra, Andrés Carvallo, Vladimir Araujo et al.
Year: 2020Source: arXiv (Cornell University)
The success of pre-trained word embeddings has motivated its use in tasks in the biomedical domain. The BERT language model has shown remarkable results on standard performance metrics in tasks such a...

An Interoperable Repository of Clinical Data

Authors: Marcelo Mendoza, Mauricio Solar, Mauricio Araya-López et al.
Year: 2020

Stable Model Semantics for Recursive SHACL

Authors: Julien Corman, Ognjen Savković, Magdalena Ortiz et al.
Year: 2020
SHACL (SHape Constraint Language) is a W3C recommendation for validating graph-based data against a set of constraints (called shapes). Importantly, SHACL allows to define recursive shapes, i.e. a sha...

Screening risk of dyslexia through a web-game using language-independent content and machine learning

Authors: Ricardo Baeza-Yates, Luz Rello, Maria Rauschenberger et al.
Year: 2020
Children with dyslexia are often diagnosed after they fail school even if dyslexia is not related to general intelligence. In this work, we present an approach for universal screening of dyslexia usin...

Representativeness of Abortion Legislation Debate on Twitter: A Case Study in Argentina and Chile

Authors: Ricardo Baeza-Yates, Eduardo Graells-Garrido, Mounia Lalmas et al.
Year: 2020Source: Companion Proceedings of the The Web Conference 2018
The role of the Web in political exchange has been crucial for society. Its platforms have connected people and allowed manifestation, organization, and access to information; however, they have also ...

Bias on the web and beyond

Authors: Ricardo Baeza-Yates, Ricardo Baeza‐Yates
Year: 2020
The Web is the most powerful communication medium and the largest public data repository that humankind has created. Its content ranges from great reference sources such as Wikipedia to ugly fake news...

Biases on Social Media Data

Authors: Ricardo Baeza-Yates
Year: 2020Source: Companion Proceedings of the The Web Conference 2018
Comunicació presentada al WWW'20: International World Wide Web Conference, celebrat del 20 al 24 d'abril de 2020 a Taipei, Taiwan.

Trace-Relating Compiler Correctness and Secure Compilation

Authors: Eric Tanter, Carmine Abate, Roberto Blanco et al.
Year: 2020Source: Lecture notes in computer science
Abstract Compiler correctness is, in its simplest form, defined as the inclusion of the set of traces of the compiled program into the set of traces of the original program, which is equivalent to the...

DISEÑO COLABORATIVO DE UNA PROPUESTA PARA ABORDAR LA NOCIÓN DE FUNCIÓN QUE COORDINA GRÁFICOS CARTESIANOS CON MODELOS GEOMÉTRICOS DINÁMICOS

Authors: Juan Pablo Luna, Marina Andrés, Enrique Di Rico et al.
Year: 2020Source: Revista de Educación Matemática
En este artículo presentamos una secuencia de actividades -con su análisis- para abordar, en el aula de secundaria, la noción de función como herramienta modelizadora. La propuesta se centra en la...

Hybrid Hashtags: #YouKnowYoureAKiwiWhen Your Tweet Contains Māori and English

Authors: Felipe Bravo-Márquez, Te Taka Keegan, David Trye et al.
Year: 2020Source: Frontiers in Artificial Intelligence
Twitter constitutes a rich resource for investigating language contact phenomena. In this paper, we report findings from the analysis of a large-scale diachronic corpus of over one million tweets, con...

A Lightweight and Extensible AspectJ Implementation

Authors: Eric Tanter, Rodolfo Toledo, Éric Tanter
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
Abstract: Extending AspectJ to experiment with new language features can be cumbersome, even with an extensible implementation. Often, a language designer only needs a rapid prototyping environment, b...

Controlling Aspect Reentrancy

Authors: Eric Tanter, Éric Tanter
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
The coexpression of cytokeratin and vimentin intermediate filaments has been immunohistochemically evaluated in 124 benign and malignant sweat gland tumors of various types in comparison to normal swe...

Ranked Document Selection

Authors: Gonzalo Navarro, Sharma V. Thankachan, J. Ian Munro et al.
Year: 2020Source: Theoretical Computer Science

Indexing Highly Repetitive String Collections

Authors: Gonzalo Navarro
Year: 2020Source: arXiv (Cornell University)
Two decades ago, a breakthrough in indexing string collections made it possible to represent them within their compressed space while at the same time offering indexed search functionalities. As this ...

A trustworthy mechanized formalization of R

Authors: Eric Tanter, Tomás Díaz, Martin Bodin et al.
Year: 2020Source: ACM SIGPLAN Notices
The R programming language is very popular for developing statistical software and data analysis, thanks to rich libraries, concise and expressive syntax, and support for interactive programming. Yet,...

Solving Sum-of-Costs Multi-Agent Pathfinding with Answer-Set Programming

Authors: Jorge Baier, Rodrigo Gómez, Carlos Hernández
Year: 2020Source: Proceedings of the AAAI Conference on Artificial Intelligence
Solving a Multi-Agent Pathfinding (MAPF) problem involves finding non-conflicting paths that lead a number of agents to their goal location. In the sum-of-costs variant of MAPF, one is also required t...

Recursive SPARQL for Graph Analytics

Authors: Aidan Hogan, Juan Reutter, Adrián Soto
Year: 2020

Recursive SPARQL for Graph Analytics

Authors: Aidan Hogan, Juan Reutter, Adrián Soto et al.
Year: 2020Source: arXiv (Cornell University)
Work on knowledge graphs and graph-based data management often focus either on declarative graph query languages or on frameworks for graph analytics, where there has been little work in trying to com...

A sketch-aided retrieval approach for incomplete 3D objects

Authors: Benjamín Bustos, Tobias Schreck, Stefan Lengauer et al.
Year: 2020Source: Computers & Graphics
With the growing amount of digital collections of visual CH data being available across different repositories, it becomes increasingly important to provide archaeologists with means to find relations...

Towards Large-scale RoI Indexing for Content-aware Data Discovery

Authors: Marcelo Mendoza, Muez Araya, Rafaela Cáceres et al.
Year: 2020

Social QA in non-CQA platforms

Authors: Denis Parra, Bárbara Poblete, José Miguel Herrera et al.
Year: 2020Source: Future Generation Computer Systems

An Interoperable Repository of Clinical Data

Authors: Marcelo Mendoza, Mauricio Araya, Mauricio Solar et al.
Year: 2020
This article shows an innovation project that aims contributing, from the ICT perspective, to necessities of health sector, specifically in interoperability and generation of information starting from...

Is Data Privacy The Price We Must Pay to Survive a Pandemic?

Authors: Ricardo Baeza-Yates, Cristina Pombo, Natalia González Alarcón et al.
Year: 2020Source: Inter-American Development Bank eBooks

THE STRUCTURE OF POLITICAL CONFLICT: KINSHIP NETWORKS AND POLITICAL ALIGNMENTS IN THE CIVIL WARS OF NINETEENTH-CENTURY CHILE

Authors: Naim Bro
Year: 2020Source: Apollo (University of Cambridge)
Based on a novel database of kinship relations among the political elites of Chile in the nineteenth century, this thesis identifies the impact of family networks on the formation of political faction...

The shapley value of tuples in query answering

Authors: Leopoldo Bertossi, Benny Kimelfeld, Ester Livshits et al.
Year: 2020Source: Logical Methods in Computer Science
We investigate the application of the Shapley value to quantifying the contribution of a tuple to a query answer. The Shapley value is a widely known numerical measure in cooperative game theory and i...

On the expressiveness of Lara: A unified language for linear and relational algebra

Authors: Pablo Barceló, Nelson Higuera, Jorge Pérez et al.
Year: 2020

Learning to Detect Online Harassment on Twitter with the Transformer

Authors: Marcelo Mendoza, Margarita Bugueño
Year: 2020Source: Communications in computer and information science

The Little Prover

Authors: Eric Tanter
Year: 2020

ProNA2020 predicts protein-DNA, protein-RNA, and protein-protein binding proteins and residues from sequence

Authors: Tomás Norambuena, Jiajun Qiu, Michael Bernhofer et al.
Year: 2020

Transforming RDF Data into Property Graphs

Authors: Renzo Angles, Roberto García
Year: 2020

On Dynamic Succinct Graph Representations

Authors: Gonzalo Navarro
Year: 2020

Semantrix: A Compressed Semantic Matrix

Authors: Gonzalo Navarro
Year: 2020

Compressing and randomly accessing sequences (note)

Authors: Diego Arroyuelo, Rajeev Raman, Laith Ali Abdusahib
Year: 2020

Approximating Optimal Bidirectional Macro Schemes

Authors: Gonzalo Navarro
Year: 2020

When is Ontology-Mediated Querying Efficient?

Authors: Pablo Barceló, Andréas Pieris, Carsten Lutz et al.
Year: 2020Source: arXiv (Cornell University)
In ontology-mediated querying, description logic (DL) ontologies are used to enrich incomplete data with domain knowledge which results in more complete answers to queries. However, the evaluation of ...

Optimal Joins Using Compact Data Structures

Authors: Juan Reutter, Gonzalo Navarro, Javiel Rojas-Ledesma
Year: 2020

GSP4PDB: a web tool to visualize, search and explore protein-ligand structural patterns

Authors: Roberto García, Ehmke Pohl, José Antonio Reyes-Suárez et al.
Year: 2020Source: BMC Bioinformatics
Abstract Background In the field of protein engineering and biotechnology, the discovery and characterization of structural patterns is highly relevant as these patterns can give fundamental insights ...

A Family of Centrality Measures for Graph Data Based on Subgraphs

Authors: Cristian Riveros, Jorge Salas
Year: 2020

Current Challenges in Graph Databases (Invited Talk)

Authors: Juan Reutter
Year: 2020

Towards Streaming Evaluation of Queries with Correlation in Complex Event Processing

Authors: Alejandro Grez, Cristian Riveros
Year: 2020

On the Expressiveness of Languages for Complex Event Recognition

Authors: Alejandro Grez, Cristian Riveros, Stijn Vansummeren et al.
Year: 2020

Cryptocurrency mining games with economic discount and decreasing rewards

Authors: Marcelo Arenas, Juan Reutter, Martín Ugarte et al.
Year: 2020

Supplementary Material for the paper: Automatic Document Screening of Medical Literature Using Word and Text Embeddings in an Active Learning Setting

Authors: Denis Parra, Hans Löbel, Álvaro Soto et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
This is the dataset used in the paper: Automatic Document Screening of Medical Literature Using Word and Text Embeddings in an Active Learning Setting. It is composed of: - Pre-trained models using ac...

Supplementary Material for the paper: Automatic Document Screening of Medical Literature Using Word and Text Embeddings in an Active Learning Setting

Authors: Denis Parra, Hans Löbel, Álvaro Soto et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
This is the dataset used in the paper: Automatic Document Screening of Medical Literature Using Word and Text Embeddings in an Active Learning Setting. It is composed of: - Pre-trained models using ac...

Supplementary Material for the paper: Automatic Document Screening of Medical Literature Using Word and Text Embeddings in an Active Learning Setting

Authors: Denis Parra, Hans Löbel, Álvaro Soto et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
This is the dataset used in the paper: Automatic Document Screening of Medical Literature Using Word and Text Embeddings in an Active Learning Setting. It is composed of: - Pre-trained models using ac...

Supplementary Material for the paper: Automatic Document Screening of Medical Literature Using Word and Text Embeddings in an Active Learning Setting

Authors: Denis Parra, Hans Löbel, Álvaro Soto et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
This is the dataset used in the paper: Automatic Document Screening of Medical Literature Using Word and Text Embeddings in an Active Learning Setting. It is composed of: - Pre-trained models using ac...

Supplementary Material for the paper: Automatic Document Screening of Medical Literature Using Word and Text Embeddings in an Active Learning Setting

Authors: Denis Parra, Hans Löbel, Álvaro Soto et al.
Year: 2020Source: Zenodo (CERN European Organization for Nuclear Research)
This is the dataset used in the paper: Automatic Document Screening of Medical Literature Using Word and Text Embeddings in an Active Learning Setting. It is composed of: - Pre-trained models using ac...

JSON: Data model and query languages

Authors: Juan Reutter, Domagoj Vrgoč, Pierre Bourhis et al.
Year: 2020Source: Information Systems

On Dynamic Succinct Graph Representations

Authors: Gonzalo Navarro, Guillermo de Bernardo, Susana Ladra et al.
Year: 2020
We address the problem of representing dynamic graphs using k <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> -trees. The k <sup xmlns:mml="http:...

Approximating Optimal Bidirectional Macro Schemes

Authors: Gonzalo Navarro, Luís M. S.Russo, Alexandre P. Francisco et al.
Year: 2020
Lempel-Ziv is an easy-to-compute member of a wide family of so-called macro schemes; it restricts pointers to go in one direction only. Optimal bidirectional macro schemes are NP-complete to find, but...

Semantrix: A Compressed Semantic Matrix

Authors: Gonzalo Navarro, Nieves R. Brisaboa, Antonio Fariña et al.
Year: 2020
We present a compact data structure to represent both the duration and length of homogeneous segments of trajectories from moving objects in a way that, as a data warehouse, it allows us to efficientl...

Stability and incorporation: Toward a new concept of party system institutionalization

Authors: Rafael Piñeiro Rodríguez, Fernando Rosenblatt
Year: 2020

A Polygenic Risk Score Suggests Shared Genetic Architecture of Voice Break With Early Markers of Pubertal Onset in Boys

Authors: Susana Eyheramendy, Verónica Mericq, Patricio Miranda et al.
Year: 2020Source: The Journal of Clinical Endocrinology & Metabolism
Voice break, as a landmark of advanced male puberty in genome-wide association studies (GWAS), has revealed that pubertal timing is a highly polygenic trait. Although voice break is easily recorded in...

To index or not to index: Time-space trade-offs for positional ranking functions in search engines

Authors: Diego Arroyuelo, Senen González, Mauricio Marín et al.
Year: 2020Source: Information Systems

Compressing and Randomly Accessing Sequences (note)

Authors: Diego Arroyuelo, Rajeev Raman, Laith Ali Abdusahib
Year: 2020
In this paper we consider the problem of storing sequences of symbols in a compressed format, while supporting random access to the symbols without decompression. Although this is a well-studied probl...

An integrated model for textual social media data with spatio-temporal dimensions

Authors: Bárbara Poblete, Felipe Bravo-Márquez, Juglar Díaz et al.
Year: 2020Source: Information Processing & Management
GPS-enabled devices and social media popularity have created an unprecedented opportunity for researchers to collect, explore, and analyze text data with fine-grained spatial and temporal metadata. In...

ResiliNet: Failure-Resilient Inference in Distributed Neural Networks

Authors: Hans Löbel, Brian Nguyen, Guanhua Wang et al.
Year: 2020Source: arXiv (Cornell University)
Federated Learning aims to train distributed deep models without sharing the raw data with the centralized server. Similarly, in distributed inference of neural networks, by partitioning the network a...

Compressed Dynamic Range Majority and Minority Data Structures

Authors: Travis Gagie, Meng He, Gonzalo Navarro
Year: 2020Source: Algorithmica

Dynamic Data Structures for Timed Automata Acceptance

Authors: Cristian Riveros, Alejandro Grez, Filip Mazowiecki et al.
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
Ensuring the correctness of distributed cyber-physical systems can be done at runtime by monitoring properties over their behaviour. In a decentralised setting, such behaviour consists of multiple loc...

The monitoring problem for timed automata

Authors: Alejandro Grez, Cristian Riveros, Filip Mazowiecki et al.
Year: 2020

Importance of the electrolyte cation on the non-covalent interactions in the electrooxidation of 1-heptanol on gold in 0.1 M alkali metal hydroxides

Authors: Claudio Gutiérrez, Francisco J. Fernández‐Álvarez, M.S. Ureta-Zañartu et al.
Year: 2020Source: Materials Chemistry and Physics

The monitoring problem for timed automata

Authors: Cristian Riveros, Alejandro Grez, Filip Mazowiecki et al.
Year: 2020Source: arXiv (Cornell University)
We study a variant of the classical membership problem in automata theory, which consists of deciding whether a given input word is accepted by a given automaton. We do so under a different perspectiv...

Information extraction meets the Semantic Web: A survey

Authors: Aidan Hogan, José Martínez-Rodríguez, Iván Lopez-Arévalo
Year: 2020

Descriptive Complexity for Counting Complexity Classes

Authors: Cristian Riveros, Marcelo Arenas, Martín Muñoz
Year: 2020

Descriptive Complexity for Counting Complexity Classes

Authors: Marcelo Arenas, Martín Muñoz, Cristian Riveros
Year: 2020Source: Logical Methods in Computer Science
Descriptive Complexity has been very successful in characterizing complexity classes of decision problems in terms of the properties definable in some logics. However, descriptive complexity for count...

Efficient Enumeration Algorithms for Regular Document Spanners

Authors: Cristian Riveros, Stijn Vansummeren, Fernando Florenzano et al.
Year: 2020Source: ACM Transactions on Database Systems
Regular expressions and automata models with capture variables are core tools in rule-based information extraction. These formalisms, also called regular document spanners , use regular languages to l...

Semantic Search of Memes on Twitter

Authors: Benjamín Bustos, Magdalena Saldaña, Jesus Perez-Martin
Year: 2020Source: arXiv (Cornell University)
Memes are becoming a useful source of data for analyzing behavior on social media. However, a problem to tackle is how to correctly identify a meme. As the number of memes published every day on socia...

Let's build Bridges, not Walls: SPARQL Querying of TinkerPop Graph Databases with Sparql-Gremlin

Authors: Renzo Angles, Harsh Thakkar, Marko Rodriguez et al.
Year: 2020

The Semantic Web: Two decades on

Authors: Aidan Hogan
Year: 2020

CompactNets: Compact Hierarchical Compositional Networks for Visual Recognition

Authors: Álvaro Soto, Hans Löbel, René Vidal
Year: 2020Source: Computer Vision and Image Understanding

Let's build Bridges, not Walls: SPARQL Querying of TinkerPop Graph Databases with Sparql-Gremlin

Authors: Renzo Angles, Jens Lehmann, Harsh Thakkar et al.
Year: 2020
This article presents sparql-gremlin, a tool to translate SPARQL queries to Gremlin pattern matching traversals. Currently, sparql-gremlin is a plugin of the Apache TinkerPop graph computing framework...

The Semantic Web: Two decades on

Authors: Aidan Hogan
Year: 2020Source: Semantic Web
More than two decades have passed since the establishment of the initial cornerstones of the Semantic Web. Since its inception, opinions have remained divided regarding the past, present and potential...

A mechanized formalization of GraphQL

Authors: Eric Tanter, Federico Olmedo, Tomás Díaz et al.
Year: 2020
International audience

The 2(k) Neighborhoods for Grid Path Planning

Authors: Jorge Baier, Nicolás Rivera, Carlos Hernández et al.
Year: 2020Source: Journal of Artificial Intelligence Research
Grid path planning is an important problem in AI. Its understanding has been key for the development of autonomous navigation systems. An interesting and rather surprising fact about the vast literatu...

Fully Functional Suffix Trees and Optimal Text Searching in BWT-Runs Bounded Space

Authors: Gonzalo Navarro, Travis Gagie, Nicola Prezza
Year: 2020Source: Journal of the ACM
Indexing highly repetitive texts—such as genomic databases, software repositories and versioned text collections—has become an important problem since the turn of the millennium. A relevant compre...

Computation over compressed data

Authors: Gonzalo Navarro, Travis Gagie
Year: 2020Source: Information and Computation

Detection of Suicidal Ideation on Social Media: Multimodal, Relational, and Behavioral Analysis (Preprint)

Authors: Ricardo Baeza-Yates, Diego Alejandro Velazquez, Josep M. Gonfaus et al.
Year: 2020
<sec> <title>BACKGROUND</title> Suicide risk assessment usually involves an interaction between doctors and patients. However, a significant number of people with mental disorders receive no treatment...

From access deprivation to skill acquisition: Cluster analysis of user behavior in face of a 12-hour legal blockage of WhatsApp in Brazil.

Authors: Magdalena Saldaña, Andrés Rosenberg, Marcelo Santos
Year: 2020Source: First Monday
This study takes advantage of a forceful legal 12-hour deprivation of access to WhatsApp messaging service nationwide in Brazil on 18 December 2015. Right after the blockage, we ran a survey to captur...

"IMFD IMPRESEE at TRECVID 2020: Description Generation by Visual-Syntactic Embedding"

Authors: Jorge Pérez, Benjamín Bustos, Juan Manuel Barrios et al.
Year: 2020

Extending General Compact Querieable Representations to GIS Applications

Authors: Nieves Brisaboa, Gonzalo Navarro, Guillermo de Bernardo et al.
Year: 2020Source: Information Sciences

Plan estratégico de la logística urban-portuaria. Mejores ciudades portuarias en elÁrea Metropolitana de Concepción.

Authors: Sergio Toro-Maureira, Mabel Alarcón, Violeta Montero et al.
Year: 2020

Spanish pre-trained BERT model and evaluation data

Authors: Jorge Pérez, José Cañete, Gabriel Chaperon et al.
Year: 2020

Translating navigation instructions in natural language to a high-level plan for behavioral robot navigation

Authors: Álvaro Soto, Juan Carlos Niebles, Xiaoxue Zang et al.
Year: 2020

Towards a Definitive Measure of Repetitiveness

Authors: Gonzalo Navarro, Nicola Prezza, Tomasz Kociumaka
Year: 2020Source: Lecture notes in computer science

Practical Random Access to SLP-Compressed Texts

Authors: Gonzalo Navarro, Travis Gagie, Giovanni Manzini et al.
Year: 2020Source: Lecture notes in computer science
Grammar-based compression is a popular and powerful approach to compressing repetitive texts but until recently its relatively poor time-space trade-offs during real-life construction made it impracti...

Contextual Pattern Matching

Authors: Gonzalo Navarro
Year: 2020Source: Lecture notes in computer science
The research on indexing repetitive string collections has focused on the same search problems used for regular string collections, though they can make little sense in this scenario. For example, the...

On the Expressiveness of Languages for Complex Event Recognition

Authors: Cristian Riveros, Martín Ugarte, Stijn Vansummeren et al.
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
Complex Event Recognition (CER for short) has recently gained attention as a mechanism for detecting patterns in streams of continuously arriving event data. Numerous CER systems and languages have be...

A grammar compressor for collections of reads with applications to the construction of the BWT

Authors: Gonzalo Navarro, Diego Díaz-Domínguez
Year: 2020Source: arXiv (Cornell University)
We describe a grammar for DNA sequencing reads from which we can compute the BWT directly. Our motivation is to perform in succinct space genomic analyses that require complex string queries not yet s...

PHONI: Streamed Matching Statistics with Multi-Genome References

Authors: Gonzalo Navarro, I Tomohiro, Alejandro Gustavo Vigo Pacheco et al.
Year: 2020Source: arXiv (Cornell University)
Computing the matching statistics of patterns with respect to a text is a fundamental task in bioinformatics, but a formidable one when the text is a highly compressed genomic database. Bannai et al. ...

Towards Streaming Evaluation of Queries with Correlation in Complex Event Processing

Authors: Cristian Riveros, Alejandro Grez
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
Complex event processing (CEP) has gained a lot of attention for evaluating complex patterns over high-throughput data streams. Recently, new algorithms for the evaluation of CEP patterns have emerged...

Abstracting Gradual References (SCICO Journal-first)

Authors: Matías Toro, Eric Tanter, Éric Tanter
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
Gradual typing is an effective approach to integrate static and dynamic typing, which supports the smooth transition between both extremes via the (programmer-controlled) precision of type annotations...

Text Indexing and Searching in Sublinear Time

Authors: J. Ian Munro, Yakov Nekrich, Gonzalo Navarro
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
We introduce the first index that can be built in o(n) time for a text of length n, and can also be queried in o(q) time for a pattern of length q. On an alphabet of size σ, our index uses O(n log σ...

Optimal Joins Using Compact Data Structures

Authors: Javiel Rojas-Ledesma, Gonzalo Navarro, Juan L. Reutter
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
Worst-case optimal join algorithms have gained a lot of attention in the database literature. We now count with several algorithms that are optimal in the worst case, and many of them have been implem...

Current Challenges in Graph Databases (Invited Talk)

Authors: Juan Reutter, Juan L. Reutter
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
As graph databases grow in popularity, decades of work in graph query languages and models are materialising in industry standards and in the construction of new graph database systems. However, this ...

Expressive power of linear algebra query languages

Authors: Cristian Riveros, Domagoj Vrgoč, Thomas Muñoz et al.
Year: 2020Source: arXiv (Cornell University)
Linear algebra algorithms often require some sort of iteration or recursion as is illustrated by standard algorithms for Gaussian elimination, matrix inversion, and transitive closure. A key character...

When is Approximate Counting for Conjunctive Queries Tractable?

Authors: Cristian Riveros, Marcelo Arenas, Rajesh Jayaram et al.
Year: 2020Source: arXiv (Cornell University)
Conjunctive queries are one of the most common class of queries used in database systems, and the best studied in the literature. A seminal result of Grohe, Schwentick, and Segoufin (STOC 2001) demons...

On the Expressiveness of LARA: A Unified Language for Linear and Relational Algebra

Authors: Pablo Barceló, Jorge Pérez, Nelson Higuera et al.
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
We study the expressive power of the Lara language - a recently proposed unified model for expressing relational and linear algebra operations - both in terms of traditional database query languages a...

Recommendations to Handle Health-related Small Imbalanced Data in Machine Learning

Authors: Ricardo Baeza‐Yates, Maria Rauschenberger
Year: 2020Source: Gesellschaft für Informatik (GI)
When discussing interpretable machine learning results, researchers need to compare results and reflect on reliable results, especially for health-related data. The reason is the negative impact of wr...

Optimal Joins Using Compact Data Structures

Authors: Juan Reutter, Gonzalo Navarro, Javiel Rojas-Ledesma et al.
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
Worst-case optimal join algorithms have gained a lot of attention in the database literature. We now count with several algorithms that are optimal in the worst case, and many of them have been implem...

On the Expressiveness of LARA: A Unified Language for Linear and Relational Algebra

Authors: Jorge Pérez, Nelson Higuera, Pablo Barceló et al.
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
We study the expressive power of the Lara language - a recently proposed unified model for expressing relational and linear algebra operations - both in terms of traditional database query languages a...

Current Challenges in Graph Databases (Invited Talk)

Authors: Juan L. Reutter
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
As graph databases grow in popularity, decades of work in graph query languages and models are materialising in industry standards and in the construction of new graph database systems. However, this ...

Think the Vote: Information Processing, Selective Exposure to Social Media, and Support for Trump and Clinton

Authors: Magdalena Saldaña, Thomas J. Johnson, Barbara K. Kaye
Year: 2020

First-Order Rewritability of Frontier-Guarded Ontology-Mediated Queries

Authors: Pablo Barceló, Gerald Berger, Andreas Pieris et al.
Year: 2020
We focus on ontology-mediated queries (OMQs) based on (frontier-)guarded existential rules and (unions of) conjunctive queries, and we investigate the problem of FO-rewritability, i.e., whether an OMQ...

For Learners, with Learners: Identifying Indicators for an Academic Advising Dashboard for Students

Authors: Mar Pérez‐Sanagustín, Julio Guerra, Valeria Henríquez et al.
Year: 2020Source: Lecture notes in computer science

Multipath Adaptive A*: Factors That Influence Performance in Goal-Directed Navigation in Unknown Terrain

Authors: Jorge Baier, Carlos Hernández, Roberto Asín‐Achá
Year: 2020Source: IEEE Access
Incremental heuristic search algorithms are a class of heuristic search algorithms applicable to the problem of goal-directed navigation. D* and D*Lite are among the most well-known algorithms for thi...

Contextual Pattern Matching

Authors: Gonzalo Navarro
Year: 2020Source: arXiv (Cornell University)
The research on indexing repetitive string collections has focused on the same search problems used for regular string collections, though they can make little sense in this scenario. For example, the...

Approximating Optimal Bidirectional Macro Schemes

Authors: Gonzalo Navarro, Luís M. S.Russo, Alexandre P. Francisco et al.
Year: 2020Source: arXiv (Cornell University)
Lempel-Ziv is an easy-to-compute member of a wide family of so-called macro schemes; it restricts pointers to go in one direction only. Optimal bidirectional macro schemes are NP-complete to find, but...

Semantrix: A Compressed Semantic Matrix

Authors: Gonzalo Navarro, Nieves R. Brisaboa, Antonio Fariña et al.
Year: 2020Source: arXiv (Cornell University)
We present a compact data structure to represent both the duration and length of homogeneous segments of trajectories from moving objects in a way that, as a data warehouse, it allows us to efficientl...

Grammar-Compressed Indexes with Logarithmic Search Time

Authors: Gonzalo Navarro, Francisco Claude, Alejandro Gustavo Vigo Pacheco
Year: 2020Source: arXiv (Cornell University)
Let a text $T[1..n]$ be the only string generated by a context-free grammar with $g$ (terminal and nonterminal) symbols, and of size $G$ (measured as the sum of the lengths of the right-hand sides of ...

PFP Data Structures

Authors: Gonzalo Navarro, Travis Gagie, Giovanni Manzini et al.
Year: 2020Source: arXiv (Cornell University)
Prefix-free parsing (PFP) was introduced by Boucher et al. (2019) as a preprocessing step to ease the computation of Burrows-Wheeler Transforms (BWTs) of genomic databases. Given a string $S$, it prod...

Grammar Compression By Induced Suffix Sorting

Authors: Gonzalo Navarro, Simon Gog, Maurício Ayala-Rincón et al.
Year: 2020Source: arXiv (Cornell University)
A grammar compression algorithm, called GCIS, is introduced in this work. GCIS is based on the induced suffix sorting algorithm SAIS, presented by Nong et al. in 2009. The proposed solution builds on ...

The Chilean Waiting List Corpus: a new resource for clinical Named Entity Recognition in Spanish

Authors: Jocelyn Dunstan, Fabián Villena, Pablo Báez et al.
Year: 2020
In this work we describe the Waiting List Corpus consisting of de-identified referrals for several specialty consultations from the waiting list in Chilean public hospitals. A subset of 900 referrals ...

On the Construction of Multilingual Corpora for Clinical Text Mining

Authors: Jocelyn Dunstan, Fabián Villena, Matthias Ganzinger et al.
Year: 2020Source: Studies in health technology and informatics
The amount of digital data derived from healthcare processes have increased tremendously in the last years. This applies especially to unstructured data, which are often hard to analyze due to the lac...

Review of “The Little Prover” by Daniel P. Friedman and Carl Eastlund, MIT Press, 2015

Authors: Eric Tanter
Year: 2020Source: Journal of Functional Programming
An abstract is not available for this content so a preview has been provided. As you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

Cryptocurrency Mining Games with Economic Discount and Decreasing Rewards.

Authors: Marcelo Arenas, Juan Reutter, Domagoj Vrgoč et al.
Year: 2020Source: Symposium on Theoretical Aspects of Computer Science
In the consensus protocols used in most cryptocurrencies, participants called miners must find valid blocks of transactions and append them to a shared tree-like data structure. Ideally, the rules of ...

The Shapley Value of Tuples in Query Answering.

Authors: Benny Kimelfeld, Moshe Sebag, Ester Livshits et al.
Year: 2020Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
We investigate the application of the Shapley value to quantifying the contribution of a tuple to a query answer. The Shapley value is a widely known numerical measure in cooperative game theory and i...

Score-Based Explanations in Data Management and Machine Learning

Authors: Leopoldo Bertossi
Year: 2020Source: arXiv (Cornell University)
We describe some approaches to explanations for observed outcomes in data management and machine learning. They are based on the assignment of numerical scores to predefined and potentially relevant i...

Causality-based Explanation of Classification Outcomes

Authors: Leopoldo Bertossi, Zografoula Vagena, Maximilian Schleich et al.
Year: 2020Source: arXiv (Cornell University)
We propose a simple definition of an explanation for the outcome of a classifier based on concepts from causality. We compare it with previously proposed notions of explanation, and study their comple...

DCC-Uchile at SemEval-2020 Task 1: Temporal Referencing Word Embeddings

Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina
Year: 2020
We present a system for the task of unsupervised lexical change detection. Given a target word and two corpora spanning different periods of time, automatically detects whether the word has lost or ga...

codebook of_returning_homestyles_ATL.pdf

Authors: Sergio Toro, Juan Pablo Luna, Daniel Alcatruz
Year: 2020Source: Harvard Dataverse
codebook

paper_deployment_V26-04-2020.do

Authors: Sergio Toro, Juan Pablo Luna, Daniel Alcatruz
Year: 2020Source: Harvard Dataverse
dofile

Replication Data for: Returning Homestyles to congressional politics

Authors: Sergio Toro, Juan Pablo Luna, Daniel Alcatruz
Year: 2020Source: Harvard Dataverse
This dataset contains classification the activities of 50 incumbent congressmembers in 2016 who were elected under the old electoral formula and ran for re-election in the post-reform race of 2017. Th...

df_deployment_deputies.tab

Authors: Sergio Toro, Juan Pablo Luna, Daniel Alcatruz
Year: 2020Source: Harvard Dataverse
:unav

The Complexity of Counting Problems over Incomplete Databases

Authors: Pablo Barceló, Marcelo Arenas, Mikaël Monet
Year: 2020Source: arXiv (Cornell University)
We study the complexity of various fundamental counting problems that arise in the context of incomplete databases, i.e., relational databases that can contain unknown values in the form of labeled nu...

Mapping RDF Databases to Property Graph Databases

Authors: Renzo Angles, Dominik Tomaszuk, Harsh Thakkar
Year: 2020Source: IEEE Access
RDF triplestores and property graph databases are two approaches for data management which are based on modeling, storing, and querying graph-like data. In spite of such common principles, they presen...

The LDBC Social Network Benchmark

Authors: Renzo Angles, Marcus Paradies, Peter Boncz et al.
Year: 2020Source: arXiv (Cornell University)
The Linked Data Benchmark Council's Social Network Benchmark (LDBC SNB) is an effort intended to test various functionalities of systems used for graph-like data management. For this, LDBC SNB uses th...

Anonymity and Asynchronicity as Key Design Dimensions for the Reciprocity of Online Democratic Deliberation

Authors: Claudio Gutiérrez, Leandro De Brasi
Year: 2020Source: International Journal of Applied Philosophy
The aim of this paper is to identify, given certain democratic normative standards regarding deliberation, some pros as well as cons of possible online deliberation designs due to variations in two ke...

An ASP-Based Approach to Counterfactual Explanations for Classification

Authors: Leopoldo Bertossi
Year: 2020Source: arXiv (Cornell University)
We propose answer-set programs that specify and compute counterfactual interventions as a basis for causality-based explanations to decisions produced by classification models. They can be applied wit...

Web of Data

Authors: Aidan Hogan
Year: 2020Source: Springer eBooks

Efficient GPU Thread Mapping on Embedded 2D Fractals

Authors: Benjamín Bustos, Raimundo Vega, Felipe A. Quezada et al.
Year: 2020Source: arXiv (Cornell University)
This work proposes a new approach for mapping GPU threads onto a family of discrete embedded 2D fractals. A block-space map $\lambda: \mathbb{Z}_{\mathbb{E}}^{2} \mapsto \mathbb{Z}_{\mathbb{F}}^{2}$ i...

A Family of Centrality Measures for Graph Data Based on Subgraphs

Authors: Cristian Riveros, Jorge Salas
Year: 2020Source: International Conference on Database Theory
We present the theoretical foundations of a new approach in centrality measures for graph data. The main principle of our approach is very simple: the more around a vertex, the more central it is in...

R3MAT: A Rapid and Robust Graph Generator

Authors: Renzo Angles, Roberto García, Rodrígo Paredes
Year: 2020Source: IEEE Access
One of the main problems when developing graph-based applications is the availability of large and representative datasets. The lack of real graphs has motivated the development of software tools for ...

PGO: Describing Property Graphs in RDF

Authors: Renzo Angles, Dominik Tomaszuk, Harsh Thakkar
Year: 2020Source: IEEE Access
RDF and Property Graphs are data models that are being used to represent Knowledge Graphs. The definition of methods to transform RDF data into Property graph data is fundamental to allow interoperabi...

Transforming RDF Data into Property Graphs

Authors: Renzo Angles, Roberto García
Year: 2020Source: IEEE Latin America Transactions
RDF databases and graph databases are two approaches of data management which are based on modeling, storing and querying data following a graph structure. RDF databases are based on a single graph da...

rdf2pg experimental datasets

Authors: Renzo Angles, Dominik Tomaszuk, Harsh Thakkar
Year: 2020Source: Figshare
RDF Datasets used in the rdf2pg paper, and the corresponding YARS-PG files.

rdf2pg experimental datasets

Authors: Renzo Angles, Dominik Tomaszuk, Harsh Thakkar
Year: 2020Source: Figshare
RDF Datasets used in the rdf2pg paper, and the corresponding YARS-PG files.

rdf2pg experimental datasets

Authors: Renzo Angles
Year: 2020Source: Figshare
RDF Datasets used in the rdf2pg paper, and the corresponding YARS-PG files.

What Kind of Content Are You Prone to Tweet? Multi-topic Preference Model for Tweeters

Authors: Ricardo Baeza-Yates, Lorena Recalde
Year: 2020Source: Communications in computer and information science

Pre-indexing Pruning Strategies

Authors: Ricardo Baeza-Yates, B. Barla Cambazoğlu, Soner Altin
Year: 2020Source: Lecture notes in computer science

Enhanced Word Embeddings for Anorexia Nervosa Detection on Social Media

Authors: Ricardo Baeza-Yates, Diana Ramírez‐Cifuentes, Ana Freire et al.
Year: 2020Source: Lecture notes in computer science
Comunicació presentada a: The18th International Symposium on Intelligent Data Analysis, IDA 2020, celebrat del 27 al 29 d'abril de 2020 a Konstanz, Alemanya.

Adaptive Community Search in Dynamic Networks

Authors: Ricardo Baeza-Yates, Francesco Bonchi, Ioanna Tsalouchidou
Year: 2020Source: arXiv (Cornell University)
Community search is a well-studied problem which, given a static graph and a query set of vertices, requires to find a cohesive (or dense) subgraph containing the query vertices. In this paper we stud...

Análisis de la secuencia de un aislamiento de coronavirus Covid-19

Authors: Sebastián Valenzuela, Daniela Costa
Year: 2020Source: OSTI OAI (U.S. Department of Energy Office of Scientific and Technical Information)

Universidades asociadas