Publications
Publications for 2025
Displaying 154 publication(s) for 2025
Compiling Gradual Types with Evidence
Authors: Matías Toro, Éric Tanter, José Luis Romero et al.
Year: 2025
Source: arXiv (Cornell University)
Efficiently supporting sound gradual typing in a language with structural types is challenging. To date, the Grift compiler is the only close-to-the-metal implementation of gradual typing in this sett...
Criminalidad y Democracia en América Latina
Authors: Juan Pablo Luna, A. Feldmann
Year: 2025
In the last decade, organized crime has ceased to be a localized phenomenon and has become a structural threat to the democracies of Latin America. Diversified criminal networks and...
Joint model shows association of Mapuche genetic ancestry and longitudinal BMI with early menarche
Authors: Danilo Alvares, S. Eyheramendy, Lucas Vicuña et al.
Year: 2025
The age at puberty onset varies greatly between individuals and ethnic populations, with significant health implications. Early menarche increases risk for breast cancer, cardiovascular disea...
Anesthésie dans la chirurgie de De Quervain : les avantages de la technique WALANT
Authors: Claudio Gutiérrez, Nicole Mercier Rodríguez
Year: 2025
Source: Hand surgery & rehabilitation
Identifying a novel Mecp2-mediated epigenetic mechanism controlling Lonp1 in the hippocampus and its disruption by aging
Authors: Alejandra Loyola, Karina A Cicali, Jesús Llanquinao-Sandoval et al.
Year: 2025
Source: Scientific Reports
New compressed indices for multijoins on graph databases
Authors: Diego Arroyuelo, Adrián Gómez‐Brandón, Gonzalo Navarro
Year: 2025
Source: Information Systems
Active Learning of Symbolic Automata Over Rational Numbers
Authors: Cristian Riveros, Sebastian Hagedorn, Martín Muñoz et al.
Year: 2025
Source: arXiv (Cornell University)
Automata learning has many applications in artificial intelligence and software engineering. Central to these applications is the $L^*$ algorithm, introduced by Angluin. The $L^*$ algorithm learns det...
Simulating conversations on social media with generative agent-based models
Authors: Marcelo Mendoza, Andrés Carvallo, Eliana Providel et al.
Year: 2025
Source: EPJ Data Science
Large Language Models (LLMs) can generate realistic text resembling human-produced content. However, the ability of these models to simulate conversations on social media is still less explored. To in...
Query Answering Under Volume-Based Diversity Functions
Authors: Cristian Riveros, Marcelo Arenas, Reinhard Pichler et al.
Year: 2025
Source: Proceedings of the ACM on Management of Data
When query evaluation produces too many tuples, a new approach in query answering is to retrieve a diverse subset of them. The standard approach for measuring the diversity of a set of tuples is to us...
Full Waveform Inversion via Optimal Transport with Sign-Sensitive Signal Decomposition
Authors: Juan Pablo Luna
Year: 2025
We developed a theoretical framework that encompasses a broad family of misfit functions between real and simulated seismogram data, including well-known examples such as the least-squares criterion. ...
User Perception of Attention Visualizations: Effects on Interpretability Across Evidence-Based Medical Documents
Authors: Vladimir Araujo, Andrés Carvallo, Hernan Valdivieso et al.
Year: 2025
Source: Lecture notes in computer science
Perceptual Evaluation of GANs and Diffusion Models for Generating X-Rays
Authors: Cecilia Besa, Denis Parra, Gregory Schuit
Year: 2025
Source: Lecture notes in computer science
Graph Querying or Similarity Search? Both!
Authors: Vicente Calisto, Gonzalo Navarro, Sebastián Ferrada et al.
Year: 2025
Source: Lecture notes in computer science
Striving for excellence is striving for diversity
Authors: Magdalena Saldaña, Edson C. Tandoc, Kristy Hess et al.
Year: 2025
There is rising momentum within the fields of communication, media studies and (digital) journalism studies to enhance diversity of scholarship, away from a Western-centric gaze, and to be more inclus...
Evaluating GPT-4o in high-stakes medical assessments: performance and error analysis on a Chilean anesthesiology exam
Authors: Marcelo Mendoza, Andrés Neyem, Fernando Altermatt et al.
Year: 2025
Source: BMC Medical Education
Background: Large language models (LLMs) such as GPT-4o have the potential to transform clinical decision-making, patient education, and medical research. Despite impressive performance in gen...
Using large language models for survey research in communication: opportunities and challenges
Authors: Stephan Winter, Sebastián Rivera, Sebastián Valenzuela
Year: 2025
Source: Communication and Change
Artificial intelligence (AI) is transforming survey research, offering powerful tools like large language models (LLMs) to analyze human beliefs, opinions, and behaviors. As researchers incre...
Can Large Language Models Compete with Specialized Models in Lexical Semantic Change Detection?
Authors: Nikolay Arefyev, Felipe Bravo-Márquez, Frank D. Zamora-Reina et al.
Year: 2025
Source: Frontiers in artificial intelligence and applications
In this paper, we present a comprehensive comparison between specialized Lexical Semantic Change Detection (LSCD) models and Large Language Models (LLMs) for the LSCD task. In addition to comparing mo...
An empirical study of the effect of video encoders on Temporal Video Grounding
Authors: Edison Marrese-Taylor, Cristian Rodríguez-Opazo, Felipe Bravo-Márquez et al.
Year: 2025
Source: arXiv (Cornell University)
Temporal video grounding is a fundamental task in computer vision, aiming to localize a natural language query in a long, untrimmed video. It has a key role in the scientific community, in part due to...
B-Call: integrating ideological position and voting cohesion in legislative behavior
Authors: Juan Reutter, Sergio Toro, Daniel Alcatruz et al.
Year: 2025
Source: Frontiers in Political Science
This paper addresses two central dimensions of legislative behavior: ideological position and voting cohesion. Although both approaches have been widely used to analyze legislative behavior, no unifie...
Flexible and Expressive Typed Path Patterns for GQL
Authors: Manuel Rigger, Matías Toro, Wenjia Ye et al.
Year: 2025
Source: Proceedings of the ACM on Programming Languages
Graph databases have become an important data management technology across various domains, including biology, sociology, industry (e.g. fraud detection, supply chain management, financial services), ...
Incremental Certified Programming
Authors: Éric Tanter, Kenji Maillard, Nicolas Tabareau et al.
Year: 2025
Source: Proceedings of the ACM on Programming Languages
Certified programming, as carried out in proof assistants and dependently-typed programming languages, ensures that a software meets its requirements by supporting the definition of both specification...
CompactLTJ: Space & Time Efficient Leapfrog Triejoin on Graph Databases
Authors: Domagoj Vrgoč, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2025
Source: The VLDB Journal
Leapfrog Triejoin (LTJ) is arguably the most practical and popular worst-case-optimal (wco) algorithm for solving basic graph patterns in graph databases. Its main drawback is that it needs t...
Human Response to Decision Support in Face Matching: The Influence of Task Difficulty and Machine Accuracy
Authors: Ricardo Baeza-Yates, Carlos Castillo, Marina Estévez-Almenzar
Year: 2025
Source: Frontiers in artificial intelligence and applications
Decision support systems enhanced by Artificial Intelligence (AI) are increasingly being used in high-stakes scenarios where errors or biased outcomes can have significant consequences. In this work, ...
Smallest Suffixient Sets as a Repetitiveness Measure
Authors: Gonzalo Navarro, Cristian Urbina, Giuseppe Romana
Year: 2025
Source: Lecture notes in computer science
Cache-Friendly Compressed Boolean Matrices
Authors: Gonzalo Navarro, Adrián Gómez-Brandón, Antonio Fariña et al.
Year: 2025
Source: Lecture notes in computer science
Query Answering under Volume-Based Diversity Functions
Authors: Cristian Riveros, Marcelo Arenas, Reinhard Pichler et al.
Year: 2025
Source: arXiv (Cornell University)
When query evaluation produces too many tuples, a new approach in query answering is to retrieve a diverse subset of them. The standard approach for measuring the diversity of a set of tuples is to us...
My Private–Public Sphere: Women’s Information Strategies in Times of News Mistrust
Authors: Magdalena Saldaña, Isabel Pavez, Claudia Lagos Lira et al.
Year: 2025
Source: Journalism & Mass Communication Quarterly
Problematic information, such as mis- and disinformation, circulating in fragmented news ecosystems, has contributed to mistrust and information fatigue. Using survey data (N = 2,117) and two focus g...
The Missing Link: Identifying Digital Intermediaries in E‐Government
Authors: Sergio Toro, Sebastián Valenzuela, Teresa Correa et al.
Year: 2025
Source: Public Administration Review
Shrec 2025: Partial Retrieval Benchmark
Authors: Bart Iver van Blokland, Ivan Sipiran, Benjamín Bustos et al.
Year: 2025
Source: Computers & Graphics
Partial retrieval is a long-standing problem in the 3D Object Retrieval community. Its main difficulties arise from how to define 3D local descriptors in a way that makes them effective for partial re...
Human-AI Coevolution (Abstract Reprint)
Authors: Ricardo Baeza-Yates, Dino Pedreschi, Alistair Knott et al.
Year: 2025
Human-AI coevolution, defined as a process in which humans and AI algorithms continuously influence each other, increasingly characterises our society, but is understudied in artificial intelligence a...
Artificial Intelligence and Peacebuilding: Opportunities and Challenges
Authors: Sebastián Valenzuela, Philip N. Howard, Fredrick Ogenga et al.
Year: 2025
A high-level précis of this Technical Paper can be found in the Summary for Policymakers report, Artificial Intelligence for Peacebuilding: Promises and Pitfalls. Artificial intelligence (AI) is rapi...
A Uniform Language for Safety, Robustness and Explainability
Authors: Pablo Barceló, Vaishak Belle
Year: 2025
Source: Lecture notes in computer science
Personalized MRI-based characterization of subcortical anomalies in Ataxia-Telangiectasia using deep-learning
Authors: Denis Parra, Robert A. Dineen, Cristian Salazar-Vilches et al.
Year: 2025
Source: PLoS ONE
Background: Cerebellar atrophy is a known feature of ataxia-telangiectasia (A-T). However, basal ganglia dysfunction contributing to extrapyramidal movement disorders in A-T remains understudied. Objec...
Slicing of Probabilistic Programs: A Review of Existing Approaches
Authors: Federico Olmedo
Year: 2025
Source: ACM Computing Surveys
Program slicing aims to simplify programs by identifying and removing non-essential parts while preserving program behavior. It is widely used for program understanding, debugging, and software mainte...
Engineering rank/select data structures for large-alphabet strings
Authors: Diego Arroyuelo, Erick Sepúlveda, Francisco Riveros et al.
Year: 2025
Source: The Computer Journal
Large-alphabet strings, prevalent in information retrieval and natural language processing, pose unique storage and processing challenges. This paper explores the efficient implementation of ...
Introduction to the Special Issue on Temporal Web: Studying Time and the Temporal Dimension
Authors: Ricardo Baeza-Yates, Omar Alonso, Marc Spaniol
Year: 2025
Source: ACM Transactions on the Web
On Computing Probabilistic Explanations for Decision Trees
Authors: Pablo Barceló, Marcelo Arenas, Bernardo Subercaseaux et al.
Year: 2025
Source: Journal of Artificial Intelligence Research
Formal XAI (explainable AI) is a growing area that focuses on computing explanations with mathematical guarantees for the decisions made by ML models. Inside formal XAI, one of the most studied cases ...
WIP: Does this Course Need a Well-being Teaching Assistant?
Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo et al.
Year: 2025
Fast and Small Subsampled R-indexes
Authors: Gonzalo Navarro, Travis Gagie, Dustin Cobas
Year: 2025
Source: ACM Transactions on Algorithms
The $r$-index (Gagie et al., JACM 2020) represented a breakthrough in compressed indexing of repetitive text collections, outperforming its alternatives by orders of magnitude in query time. Its sp...
Sex differences in work-related accidents extracted from free text in Spanish using natural language processing
Authors: Jocelyn Dunstan, Víctor Rocco, Daniela Moyano et al.
Year: 2025
Source: BMC Public Health
By sharing our prompts and code, we aim to help other institutions and countries extract crucial information from free text to a controlled vocabulary of ILO. Future work includes the analysis of comm...
Uncovering the Hidden Biases in Personal Informatics
Authors: Ricardo Baeza-Yates, Athena Vakali, Pavlos Sermpezis et al.
Year: 2025
Source: GetMobile Mobile Computing and Communications
Personal Informatics (PI) systems, such as apps and wearables that help users track physical activity, sleep, heart rate, or stress, have become critical tools for self-monitoring and health research....
Perceptual Evaluation of GANs and Diffusion Models for Generating X-rays
Authors: Cecilia Besa, Denis Parra, Gregory Schuit
Year: 2025
Source: arXiv (Cornell University)
Generative image models have achieved remarkable progress in both natural and medical imaging. In the medical context, these techniques offer a potential solution to data scarcity, especially for low-p...
Corrections to “On the data complexity of consistent query answering over graph databases [Journal of Computer and System Sciences 88 (2017) 164–194]”
Authors: Pablo Barceló, Gaëlle Fontaine, Sophie Tison et al.
Year: 2025
Source: Journal of Computer and System Sciences
Robust Dynamic Embedding for Gradual Typing
Authors: Matías Toro, Éric Tanter, Nicolas Tabareau et al.
Year: 2025
Source: Proceedings of the ACM on Programming Languages
Gradual typing has long been advocated as a means to bridge the gap between static and dynamic typing disciplines, enabling a range of use cases such as the gradual migration of existing dynamically t...
Are Your Fairness Metrics Accurate? A Semi-Supervised Approach to Improving Fairness Estimates Under Sample Selection Bias
Authors: Ricardo Baeza-Yates, M. Clara De Paolis Kaluza, Shantanu Jain et al.
Year: 2025
CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray
Authors: Mingquan Lin, Hao Chen, Adam E. Flanders et al.
Year: 2025
Source: Medical Image Analysis
ChatGPT as a Stable and Fair Tool for Automated Essay Scoring
Authors: Marcelo Mendoza, Miguel Nussbaum, Zvi Bekerman et al.
Year: 2025
Source: Education Sciences
The evaluation of open-ended questions is typically performed by human instructors using predefined criteria to uphold academic standards. However, manual grading presents challenges, including high c...
(Worst-case) Optimal Adaptive Dynamic Bitvectors
Authors: Gonzalo Navarro
Year: 2025
Source: Theory of Computing Systems
Public Knowledge and Expertise Under Authoritarian Siege: A Defense of Academic Freedom from Digital Journalism Studies
Authors: Magdalena Saldaña, Ramón Salaverría, Oscar Westlund et al.
Year: 2025
Source: Digital Journalism
This article addresses the growing global assault on academic freedom—a cornerstone of democratic societies now under increasing threat from authoritarian regimes. It highlights a global decline in ...
Reducing urban speed limits decreases work-related traffic injury severity: Evidence from Santiago, Chile
Authors: Matías Toro, Eduardo Graells-Garrido, Gabriel Mansilla et al.
Year: 2025
Source: Travel Behaviour and Society
Cross-Lingual Cross-Domain Transfer Learning for Rumor Detection
Authors: Marcelo Mendoza, Mauricio Solar, Eliana Providel
Year: 2025
Source: Future Internet
This study introduces a novel method that merges propagation-based transfer learning with word embeddings for rumor detection. This approach aims to use data from languages with abundant resources to ...
Querying Graph Data: Where We Are and Where To Go
Authors: Domagoj Vrgoč, Leonid Libkin, Wim Martens et al.
Year: 2025
Although graph query languages such as Cypher, SQL/PGQ, and GQL take inspiration from theoretical languages such as conjunctive regular path queries (CRPQs), their pattern matching facilities are sign...
Rel: A Programming Language for Relational Data
Authors: Liat Peterfreund, George Kastrinis, Molham Aref et al.
Year: 2025
From the moment of their inception, languages for relational data have been described as sublanguages embedded in a host programming language. Rel is a new relational language whose key design goal is...
CORE+: A Complex Event Recognition Engine in C++
Authors: Cristian Riveros, Stijn Vansummeren, Vicente Calisto et al.
Year: 2025
Complex Event Recognition (CER) refers to the activity of analyzing streams of continuously arriving event data, to recognize collections of events that satisfy user-defined patterns. CER is known to ...
Editorial
Authors: Bárbara Poblete, Makoto P. Kato, H. Liu et al.
Year: 2025
Source: Information Retrieval Research
This editorial celebrates the first issue of the Information Retrieval Research Journal, IRRJ.
Using publicly available data for predicting socioeconomic values in urban context
Authors: Juan Reutter, Mario Miguel Ojeda, Juan L. Reutter
Year: 2025
Source: Computational Urban Science
Urban transportation networks are recognized for their pivotal role in forecasting city indicators and facilitating efficient planning and management. However, despite the increase of methodo...
Gradual Sensitivity Typing
Authors: Matías Toro, Éric Tanter et al.
Year: 2025
A Systematic Review of User-Centred Evaluation of Explainable AI in Healthcare
Authors: Kristýna Sirka Kacafírková, Maxwell Szymanski, Katrien Verbert et al.
Year: 2025
Source: arXiv (Cornell University)
Despite promising developments in Explainable Artificial Intelligence, the practical value of XAI methods remains under-explored and insufficiently validated in real-world settings. Robust and context...
Complex Event Recognition under Time Constraints: Towards a Formal Framework for Efficient Query Evaluation
Authors: Cristian Riveros, Jaime García
Year: 2025
Source: Proceedings of the ACM on Management of Data
Complex Event Recognition (CER) establishes a relevant solution for processing streams of events, giving users timely information. CER systems detect patterns in real-time, producing complex events an...
Accurate and Efficient Solid Waste Recognition: A Novel Approach Using Google Teachable Machine Based on Convolutional Neural Network (CNN)
Authors: Marcelo Mendoza, László Duma
Year: 2025
Explaining k-Nearest Neighbors: Abductive and Counterfactual Explanations
Authors: Pablo Barceló, Bernardo Subercaseaux, Miguel Romero et al.
Year: 2025
Source: Proceedings of the ACM on Management of Data
Despite the wide use of k-Nearest Neighbors as classification models, their explainability properties remain poorly understood from a theoretical perspective. While nearest neighbors classifiers offe...
Characterizing Knowledge Manipulation in a Russian Wikipedia Fork
Authors: Ricardo Baeza-Yates, Diego Sáez-Trumper, Pablo Aragón et al.
Year: 2025
Source: Proceedings of the International AAAI Conference on Web and Social Media
Wikipedia is powered by MediaWiki, a free and open-source software that is also the infrastructure for many other wiki-based online encyclopedias. These include the recently launched website Ruwiki, w...
SPLASH-SegFormer Pipeline: A Transformer-Based Approach for High-Resolution and Low-Cost Laser Scanner Seafloor Mapping
Authors: Hans Löbel, Javiera Fuentes-Guíñez, Giancarlo Troni
Year: 2025
Source: IEEE Robotics and Automation Letters
High-resolution seafloor mapping continues to be challenging, primarily due to the high costs and complexity of traditional sensors. Laser scanners offer a more affordable alternative, using a monocul...
Evaluating the Performance of Large Language Models on the CONACEM Anesthesiology Certification Exam: A Comparison with Human Participants
Authors: Marcelo Mendoza, Andrés Neyem, Fernando Altermatt et al.
Year: 2025
Source: Applied Sciences
Large Language Models (LLMs) have demonstrated strong performance on English-language medical exams, but their effectiveness in non-English, high-stakes environments is less understood. This study ben...
Information Integrity about Climate Science: A Systematic Review
Authors: Sebastián Valenzuela, Philip N. Howard, Jusen Asuka et al.
Year: 2025
A high-level précis of this Synthesis Report can be found in the Summary for Policymakers report, Facts, Fakes, and Climate Science. The human response to the climate crisis is being obstructed and d...
Facts, Fakes, and Climate Science: Recommendations for Improving Information Integrity about Climate Science
Authors: Sebastián Valenzuela, Philip N. Howard, Jusen Asuka et al.
Year: 2025
This Summary for Policymakers provides a high-level précis of the Synthesis Report, Information Integrity about Climate Science: A Systematic Review. The human response to the climate crisis is being...
Benefits and Risks of LLMs
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Source: CRC Press eBooks
Ethics in Artificial Intelligence and Information Technologies
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Visual Transformers and the Rise of Multimodality
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Source: CRC Press eBooks
A Sociotechnical Approach to Integrate Ethics into AI Projects
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Source: CRC Press eBooks
Regulatory Initiatives in AI
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Source: CRC Press eBooks
Explainable Artificial Intelligence
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Source: CRC Press eBooks
Perspectives and Challenges
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Source: CRC Press eBooks
Fairness, Accountability, and Transparency in AI
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Source: CRC Press eBooks
What is AI Ethics?
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Source: CRC Press eBooks
Beyond the Mainstream: Sustainability and the Replicability Crisis
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Source: CRC Press eBooks
Bias in AI
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Source: CRC Press eBooks
NLP and Representational Bias
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Source: CRC Press eBooks
Transformers and Generative AI
Authors: Marcelo Mendoza, Claudia López, Gabriela Arriagada-Bruneau
Year: 2025
Source: CRC Press eBooks
Practical Adaptive Dynamic Bitvectors
Authors: Gonzalo Navarro
Year: 2025
Source: Software Practice and Experience
Introduction: While operations rank and select on static bitvectors can be supported in constant time, lower bounds show that this is impossible when supporting updates; practical implementati...
Large Language Models in Crisis Informatics for Zero and Few-Shot Classification
Authors: Bárbara Poblete, Andrés Abeliuk, Cinthia Sánchez
Year: 2025
Source: ACM Transactions on the Web
This article presents an exploration of the use of pre-trained Large Language Models (LLMs) for crisis classification to address labeled data dependency issues. We present a methodology that enhances ...
The Role of Organizations in Networked Mobilization: Examining the 2011 Chilean Student Movement Through The Logic of Connective Action
Authors: Denis Parra, Carolina Pérez-Arredondo, Diego Gómez-Zará
Year: 2025
This study examines the communication mechanisms that shape the formation of digitally-enabled mobilization networks. Informed by the logic of connective action, we postulate that the emergence of net...
Novel SIMEX algorithm for autoregressive models to estimate AGN variability
Authors: Susana Eyheramendy, Wilfredo Palma, Felipe Elorrieta et al.
Year: 2025
Source: Monthly Notices of the Royal Astronomical Society
The origin of the variability in accretion disks of active galactic nuclei (AGN) is still unknown, but its behavior can be characterized by modeling the time series of optical wavelength flux...
Foreword to the special section on 3D object retrieval 2024 symposium (3DOR2024)
Authors: Benjamín Bustos, Ivan Sipiran, Tobias Schreck et al.
Year: 2025
Source: Computers & Graphics
Imitating Human Reasoning to Extract 5W1H in News
Authors: Marcelo Mendoza, Hans Löbel, Carlos Muñoz et al.
Year: 2025
Extracting key information from news articles is crucial for advancing search systems. Historically, the 5W1H framework, which organises information based on 'Who', 'What', 'When', 'Where', 'Why', and ...
15th Temporal Web Analytics Workshop (TempWeb) Overview
Authors: Ricardo Baeza-Yates, Omar Alonso, Marc Spaniol
Year: 2025
Performance of single-agent and multi-agent language models in Spanish language medical competency exams
Authors: Marcelo Mendoza, Andrés Neyem, Fernando Altermatt et al.
Year: 2025
Source: BMC Medical Education
Background: Large language models (LLMs) like GPT-4o have shown promise in advancing medical decision-making and education. However, their performance in Spanish-language medical contexts rema...
Elucidating Type Conversions in SQL Engines
Authors: Matías Toro, Éric Tanter, Claudio Gutiérrez et al.
Year: 2025
Source: Lecture notes in computer science
Practical SQL engines differ in subtle ways in their handling of typing constraints and implicit type casts. These issues, usually not considered in formal accounts of SQL, directly affect th...
The Role of Generative AI Use in 2024 Elections Worldwide
Authors: Sebastián Valenzuela, Philip N. Howard, Inga Kristina Trauthig
Year: 2025
A high-level précis of the Technical Paper can be found in the Summary for Policymakers report, Generative AI in Electoral Campaigns: Mapping Global Patterns. GenAI is being deployed in many ways dur...
Generative AI in Electoral Campaigns: Mapping Global Patterns
Authors: Sebastián Valenzuela, Philip N. Howard, Inga Kristina Trauthig
Year: 2025
This Summary for Policymakers provides a high-level précis of the Technical Paper, The Role of Generative AI Use in 2024 Elections Worldwide. GenAI is being deployed in many ways during elections, ra...
Correction: Cross-lingual hate speech detection using domain-specific word embeddings
Authors: Bárbara Poblete, Ayme Arango Monnar, Jorge Perez Rojas
Year: 2025
Source: PLoS ONE
[This corrects the article DOI: 10.1371/journal.pone.0306521.].
Worst-Case-Optimal Joins on Graphs with Topological Relations
Authors: Aidan Hogan, Juan Reutter, Gonzalo Navarro et al.
Year: 2025
Spatial data play an important role in many applications built over knowledge graphs, and are frequently referenced in queries posed to public query services, such as that of Wikidata. Querying for spa...
Repetitiveness Measures Based on String Morphisms
Authors: Gonzalo Navarro, Cristian Urbina
Year: 2025
Source: Theoretical Computer Science
Probabilistic Explanations for Linear Models
Authors: Marcelo Arenas, Bernardo Subercaseaux, Kuldeep S. Meel
Year: 2025
Source: Proceedings of the AAAI Conference on Artificial Intelligence
Formal XAI is an emerging field that focuses on providing explanations with mathematical guarantees for the decisions made by machine learning models. A significant amount of work in this area is cent...
Complex event recognition under time constraints: towards a formal framework for efficient query evaluation
Authors: Cristian Riveros, Jaime García
Year: 2025
Source: arXiv (Cornell University)
This work studies Complex Event Recognition (CER) under time constraints regarding its query language, computational models, and streaming evaluation algorithms. We start by introducing an extension o...
Advancing the Study of Political Misinformation Across Countries and Platforms—Introduction to the Special Issue
Authors: Sebastián Valenzuela, Edson C. Tandoc, Frank Esser et al.
Year: 2025
Source: The International Journal of Press/Politics
The global spread of political misinformation poses serious challenges to democracies, eroding trust and distorting public discourse. However, research has largely focused on WEIRD countries—Western...
Bridging Inequality Gaps: Sustainable Journalism in the News Coverage of Education Policies
Authors: Magdalena Saldaña, Valentina Proust, Cristian Cabalín et al.
Year: 2025
Source: Journalism Practice
By conducting a content analysis of 331 news stories, this study observed how six news organizations covered Chile's new school admission system (SAE) for enrolling in K-12 schools. To identify the pr...
Causality-Based Scores Alignment in Explainable Data Management
Authors: Felipe Azua, Leopoldo Bertossi
Year: 2025
Source: arXiv (Cornell University)
Different attribution scores have been proposed to quantify the relevance of database tuples for query answering in databases; e.g. Causal Responsibility, the Shapley Value, the Banzhaf Power-Index, a...
HealthIUI: Workshop on Intelligent and Interactive Health User Interfaces
Authors: Denis Parra, Peter Brusilovsky, Shriti Raj et al.
Year: 2025
Logical Expressiveness of Graph Neural Networks on Knowledge Graphs
Authors: Pablo Barceló, Miguel Romero, İsmail İlkan Ceylan et al.
Year: 2025
Source: Frontiers in artificial intelligence and applications
Graph neural networks are prominent models for representation learning over graph-structured data. While the capabilities and limitations of these models are well-understood for simple graphs, our und...
Dialogue on difference: Identity and political communication
Authors: Magdalena Saldaña, Sebastián Valenzuela, Khadijah Costley White et al.
Year: 2025
Source: UNC Libraries
Identity is a crucial force in every facet of contemporary politics, but political communication research has too often addressed it only superficially, excluded it from the subfield’s primary f...
NLP modeling recommendations for restricted data availability in clinical settings
Authors: Felipe Bravo-Márquez, Jocelyn Dunstan, Fabián Villena
Year: 2025
Source: BMC Medical Informatics and Decision Making
Background: Clinical decision-making in healthcare often relies on unstructured text data, which can be challenging to analyze using traditional methods. Natural Language Processing (NLP) has ...
A Shopping Agent for Addressing Subjective Product Needs
Authors: Bárbara Poblete, Preetam Prabhu Srikar Dammu, Omar Alonso
Year: 2025
In e-commerce, customers often struggle to find relevant items when their needs involve subjective properties characterized by personal or collective perception, tastes, and opinions, which are typica...
Dialogue on difference: Identity and political communication
Authors: Magdalena Saldaña, Sebastián Valenzuela, Khadijah Costley White et al.
Year: 2025
Source: Communication Monographs
Constant-delay enumeration for SLP-compressed documents
Authors: Cristian Riveros, Martín Muñoz
Year: 2025
Source: Logical Methods in Computer Science
We study the problem of enumerating results from a query over a compressed document. The model we use for compression are straight-line programs (SLPs), which are defined by a context-free grammar tha...
How Expressive are Knowledge Graph Foundation Models?
Authors: Pablo Barceló, Juan Reutter, Michael M. Bronstein et al.
Year: 2025
Source: arXiv (Cornell University)
Knowledge Graph Foundation Models (KGFMs) are at the frontier for deep learning on knowledge graphs (KGs), as they can generalize to completely novel knowledge graphs with different relational vocabul...
A Comparison of Human and Machine Learning Errors in Face Recognition
Authors: Ricardo Baeza-Yates, Carlos Castillo, Marina Estévez-Almenzar
Year: 2025Source: arXiv (Cornell University)
Machine learning applications in high-stakes scenarios should always operate under human oversight. Developing an optimal combination of human and machine intelligence requires an understanding of the...
Patterns of Persistence: Studying News Repertoires Before, During, and After Covid-19
Authors: Sebastián Valenzuela, Ingrid Bachmann, Natalia Solís Valdés
Year: 2025Source: Journalism Studies
In the realm of news consumption, individuals often establish recurrent patterns, integrating diverse sources into distinct repertoires. However, these patterns can change during unprecedented events ...
Generalized straight-line programs
Authors: Gonzalo Navarro, Francisco Javier Vidal Olivares, C. Urbina
Year: 2025Source: Acta Informatica
Preface to the special issue on “Artificial Intelligence‐driven Decision Making in Health and Medicine”
Authors: Leopoldo Bertossi, Herb Kunze, Davide La Torre et al.
Year: 2025Source: International Transactions in Operational Research
Enhancing contact recommendation in social platforms through mental health awareness: Exploring Anorexia Nervosa as a case study
Authors: Ricardo Baeza-Yates, Diana Ramírez‐Cifuentes, Ana Freire et al.
Year: 2025Source: PLoS ONE
We analyze and propose a solution for the exposure of vulnerable users to harmful content during their interaction with contact recommender systems in social platforms. Our approach is dedicated to ma...
Digital Journalism (Studies): An Agenda for the Future
Authors: Magdalena Saldaña, Ramón Salaverría, Oscar Westlund et al.
Year: 2025Source: Digital Journalism
Digital Journalism has an important role to play in encouraging and publishing research with societal relevance that advances digital journalism studies as a field. In this article we discuss the mult...
The Causal-Effect Score in Data Management
Authors: Leopoldo Bertossi, Felipe Azua
Year: 2025Source: arXiv (Cornell University)
The Causal Effect (CE) is a numerical measure of causal influence of variables on observed results. Despite being widely used in many areas, only preliminary attempts have been made to use CE as an at...
Developing and Validating an Automatic Support System for Tumor Coding in Pathology Reports in Spanish
Authors: Jocelyn Dunstan, Fabián Villena, Matías Rojas et al.
Year: 2025Source: JCO Clinical Cancer Informatics
These results demonstrate the feasibility of implementing natural language processing tools in the routine of a cancer center to extract and code valuable information from pathology reports. Our recom...
Towards A Global AI Auditing Framework: Assessment and Recommendations
Authors: Marcelo Mendoza, Sebastián Valenzuela, Janaki Srinivasan et al.
Year: 2025
A high-level précis of the Synthesis Report can be found in the Summary for Policymakers Recommendations for a Global AI Auditing Framework: Summary of Standards and Features. The growing integration...
Towards Computer-Using Personal Agents
Authors: Aidan Hogan, Katja Hose, Olaf Hartig et al.
Year: 2025Source: arXiv (Cornell University)
Computer-Using Agents (CUA) enable users to automate increasingly-complex tasks using graphical interfaces such as browsers. As many potential tasks require personal data, we propose Computer-Using Pe...
A Communication Framework for Compositional Generation
Authors: Denis Parra, Rafael Elberg, Mircea Petrache
Year: 2025Source: arXiv (Cornell University)
Compositionality and compositional generalization--the ability to understand novel combinations of known concepts--are central characteristics of human language and are hypothesized to be essential fo...
Semantic Web and Creative AI -- A Technical Report from ISWS 2023
Authors: Frank van Harmelen, Anna Sofia Lippolis, John Domingue et al.
Year: 2025Source: arXiv (Cornell University)
The International Semantic Web Research School (ISWS) is a week-long intensive program designed to immerse participants in the field. This document reports a collaborative effort performed by ten team...
A frustratingly easy way of extracting political networks from text
Authors: Naim Bro
Year: 2025Source: PLoS ONE
This study demonstrates the use of GPT-4 and variants, advanced language models readily accessible to many social scientists, in extracting political networks from text. This approach showcases the no...
Novel SIMEX algorithm for autoregressive models to estimate AGN variability
Authors: E. Camacho, S. Eyheramendy, Wilfredo Palma et al.
Year: 2025Source: arXiv (Cornell University)
The origin of the variability in accretion disks of active galactic nuclei (AGN) is still unknown, but its behavior can be characterized by modeling the time series of optical wavelength fluxes coming...
FairXAI - A Taxonomy and Framework for Fairness and Explainability Synergy in Machine Learning
Authors: Ricardo Baeza-Yates, Fredrik Heintz, Resmi Ramachandranpillai et al.
Year: 2025Source: IEEE Transactions on Neural Networks and Learning Systems
Explainable artificial intelligence (XAI) and fair learning have made significant strides in various application domains, including criminal recidivism predictions, healthcare settings, toxic comment ...
Ehrenfeucht-Haussler Rank and Chain of Thought
Authors: Pablo Barceló, Tomasz Steifer, Alexander Kozachinskiy
Year: 2025Source: arXiv (Cornell University)
The notion of rank of a Boolean function has been a cornerstone in the theory of PAC learning, enabling quasipolynomial-time learning algorithms for polynomial-size decision trees. We present a novel ...
B-Call: Integrating Ideological Position and Political Cohesion in Legislative Voting Models
Authors: Sergio Toro, Daniel Alcatruz, M. Aníbal Valenzuela et al.
Year: 2025Source: arXiv (Cornell University)
This paper combines two significant areas of political science research: measuring individual ideological position and cohesion. Although both approaches help analyze legislative behaviors, no unified...
The Missing Link: Identifying Digital Intermediaries in E-Government
Authors: Sergio Toro, Sebastián Valenzuela, Teresa Correa et al.
Year: 2025Source: arXiv (Cornell University)
The digitalization of public administration has advanced significantly on a global scale. Many governments now view digital platforms as essential for improving the delivery of public services and fos...
Unsupervised Framing Analysis for Social Media Discourse in Polarizing Events
Authors: Hernán Sarmiento, Felipe Bravo-Márquez, Sebastián Valenzuela et al.
Year: 2025Source: ACM Transactions on the Web
This study investigates the concept of frames in the realm of online polarization, with a focus on social media platforms. The research extends the understanding of how frames–emerging, complex, and...
Mid-Career Reflections: Climbing the Academic Ladder Without a Safety Net
Authors: Marcelo Arenas
Year: 2025Source: ACM SIGMOD Record
As I supposed everyone else did when asked by Tamer to write an article with Advice to Mid-Career Researchers, I read all the previous articles in this series to understand what this paper should be a...
Explaining k-Nearest Neighbors: Abductive and Counterfactual Explanations
Authors: Pablo Barceló, Miguel Romero Orth, Alexander Kozachinskiy et al.
Year: 2025Source: arXiv (Cornell University)
Despite the wide use of $k$-Nearest Neighbors as classification models, their explainability properties remain poorly understood from a theoretical perspective. While nearest neighbors classifiers off...
Curcumin Improves Hippocampal Cell Bioenergetics, Redox and Inflammatory Markers, and Synaptic Proteins, Regulating Mitochondrial Calcium Homeostasis
Authors: Sebastián Valenzuela, Alfonso González, Cláudio Retamal et al.
Year: 2025Source: Neurotoxicity Research
All Your Base Are Belong to Us: Sort Polymorphism for Proof Assistants
Authors: Eric Tanter, Kenji Maillard, Nicolas Tabareau et al.
Year: 2025Source: Proceedings of the ACM on Programming Languages
Proof assistants based on dependent type theory, such as Coq, Lean and Agda, use different universes to classify types, typically combining a predicative hierarchy of universes for computationally-rel...
DWUG ES: Diachronic Word Usage Graphs for Spanish
Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2025Source: Zenodo (CERN European Organization for Nuclear Research)
This data collection contains diachronic Word Usage Graphs (WUGs) for Spanish. Find a description of the data format, code to process the data and further datasets on the WUGsite. Please find more inf...
When is the Computation of a Feature Attribution Method Tractable?
Authors: Pablo Barceló, Micaela Morgado, Roberto Cominetti
Year: 2025Source: arXiv (Cornell University)
Feature attribution methods have become essential for explaining machine learning models. Many popular approaches, such as SHAP and Banzhaf values, are grounded in power indices from cooperative game ...
TOI-4504: Exceptionally Large Transit Timing Variations Induced by Two Resonant Warm Gas Giants in a Three-planet System
Authors: Susana Eyheramendy, Andrés Jordán, Néstor Espinoza et al.
Year: 2025Source: The Astrophysical Journal Letters
Abstract We present a joint analysis of transit timing variations (TTVs) and Doppler data for the transiting exoplanet system TOI-4504. TOI-4504 c is a warm Jupiter-mass planet that exhibits the large...
Correction to: The Semantic Web – ISWC 2024
Authors: Aidan Hogan, Daniel Hernández, Katja Hose et al.
Year: 2025Source: Lecture notes in computer science
Benchmarking zero-shot biomedical relation triplet extraction across language model architectures
Authors: Marcelo Mendoza, Frederik Steensgaard Gade, Ole Lund
Year: 2025
Graph-Linguistic Fusion: Using Language Models for Wikidata Vandalism Detection
Authors: Ricardo Baeza-Yates, Mykola Trokhymovych, Diego Sáez Trumper et al.
Year: 2025
14 Kg of CO2: Analyzing the Carbon Footprint and Performance of Session-Based Recommendation Algorithms
Authors: Jessie Gil, Alejandro Plaza, Denis Parra
Year: 2025Source: Communications in computer and information science
Top-k Document Retrieval in Compressed Space
Authors: Gonzalo Navarro, Yakov Nekrich
Year: 2025Source: Society for Industrial and Applied Mathematics eBooks
Let 𝓓 be a collection of D strings of total length n over an alphabet of size σ. We consider the so-called top-k document retrieval problem: given a short string P and an integer k, list the ident...
A Theoretical Bound which Improves the Performance of Compilation-Based Multi-Agent Path Finding
Authors: Jorge Baier, Roberto Asín‐Achá, Rodrigo López
Year: 2025Source: IEEE Access
Database Theory in Action: Cypher, GQL, and Regular Path Queries
Authors: Domagoj Vrgoč, Oskar van Rest, Stefan Plantikow et al.
Year: 2025Source: arXiv (Cornell University)
Cypher has so far been the most commonly used query language for property graphs, and served as the foundation of the recently standardized graph query language GQL. In designing the features of GQL, ...
Assessment of the acceptability, proximate properties, and product cost of amylase-enhanced mixed cassava and sweet potato syrup
Authors: Marcelo Mendoza
Year: 2025Source: Pantao, international journal of the humanities and social sciences
The variety of goods obtained from root crops, particularly cassava and sweet potatoes, is getting low, thereby affecting their sustainability. The researcher has produced a syrup by combining cassava...
Dynamic Direct Access of MSO Query Evaluation over Strings
Authors: Cristian Riveros, Pierre Bourhis, Stefan Mengel et al.
Year: 2025Source: arXiv (Cornell University)
We study the problem of evaluating a Monadic Second Order (MSO) query over strings under updates in the setting of direct access. We present an algorithm that, given an MSO query with first-order free...
IALab UC at BEA 2025 Shared Task: LLM-Powered Expert Pedagogical Feature Extraction
Authors: Jorge Baier, Sofía Correa Busquets, Valentina Córdova Véliz
Year: 2025
Modeling and Comparative Scenario-Based Simulation of SmartBottle+: An Artificial Intelligence (AI)-Powered Recycling Reward System versus Hungary’s Conventional Reverse Vending Machines (RVMs)
Authors: Marcelo Mendoza, Mohamed Ammar Ahmed
Year: 2025Source: Procedia Computer Science
The growing need for sustainable waste management calls for more efficient and accessible recycling systems. This paper introduces SmartBottle+, an artificial intelligence (AI) powered recycling rewar...
Output Bounds for Conjunctions of Path Queries
Authors: Juan Reutter, Domagoj Vrgoč, Tamara Cucumides
Year: 2025Source: SSRN Electronic Journal
Complexity of Consistent Query Answering in Databases under Cardinality-Based and Incremental Repair Semantics (extended version)
Authors: Leopoldo Bertossi, Andrei Lopatenko
Year: 2025Source: arXiv (Cornell University)
A database D may be inconsistent wrt a given set IC of integrity constraints. Consistent Query Answering (CQA) is the problem of computing from D the answers to a query that are consistent wrt IC . Co...
In-Memory Object Graph Stores
Authors: Benjamin A. Steer, Minh-Duc Pham, Josep Lluís Larriba Pey et al.
Year: 2025Source: arXiv (Cornell University)
We present a design and implementation of an in-memory object graph store, dubbed εStore. Our key innovation is a storage model - epsilon store - that equates an object on the heap to a node in a gra...
Advancing AI Incidents Classification: Leveraging LLMs with Strategic Prompting
Authors: Ricardo Baeza-Yates, Yian Chen, Lana Do et al.
Year: 2025Source: Communications in computer and information science
Screening Dyslexia Using Visual Auditory Computer Games and Machine Learning
Authors: Ricardo Baeza-Yates, Luz Rello, Maria Rauschenberger et al.
Year: 2025Source: IEEE Access
Reading acquisition is one of the main keys for school success and a crucial component for empowering individuals to participate meaningfully in society. Yet, it is still a challenging skill to acquire f...
A Framework for Extraction and Transformation of Documents
Authors: Cristian Riveros, Nicole Schweikardt, Markus L. Schmid
Year: 2025Source: arXiv (Cornell University)
We present a theoretical framework for the extraction and transformation of text documents. We propose to use a two-phase process where the first phase extracts span-tuples from a document, and the se...
Adapting Bias Evaluation to Domain Contexts using Generative Models
Authors: Valentin Barrière, Tamara Quiroga, Felipe Bravo-Márquez
Year: 2025
Optimizing the Performance of the FM-Index for Large-Scale Data
Authors: Dustin Cobas, Gonzalo Navarro, Travis Gagie
Year: 2025Source: arXiv (Cornell University)
The FM-index is a fundamental data structure used in bioinformatics to efficiently search for strings and index genomes. However, the FM-index can pose computational challenges, particularly in the co...
A TWO-STAGE STOCHASTIC PROGRAMMING MODEL FOR THE MID-TERM OIL REFINERY PLANNING UNDER UNCERTAIN DEMAND
Authors: Juan Pablo Luna, Virgílio José Martins Ferreira Filho, Leonardo Nascimento
Year: 2025Source: Anais do Simpósio Brasileiro de Pesquisa Operacional
Artificial Intelligence Enhanced Colposcopy Supports Early Detection of High Grade Cervical Intraepithelial Neoplasia in HPV Positive Individuals
Authors: Claudio Gutiérrez, Andrea Weitoschova, Sandhya Yerra et al.
Year: 2025Source: International Journal of Research Studies in Microbiology and Biotechnology
Hybrid framework for automated generation of mammography radiology reports
Authors: Denis Parra, Eduardo Godoy, Rodrigo Salas et al.
Year: 2025Source: Computational and Structural Biotechnology Journal
Breast cancer remains a significant health concern for women at various stages of life, impacting both productivity and reproductive health. Recent advancements in deep learning (DL) have enabled subs...
Publications for 2024
Displaying 166 publication(s) for 2024
Probabilistic Explanations for Linear Models
Authors: Marcelo Arenas, Bernardo Subercaseaux, Kuldeep S. Meel
Year: 2024Source: arXiv (Cornell University)
Formal XAI is an emerging field that focuses on providing explanations with mathematical guarantees for the decisions made by machine learning models. A significant amount of work in this area is cent...
Clinical analogy resolution performance for foundation language models
Authors: Jocelyn Dunstan, Fabián Villena, Tamara Quiroga
Year: 2024Source: ACM Transactions on Computing for Healthcare
Using extensive data sources to create foundation language models has revolutionized the performance of deep learning-based architectures. This remarkable improvement has led to state-of-the-art resul...
Competing Frames and Melodrama: The Effects of Facebook Posts on Policy Preferences about COVID-19
Authors: Sebastián Valenzuela, Ingrid Bachmann, Daniel Halpern et al.
Year: 2024Source: Routledge eBooks
The tension between health and economic considerations regarding COVID-19 has resulted in a framing contest, in which proponents and adversaries of strong containment measures hold oppositional frames...
Glycemic Control With Layperson-Delivered Telephone Calls vs Usual Care for Patients With Diabetes
Authors: Sebastián Valenzuela, Mathew Sither, Rhonda Aubrey et al.
Year: 2024Source: JAMA Network Open
Importance Diabetes is associated with emotional distress and poor mental health, especially for individuals with low income, hindering patients’ ability to manage their condition. The health care s...
Optimization of Bias Mitigation in Word Embeddings: a Methodological Approach
Authors: Felipe Bravo-Márquez, Mayteé Zambrano
Year: 2024
A Comparative Analysis of Offensive Discourse in the 2021 Chilean Presidential Campaign on Twitter and WhatsApp
Authors: Hernán Sarmiento, Felipe Bravo-Márquez, Sebastián Valenzuela et al.
Year: 2024
TOI-4504: Exceptionally large Transit Timing Variations induced by two resonant warm gas giants in a three planet system
Authors: Trifon Trifonov, M. Skarka, Néstor Espinoza et al.
Year: 2024Source: arXiv (Cornell University)
We present a joint analysis of TTVs and Doppler data for the transiting exoplanet system TOI-4504. TOI-4504 c is a warm Jupiter-mass planet that exhibits the largest known transit timing variations (T...
Bias in Retrieval Systems
Authors: Ricardo Baeza-Yates, Shiran Dudy, Leena Murgai
Year: 2024Source: Information Retrieval
Gradual C0: Symbolic Execution for Gradual Verification
Authors: Eric Tanter, Joshua Sunshine, Jonathan Aldrich et al.
Year: 2024Source: ACM Transactions on Programming Languages and Systems
Current static verification techniques such as separation logic support a wide range of programs. However, such techniques only support complete and detailed specifications, which places an undue burd...
“Your house won’t be yours anymore!” Effects of Misinformation, News Use, and Media Trust on Chile’s Constitutional Referendum
Authors: Magdalena Saldaña, Sebastián Rivera, Ximena Orchard et al.
Year: 2024Source: The International Journal of Press/Politics
News consumption and voting behavior are interlinked and particularly important in elections where traditional political cleavages are not easily applicable. This relationship becomes more complex and...
Multi-label learning on low label density sets with few examples
Authors: Benjamín Bustos, Ivan Sipiran, Tobias Schreck et al.
Year: 2024Source: Expert Systems with Applications
ML-Based Classification of Hamstring Strain Injury from Nonlinear Features of Surface Electromyography Signals
Authors: Marcelo Mendoza, Ma. Belinda C. Fidel, Gian Angelo A. Calumpang et al.
Year: 2024Source: TENCON 2021 - 2021 IEEE Region 10 Conference (TENCON)
Recommendations for a Global AI Auditing Framework: Summary of Standards and Features
Authors: Marcelo Mendoza, Sebastián Valenzuela, Janaki Srinivasan et al.
Year: 2024
This Summary for Policymakers provides a high-level précis of the Synthesis Report Towards A Global AI Auditing Framework: Assessment and Recommendations. The growing integration of artificial intell...
Report on the 14th Workshop on Temporal Web Analytics (TempWeb 2024) at WWW 2024
Authors: Ricardo Baeza-Yates, Marc Spaniol, Omar Alonso
Year: 2024Source: ACM SIGIR Forum
The TempWeb workshop (series) is an established co-located event at The Web Conference that aims at bringing together researchers and practitioners across various domains, taking the constantly evolvi...
PD155 RedETS Horizon Scanning: Impact In The Decision-Making Process
Authors: Gonzalo Navarro, Roland Pastells‐Peiró, Maria-Dolors Estrada et al.
Year: 2024Source: International Journal of Technology Assessment in Health Care
Introduction The RedETS horizon scanning (HS) program in Spain is focused on identifying non-pharmaceutical emerging health technologies. HS is organized in three steps: (i) identification using diffe...
Public adherence to the principles of criminal law in Chile: Shaping factors and consequences for trust in the criminal justice system
Authors: Magdalena Saldaña, Rodrigo González-Fuente, Omar A. Barriga et al.
Year: 2024Source: Política criminal
Adaptive Plane Reformatting for 4D Flow MRI using Deep Reinforcement Learning
Authors: Denis Parra, Cristián Tejos, Sergio Uribe et al.
Year: 2024Source: Proceedings on CD-ROM - International Society for Magnetic Resonance in Medicine. Scientific Meeting and Exhibition/Proceedings of the International Society for Magnetic Resonance in Medicine, Scientific Meeting and Exhibition
Motivation: The standard approach for plane reformatting in 4D flow MRI is manual, leading to time-consuming and user-dependent results. Goal(s): Our goal was to enhance plane reformatting in 4D flow ...
Evaluating regular path queries on compressed adjacency matrices
Authors: Gonzalo Navarro, Diego Arroyuelo, Adrián Gómez-Brandón et al.
Year: 2024Source: The VLDB Journal
PathFinder: Returning Paths in Graph Queries
Authors: Domagoj Vrgoč, Carlos Rojas, Wim Martens et al.
Year: 2024Source: Lecture notes in computer science
Restructuring Tractable Probabilistic Circuits
Authors: Marcelo Arenas, Guy Van den Broeck, Honghua Zhang et al.
Year: 2024Source: arXiv (Cornell University)
Probabilistic circuits (PCs) are a unifying representation for probabilistic models that support tractable inference. Numerous applications of PCs like controllable text generation depend on the abili...
Influence of regional anesthesia on fall risk in adults over 60 years
Authors: Paul Alfred Grützner, Laura C. Siegwart, Svetlana Hetjens et al.
Year: 2024Source: Clinical Biomechanics
Human-AI Coevolution
Authors: Paul Lukowicz, Frank Dignum, Albert‐László Barabási et al.
Year: 2024Source: Artificial Intelligence
Human-AI coevolution, defined as a process in which humans and AI algorithms continuously influence each other, increasingly characterises our society, but is understudied in artificial intelligence a...
Static Slicing for Probabilistic Programs: An Overview
Authors: Federico Olmedo
Year: 2024Source: Lecture notes in computer science
Reducing Interpretative Ambiguity in an educational environment with ChatGPT.
Authors: Marcelo Mendoza, Miguel Nussbaum, Zvi Bekerman et al.
Year: 2024Source: Computers & Education
Enhancing commit message quality in software capstone projects with generative AI
Authors: Marcelo Mendoza, Andrés Neyem, Juan Pablo Sandoval Alcocer et al.
Year: 2024Source: SoftwareX
Software Capstone Projects provide valuable hands-on experience for students in software development, and creating effective commit messages is an essential, though often challenging, part of this pro...
Towards Tractability of the Diversity of Query Answers: Ultrametrics to the Rescue
Authors: Cristian Riveros, Marcelo Arenas, Reinhard Pichler et al.
Year: 2024Source: Proceedings of the ACM on Management of Data
The set of answers to a query may be very large, potentially overwhelming users when presented with the entire set. In such cases, presenting only a small subset of the answers to the user may be pref...
Complex Event Recognition meets Hierarchical Conjunctive Queries
Authors: Cristian Riveros, Dante Pinto
Year: 2024Source: Proceedings of the ACM on Management of Data
Hierarchical conjunctive queries (HCQ) are a subclass of conjunctive queries (CQ) with robust algorithmic properties. Among others, Berkholz, Keppeler, and Schweikardt have shown that HCQ is the subcl...
Trends in the Global Information Environment: 2024 Expert Survey Results
Authors: Sebastián Valenzuela, Philip N. Howard, Sacha Altay
Year: 2024
The global information environment is under great pressure. How do experts around the world perceive the features of and threats to the information environment in the countries they study? In June 202...
Disjointed Polarization in Chile’s Enduring Crisis of Representation – ERRATUM
Authors: Juan Pablo Luna
Year: 2024Source: Latin American Politics and Society
BWBEV: A Bitwise Query Processing Algorithm for Approximate Prefix Search
Authors: Ricardo Baeza-Yates, Edleno Silva de Moura, Berg Ferreira et al.
Year: 2024Source: Journal of the Brazilian Computer Society
We tackle the challenge of conducting an approximate prefix search within datasets of strings. We explore using a bit-parallelism technique to compute the edit distance between distinct strings and il...
A Uniform Language to Explain Decision Trees
Authors: Pablo Barceló, Marcelo Arenas, Bernardo Subercaseaux et al.
Year: 2024
The formal XAI community has studied a plethora of interpretability queries aiming to understand the classifications made by decision trees. However, a more uniform understanding of what questions we ...
Streaming enumeration on nested documents
Authors: Cristian Riveros, Martín Muñoz
Year: 2024Source: ACM Transactions on Database Systems
Some of the most relevant document schemas used online, such as XML and JSON, have a nested format. In the last decade, the task of extracting data from nested documents over streams has become especi...
Computing MEMs and Relatives on Repetitive Text Collections
Authors: Gonzalo Navarro
Year: 2024Source: ACM Transactions on Algorithms
We consider the problem of computing the Maximal Exact Matches (MEMs) of a given pattern $P[1..m]$ on a large repetitive text collection $T[1..n]$ over an alphabet of siz...
Survey of data stories: Guidelines for data story authoring
Authors: Denis Parra, Manuela Garretón, Daniela Moyano et al.
Year: 2024Source: Information Visualization
Data stories are sequences of data facts connected through a meaningful narrative and combine data visualizations and storytelling to convey information effectively. They have gained popularity due to...
Correction: AI content detection in the emerging information ecosystem: new obligations for media and tech companies
Authors: Ricardo Baeza-Yates, David Eyers, Susan Leavy et al.
Year: 2024Source: Ethics and Information Technology
The Distributional Uncertainty of the SHAP Score in Explainable Machine Learning
Authors: Leopoldo Bertossi, Miguel Romero, Nina Pardal et al.
Year: 2024Source: Frontiers in artificial intelligence and applications
Attribution scores reflect how important the feature values in an input entity are for the output of a machine learning model. One of the most popular attribution scores is the SHAP score, which is an...
ARCHIE: Articulated Robot for Collaborative Highly Integrated Education
Authors: Marcelo Mendoza, E. E. Mendoza, Juan Saeteros et al.
Year: 2024
Merging Gradual Typing
Authors: Matías Toro, Wenjia Ye, Bruno C. d. S. Oliveira
Year: 2024Source: Proceedings of the ACM on Programming Languages
Programming language mechanisms with a type-directed semantics are nowadays common and widely used. Such mechanisms include gradual typing, type classes, implicits and intersection types with a merge ...
Dynamic direct access of MSO query evaluation over strings
Authors: Cristian Riveros, Pierre Bourhis, Stefan Mengel et al.
Year: 2024Source: arXiv (Cornell University)
We study the problem of evaluating a Monadic Second Order (MSO) query over strings under updates in the setting of direct access. We present an algorithm that, given an MSO query with first-order free...
Dynamic Direct Access of MSO Query Evaluation over Strings
Authors: Cristian Riveros, Pierre Bourhis, Stefan Mengel et al.
Year: 2024Source: arXiv (Cornell University)
We study the problem of evaluating a Monadic Second Order (MSO) query over strings under updates in the setting of direct access. We present an algorithm that, given an MSO query with first-order free...
Fast and Small Subsampled R-indexes
Authors: Gonzalo Navarro, Travis Gagie, Dustin Cobas
Year: 2024Source: arXiv (Cornell University)
The $r$-index represented a breakthrough in compressed indexing of repetitive text collections, outperforming its alternatives by orders of magnitude in query time. Its space usage, $O(r)$ where $r$ i...
AI content detection in the emerging information ecosystem: new obligations for media and tech companies
Authors: Ricardo Baeza-Yates, David Eyers, Susan Leavy et al.
Year: 2024Source: Ethics and Information Technology
The world is about to be swamped by an unprecedented wave of AI-generated content. We need reliable ways of identifying such content, to supplement the many existing social institutions that enable tr...
Sense through time: diachronic word sense annotations for word sense induction and Lexical Semantic Change Detection
Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg et al.
Year: 2024Source: Language Resources and Evaluation
Abstract There has been extensive work on human word sense annotation, i.e., manually labeling word uses in natural texts according to their senses. Such labels were primarily created for the tasks of...
Political participation and technology
Authors: Sebastián Valenzuela, Marcelo Santos
Year: 2024Source: Routledge eBooks
Adaptive Dynamic Bitvectors
Authors: Gonzalo Navarro
Year: 2024Source: Lecture notes in computer science
Compressed Graph Representations for Evaluating Regular Path Queries
Authors: Gonzalo Navarro, J Robert
Year: 2024Source: Lecture notes in computer science
Repetitive Patterns Recognition in Textures of Ancient Peruvian Pottery
Authors: Benjamín Bustos, Ivan Sipiran, Sebastian Sepulveda
Year: 2024Source: Journal on Computing and Cultural Heritage
We present a study and comparison of computer vision methods for the task of finding repetitive motifs in ancient Peruvian pottery. Under this context, the main difficulties for solving the task are t...
Expert Survey on the Global Information Environment 2024: Searching for Solutions
Authors: Sebastián Valenzuela, Philip N. Howard, Sacha Altay
Year: 2024
The global information environment is under significant pressure from the development of new technologies and shifting public policies. How do experts around the world perceive the varied features of,...
Responsible AI Day
Authors: Ricardo Baeza-Yates, Nataly Buslón
Year: 2024Source: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
We summarize the goals of the Responsible AI day, giving a glimpse of the program as well as a short biography of the organizers.
Automatic knowledge-graph creation from historical documents: The Chilean dictatorship as a case study
Authors: Antonia Fonck, Camila Díaz, Alejandro Grez et al.
Year: 2024Source: arXiv (Cornell University)
We present our results regarding the automatic construction of a knowledge graph from historical documents related to the Chilean dictatorship period (1973-1990). Our approach consists of using LLMs t...
Microbiota-dependent T-cell response to α-synuclein-derived antigens triggers the development of hypersensitivity and neuroinflammation associated with Parkinson's Disease
Authors: Sebastián Valenzuela, M Ricca, Valentina Ugalde et al.
Year: 2024Source: Research Square (Research Square)
Background. Previous evidence has shown that both the T-cell response and the microbiota play fundamental roles on the development of Parkinson's Disease (PD), whi...
Gradual Indexed Inductive Types
Authors: Eric Tanter, Kenji Maillard, Nicolas Tabareau et al.
Year: 2024Source: Proceedings of the ACM on Programming Languages
Indexed inductive types are essential in dependently-typed programming languages, enabling precise and expressive specifications of data structures and properties. Recognizing that programming and pro...
Self-Supervised Learning Applied to Variable Star Semi-Supervised Classification Using LSTM and GRU Networks
Authors: Hans Löbel, R. B. Merino, Billy Peralta et al.
Year: 2024
Recognizing variable stars is a task of interest in the astronomy community. Currently, this task has taken advantage of deep learning algorithms. However, these algorithms require a large amount of d...
Movelet Trees
Authors: Gonzalo Navarro, Travis Gagie, Giovanni Manzini et al.
Year: 2024Source: arXiv (Cornell University)
We combine Nishimoto and Tabei's move structure with a wavelet tree to show how, if $T [1..n]$ is over a constant-sized alphabet and its Burrows-Wheeler Transform (BWT) consists of $r$ runs, then we c...
WIP: Evaluation of the Third Design Cycle of the Wellbeing Teaching Assistant (WTA): Understanding What Type of Cases are Served Through a Categorization Analysis
Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo et al.
Year: 2024
Well-being is increasingly recognized as a key element to foster within higher education. In this context, our institution, a prominent engineering school in Latin America, has created the ...
Unpacking Student Workload through Elicitation Techniques: Perspectives from Engineering Faculty and Students
Authors: Jorge Baier, Isabel Hilliger, Erick Svec et al.
Year: 2024
This is a work-in-progress about student workload. Over the past two decades, practitioners and researchers have shown concern for student workload within engineering programs. Since the late...
WIP: Exploring the Effects of a Purpose-in-Life Reflection Activity in an Introductory Artificial Intelligence Course
Authors: Jorge Baier, Trini Balart, Kristi Shryock et al.
Year: 2024
Sense of purpose in life is related to actively choosing to work for the benefit of society and has been recognized as a key influencer of well-being, which in turn has been established to be ...
WIP: Traditional Engineering Assessments Challenged by ChatGPT: An Evaluation of its Performance on a Fundamental Competencies Exam
Authors: Jorge Baier, Trini Balart, Martín Castillo
Year: 2024
ChatGPT, a chatbot which produces text with remarkable coherence, is leading higher education institutions to question the relevance of the current model of engineering education and, particu...
Complex event recognition meets hierarchical conjunctive queries
Authors: Cristian Riveros, Dante Pinto
Year: 2024Source: arXiv (Cornell University)
Hierarchical conjunctive queries (HCQ) are a subclass of conjunctive queries (CQ) with robust algorithmic properties. Among others, Berkholz, Keppeler, and Schweikardt have shown that HCQ is the subcl...
Towards Tractability of the Diversity of Query Answers: Ultrametrics to the Rescue
Authors: Reinhard Pichler, Timo Camillo Merkl, Cristian Riveros et al.
Year: 2024Source: arXiv (Cornell University)
The set of answers to a query may be very large, potentially overwhelming users when presented with the entire set. In such cases, presenting only a small subset of the answers to the user may be pref...
Global Approaches to Auditing Artificial Intelligence: A Literature Review
Authors: Marcelo Mendoza, Sebastián Valenzuela, Janaki Srinivasan et al.
Year: 2024
This Synthesis Report is a literature review outlining the regulatory, industry, and academic approaches to AI audits. We review 78 articles published in peer-reviewed journals and as preprints, 21 do...
Telar and TelarKG: Data-Driven Insights into Chile’s Constitutional Process
Authors: Aidan Hogan, Juan Reutter, Sergio Toro et al.
Year: 2024Source: Communications of the ACM
Human-AI Coevolution (Abstract Reprint)
Authors: Ricardo Baeza-Yates, Dino Pedreschi, Alistair Knott et al.
Year: 2024
Human-AI coevolution, defined as a process in which humans and AI algorithms continuously influence each other, increasingly characterises our society, but is understudied in artificial intelligence a...
New Compressed Indices for Multijoins on Graph Databases
Authors: Gonzalo Navarro, Diego Arroyuelo, Adrián Gómez-Brandón et al.
Year: 2024Source: arXiv (Cornell University)
A recent surprising result in the implementation of worst-case-optimal (wco) multijoins in graph databases (specifically, basic graph patterns) is that they can be supported on graph representations t...
Gradual Differentially Private Programming
Authors: Matías Toro, Eric Tanter, Federico Olmedo et al.
Year: 2024Source: Communications of the ACM
Response to Kempf et al on Methodological and Practical Aspects of a Distant Metastasis Detection Model
Authors: Jocelyn Dunstan, Pablo Báez, Ricardo Ahumada et al.
Year: 2024Source: JCO Clinical Cancer Informatics
“The more official, the less I believe”: Using focus groups to explore public opinion formation in politically polarized contexts
Authors: Magdalena Saldaña, Andrés Scherman, Cristian Cabalín et al.
Year: 2024Source: Social Science Quarterly
Introduction: Public opinion studies have traditionally relied on survey analyses. However, a qualitative approach is needed to address opinion formation's multidimensional and contextual natu...
Welcome
Authors: Bárbara Poblete, Fábio Kon, Sebastián Uchitel
Year: 2024Source: Communications of the ACM
It is with great pleasure that we introduce the second edition of the Communications of the ACM Latin American Regional Special Section. In this edition, we showcase some of the region's most intere...
A pseudonymized corpus of occupational health narratives for clinical entity recognition in Spanish
Authors: Sebastián Viteri Valenzuela, Paulina Vera, Tamara Quiroga et al.
Year: 2024Source: BMC Medical Informatics and Decision Making
Despite the high creation cost, annotated corpora are indispensable for robust natural language processing systems. In the clinical field, in addition to annotating medical entities, corpus creators m...
Dynamic compact data structure for temporal reachability with unsorted contact insertions
Authors: Gonzalo Navarro, Bruno Augusto Nassif Travençolo, Marcelo Keese Albertini et al.
Year: 2024Source: The Computer Journal
Temporal graphs represent interactions between entities over time. Deciding whether entities can reach each other through temporal paths is useful for various applications such as in communic...
LB1040 Machine learning-based predictive model with routine blood work identifies moderate-severe alopecia areata
Authors: Marcelo Mendoza, Tarun Sharma, Ross O’Hagan et al.
Year: 2024Source: Journal of Investigative Dermatology
Tackling Challenges in Implementing Large-Scale Graph Databases
Authors: Aidan Hogan, Juan Reutter, Domagoj Vrgoč et al.
Year: 2024Source: Communications of the ACM
The economics of ethnic marriages: Endogamy and the social status of minority groups
Authors: Naim Bro, Liran Morav
Year: 2024Source: British Journal of Sociology
This study examines the relationship between ethnic endogamy and socioeconomic status (SES) within the socioeconomically divergent Jewish and Native‐Chilean Mapuche communities of Santiago,...
Joint models reveal genetic architecture of pubertal stage transitions and their association with BMI in admixed Chilean population
Authors: Susana Eyheramendy, Lucas Vicuña, Verónica Mericq et al.
Year: 2024Source: Human Molecular Genetics
Early or late pubertal onset can lead to disease in adulthood, including cancer, obesity, type 2 diabetes, metabolic disorders, bone fractures, and psychopathologies. Thus, knowing the age at which pu...
Path-based Algebraic Foundations of Graph Query Languages
Authors: Renzo Angles, Domagoj Vrgoč, Angela Bonifati et al.
Year: 2024Source: arXiv (Cornell University)
Graph databases are gaining momentum thanks to the flexibility and expressiveness of their data model and query languages. A standardization activity driven by the ISO/IEC standardization body is also...
Cuando los algoritmos son editores: Cómo las redes sociales, la IA y la desinformación alteran el consumo de noticias
Authors: Sebastián Valenzuela
Year: 2024Source: Comunicación y Medios
Esta es una versión editada de la charla magistral del autor en la inauguración de la Conferencia Académica por el Día Mundial de la Libertad de Prensa de UNESCO 2024 y que organizaron la Universi...
Entity normalization in a Spanish medical corpus using a UMLS-based lexicon: findings and limitations
Authors: Jocelyn Dunstan, Pablo Báez, Fredy Núñez Torres et al.
Year: 2024Source: Language Resources and Evaluation
Extracting and Encoding: Leveraging Large Language Models and Medical Knowledge to Enhance Radiological Text Representation
Authors: René Víctor Valqui Vidal, Álvaro Soto, Denis Parra et al.
Year: 2024Source: arXiv (Cornell University)
Advancing representation learning in specialized fields like medicine remains challenging due to the scarcity of expert annotations for text and images. To tackle this issue, we present a novel two-st...
Taxonomic classification with maximal exact matches in KATKA kernels and minimizer digests.
Authors: Gonzalo Navarro, Travis Gagie, Giovanni Manzini et al.
Year: 2024Source: PubMed
For taxonomic classification, we are asked to index the genomes in a phylogenetic tree such that later, given a DNA read, we can quickly choose a small subtree likely to contain the genome from which ...
Adversarial Pairwise Multimodal Recommendation
Authors: Denis Parra, Ricardo Ñanculef, Mario Mallea
Year: 2024Source: 2022 International Joint Conference on Neural Networks (IJCNN)
Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning
Authors: Denis Parra, Rafael Elberg, Mircea Petrache
Year: 2024
Image and multimodal machine learning tasks are very challenging to solve in the case of poorly distributed data. In particular, data availability and privacy restrictions exacerbate these hurdles in ...
ERDoc: A Web Interface for Entity-Relation Modelling
Authors: Aidan Hogan, Sebastián Ferrada, Matias Lopez
Year: 2024
The Generalized Causal-Effect Score in Data Management (short paper)
Authors: Leopoldo Bertossi, Felipe Azua
Year: 2024
TelarKG: a Knowledge Graph of Chile's Constitutional Process
Authors: Aidan Hogan, Juan Reutter, Renzo Angles et al.
Year: 2024
In this paper we present TelarKG, a knowledge graph (KG) that consolidates multiple sources of information regarding the Chilean Constitutional process, particularly about the work of the members of t...
Space & Time Efficient Leapfrog Triejoin
Authors: Domagoj Vrgoč, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2024
Leapfrog Triejoin (LTJ) is arguably the most practical and popular worst-case-optimal (wco) algorithm for solving basic graph patterns in graph databases. Its main drawback is that it needs the databa...
Physics-informed neural networks for parameter estimation in blood flow models
Authors: Jocelyn Dunstan, Sergio Uribe, Jeremías Garay et al.
Year: 2024Source: Computers in Biology and Medicine
Gender Representation Across Online Retail Products
Authors: Bárbara Poblete, Dana Pessach
Year: 2024Source: 2022 ACM Conference on Fairness, Accountability, and Transparency
We present a broad characterization of gender representation in a large heterogeneous sample of retail products. In particular, we study online product textual information, such as titles and descript...
A New Upper Bound for the Makespan of Cost-Optimal Solutions for Multi-Agent Path Finding (Extended Abstract)
Authors: Jorge Baier, Rodrigo López, Roberto Asín‐Achá
Year: 2024Source: Proceedings of the International Symposium on Combinatorial Search
A well-known approach to solving Multi-Agent Path Finding (MAPF) optimally is compilation to Boolean Satisfiability or Answer Set Programming (ASP). Such compilation-based approaches are superior to o...
Finding a Small, Diverse Subset of the Pareto Solution Set in Bi-Objective Search (Extended Abstract)
Authors: Jorge Baier, Nicolás Rivera, Pablo Araneda et al.
Year: 2024Source: Proceedings of the International Symposium on Combinatorial Search
Bi-objective search requires computing a Pareto solution set which contains a set of paths. In real-world applications, Pareto solution sets may contain several tens or even hundreds of solutions. For...
Counting on General Run-Length Grammars
Authors: Gonzalo Navarro, Alejandro Pacheco
Year: 2024Source: arXiv (Cornell University)
We introduce a data structure for counting pattern occurrences in texts compressed with any run-length context-free grammar. Our structure uses space proportional to the grammar size and counts the oc...
Querying Graph Databases at Scale
Authors: Aidan Hogan, Domagoj Vrgoč
Year: 2024
The tutorial provides an in-depth overview of recent advances in algorithms and data structures for processing graph database queries. The focus will be on scalable algorithms that have been demonstra...
MillenniumDB: A Multi-modal, Multi-model Graph Database
Authors: Aidan Hogan, Marcelo Arenas, Juan Reutter et al.
Year: 2024
Current knowledge graphs encompass diverse data formats, including images, text, tables, audio files, and videos. Additionally, the graph database ecosystem is required to support multiple co-existing...
The Limitations of Data, Machine Learning and Us
Authors: Ricardo Baeza-Yates
Year: 2024
Machine learning (ML), particularly deep learning, is being used everywhere. However, it is not always applied well, and it can raise ethical and/or scientific issues. In this keynote we first do a deep dive in the...
Demonstrating REmatch: A Novel RegEx Engine for Finding all Matches
Authors: Cristian Riveros, Domagoj Vrgoč, Vicente Calisto et al.
Year: 2024
In this demonstration we showcase REmatch, a regular expression (RegEx) engine built to find all matches of a given pattern in a document. REmatch is based on the theory of enumeration algorithms, and...
A Data Management Approach to Explainable AI
Authors: Marcelo Arenas
Year: 2024
In recent years, there has been a growing interest in developing methods to explain individual predictions made by machine learning models. This has led to the development of various notions of explan...
Using Color Refinement to Boost Enumeration and Counting for Acyclic CQs of Binary Schemas
Authors: Cristian Riveros, Nicole Schweikardt, Benjamin Scheidt
Year: 2024Source: arXiv (Cornell University)
We present an index structure, called the color-index, to boost the evaluation of acyclic conjunctive queries (ACQs) over binary schemas. The color-index is based on the color refinement algorithm, a ...
A Framework for Extraction and Transformation of Documents
Authors: Cristian Riveros, Nicole Schweikardt, Markus L. Schmid
Year: 2024Source: arXiv (Cornell University)
We present a theoretical framework for the extraction and transformation of text documents. We propose to use a two-phase process where the first phase extracts span-tuples from a document, and the se...
A Principled Approach for a New Bias Measure
Authors: Ricardo Baeza-Yates, Bruno Scarone, Alfredo Viola
Year: 2024Source: arXiv (Cornell University)
The widespread use of machine learning and data-driven algorithms for decision making has been steadily increasing over many years. The areas in which this is happening are diverse: healthcare, employ...
Compact Path Representations for Graph Database Pattern Matching
Authors: Domagoj Vrgoč, Carlos Rojas, Stijn Vansummeren et al.
Year: 2024
Modern graph database query languages such as GQL, SQL/PGQ, and Cypher allow regular path queries to return entire paths, as opposed to only their endpoints. This is challenging for query evaluation, ...
14th Temporal Web Analytics Workshop (TempWeb)
Authors: Ricardo Baeza-Yates, Omar Alonso, Marc Spaniol
Year: 2024
The TempWeb workshop series is an established co-located event at The Web Conference that aims at bringing together researchers and practitioners across various domains. Naturally, submissions address...
Implications of Regulations on the Use of AI and Generative AI for Human-Centered Responsible Artificial Intelligence
Authors: Ricardo Baeza-Yates, Marios Constantinides, Michael Madaio et al.
Year: 2024
With the upcoming AI regulations (e.g., EU AI Act) and rapid advancements in generative AI, new challenges emerge in the area of Human-Centered Responsible Artificial Intelligence (HCR-AI). As AI beco...
Long Tail Image Generation Through Feature Space Augmentation and Iterated Learning
Authors: Mircea Petrache, Denis Parra, Rafael Elberg
Year: 2024Source: arXiv (Cornell University)
Image and multimodal machine learning tasks are very challenging to solve in the case of poorly distributed data. In particular, data availability and privacy restrictions exacerbate these hurdles in ...
Disjointed Polarization in Chile’s Enduring Crisis of Representation
Authors: Juan Pablo Luna
Year: 2024Source: Latin American Politics and Society
This analytical essay proposes the notion of disjointed polarization to characterize the nature of polarization in contemporary Chile. In disjointed polarization, elite-level polarization doe...
Is the change deforestation? Using time-series analysis of satellite data to disentangle deforestation from other forest degradation causes
Authors: Susana Eyheramendy, Javier Lopatin, Ignacio Fuentes et al.
Year: 2024Source: Remote Sensing Applications Society and Environment
Augmented non-hallucinating large language models as medical information curators
Authors: Aidan Hogan, Jakob Nikolas Kather, Stephen Gilbert
Year: 2024Source: npj Digital Medicine
Reliably processing and interlinking medical information has been recognized as a critical foundation to the digital transformation of medical workflows, and despite the development of medical ontolog...
U Can't Gen This? A Survey of Intellectual Property Protection Methods for Data in Generative AI
Authors: Andreas Rauber, Tanja Šarčević, Rudolf Mayer et al.
Year: 2024Source: arXiv (Cornell University)
Large Generative AI (GAI) models have the unparalleled ability to generate text, images, audio, and other forms of media that are increasingly indistinguishable from human-generated content. As these ...
SpatialCluster: A Python library for urban clustering
Authors: Marcelo Mendoza, Hans Löbel, Naim Bro et al.
Year: 2024Source: SoftwareX
This paper introduces SpatialCluster, a Python library developed for clustering urban areas using geolocated data. The library integrates a range of methods for urban clustering, including Deep Modula...
A Circus of Circuits: Connections Between Decision Diagrams, Circuits, and Automata
Authors: Mikaël Monet, Guy Van den Broeck, Antoine Amarilli et al.
Year: 2024Source: arXiv (Cornell University)
This document is an introduction to two related formalisms to define Boolean functions: binary decision diagrams, and Boolean circuits. It presents these formalisms and several of their variants studi...
Generalized Straight-Line Programs
Authors: Gonzalo Navarro, Francisco Javier Vidal Olivares, C. Urbina
Year: 2024Source: arXiv (Cornell University)
It was recently proved that any Straight-Line Program (SLP) generating a given string can be transformed in linear time into an equivalent balanced SLP of the same asymptotic size. We generalize this ...
(Don’t) Stop Believing: A Signal Detection Approach to Risk and Protective Factors for Engagement with Politicized (Mis)Information in Social Media
Authors: Sebastián Valenzuela, Marcelo Santos, Tobias Rothmund et al.
Year: 2024
Prior misinformation research often lacks comparisons with the processing of true information and specifically focuses on the dangers of right-wing or conservative misinformation. By employing a signa...
A Self-Righteous, Not a Virtuous, Circle: Proposing a New Framework for Studying Media Effects on Knowledge and Political Participation in a Social Media Environment
Authors: Sebastián Valenzuela, Sangwon Lee
Year: 2024Source: Social Media + Society
To explain the participatory effects of news exposure, communication scholars have long relied upon the “virtuous circle” framework of media use and civic participation. That is, news consumption ...
Detection and impact estimation of social bots in the Chilean Twitter network
Authors: Marcelo Mendoza, Sebastián Valenzuela, Marcelo Santos et al.
Year: 2024Source: Scientific Reports
The rise of bots that mimic human behavior represents one of the most pressing threats to healthy information environments on social media. Many bots are designed to increase the visibility o...
Faster Maximal Exact Matches with Lazy LCP Evaluation
Authors: Gonzalo Navarro, Travis Gagie, Adrián Goga et al.
Year: 2024
MONI (Rossi et al.,
A simpler data structure for dynamic strings
Authors: Gonzalo Navarro, Zsuzsanna Lipták, Francesco Masillo
Year: 2024Source: arXiv (Cornell University)
We consider the problem of maintaining a collection of strings while efficiently supporting splits and concatenations on them, as well as comparing two substrings, and computing the longest common pre...
BAT-LZ Out of Hell
Authors: Gonzalo Navarro, Zsuzsanna Lipták, Francesco Masillo
Year: 2024Source: arXiv (Cornell University)
Despite consistently yielding the best compression on repetitive text collections, the Lempel-Ziv parsing has resisted all attempts at offering relevant guarantees on the cost to access an arbitrary s...
Worst-Case-Optimal Similarity Joins on Graph Databases
Authors: Aidan Hogan, Juan Reutter, Benjamín Bustos et al.
Year: 2024Source: Proceedings of the ACM on Management of Data
We extend the concept of worst-case optimal equijoins in graph databases to the case where some nodes are required to be within the k-nearest neighbors (kNN) of others under some similarity function. ...
Similarity joins and clustering for SPARQL
Authors: Aidan Hogan, Benjamín Bustos, Sebastián Ferrada
Year: 2024Source: Semantic Web
The SPARQL standard provides operators to retrieve exact matches on data, such as graph patterns, filters and grouping. This work proposes and evaluates two new algebraic operators for SPARQL 1.1 that...
Exploring the Impact of Generative AI for StandUp Report Recommendations in Software Capstone Project Development
Authors: Marcelo Mendoza, Andrés Neyem, Juan Pablo Sandoval Alcocer et al.
Year: 2024
StandUp Reports play an important role in capstone software engineering courses, facilitating progress tracking, obstacle identification, and team collaboration. However, despite their significance, s...
Introduction to Responsible AI
Authors: Ricardo Baeza-Yates
Year: 2024
In the first part of this tutorial we define responsible AI and we discuss the problems embedded in terms like ethical or trustworthy AI. In the second part, to set the stage, we cover irresponsible A...
Stronger and Safer Together: Motivations for and Challenges of (Trans)National Collaboration in Investigative Reporting in Latin America
Authors: Magdalena Saldaña, Lourdes M. Cueva Chacón
Year: 2024Source: Routledge eBooks
Despite the growing scholarship on investigative journalism in Latin America, very few studies have addressed collaboration across newsrooms in the region. By analyzing the responses of 251 journalist...
Implications of Regulations on the Use of AI and Generative AI for Human-Centered Responsible Artificial Intelligence
Authors: Mohammad Tahaei, Edyta P. Bogucka, Seán Kennedy et al.
Year: 2024Source: arXiv (Cornell University)
With the upcoming AI regulations (e.g., EU AI Act) and rapid advancements in generative AI, new challenges emerge in the area of Human-Centered Responsible Artificial Intelligence (HCR-AI). As AI beco...
The Threat of Misinformation on Journalism’s Epistemology: Exploring the Gap between Journalist’s and Audience’s Expectations when Facing Fake Content
Authors: Marcelo Mendoza, Sebastián Valenzuela, Eliana Providel et al.
Year: 2024Source: Digital Journalism
This study analyzes the discourse of reporters, editors and audiences in focus groups and in-depth interviews, examining the expectations on journalists when facing misinformation. While both groups a...
A Family of Centrality Measures for Graph Data Based on Subgraphs
Authors: Cristian Riveros, Sebastián Bugedo, Jorge Salas
Year: 2024Source: ACM Transactions on Database Systems
We present the theoretical foundations and first experimental study of a new approach in centrality measures for graph data. The main principle is straightforward: the more relevant subgraphs around a...
Work in Progress: A Cross-sectional Survey Study for Understanding and Addressing the Needs of Engineering Students During COVID-19
Authors: Jorge Baier, Isabel Hilliger, Constanza Melian et al.
Year: 2024Source: 2020 ASEE Virtual Annual Conference Content Access Proceedings
His research focuses on areas of automated reasoning in Artificial Intelligence; specifically, automated planning, search and knowledge representation. Currently his research focuses on understanding h...
Iterated Straight-Line Programs
Authors: Gonzalo Navarro, C. Urbina
Year: 2024Source: arXiv (Cornell University)
We explore an extension to straight-line programs (SLPs) that outperforms, for some text families, the measure $\delta$ based on substring complexity, a lower bound for most measures and compressors e...
Stronger compact representations of object trajectories
Authors: Gonzalo Navarro, Travis Gagie, Adrián Gómez-Brandón et al.
Year: 2024Source: Geo-spatial Information Science
GraCT and ContaCT were the first compressed data structures to represent object trajectories, demonstrating that it was possible to use orders of magnitude less space than classical indexes while stay...
The Ring: Worst-case Optimal Joins in Graph Databases using (Almost) No Extra Space
Authors: Aidan Hogan, Juan Reutter, Gonzalo Navarro et al.
Year: 2024Source: ACM Transactions on Database Systems
We present an indexing scheme for triple-based graphs that supports join queries in worst-case optimal (wco) time within compact space. This scheme, called a ring, regards each triple as a cyclic str...
The Well-being Teaching Assistant: A Proactive Approach to Caring for Students with Academic and Personal Difficulties in Massive Courses
Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo et al.
Year: 2024
Since the COVID pandemic, some higher education institutions have promoted a flexible evaluation approach for students who face a variety of problems. Instructors willing to implement such fl...
Social ties, mental well-being and academic self-regulation. Exploring effects through Structural Equation Modeling.
Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo et al.
Year: 2024
A long tradition of studies in both psychology and sociology has shown that social ties have positive effects on mental well-being of both the population in general and in educational contexts in part...
Link Prediction with Relational Hypergraphs
Authors: Pablo Barceló, Michael M. Bronstein, Miguel Romero Orth et al.
Year: 2024Source: arXiv (Cornell University)
Link prediction with knowledge graphs has been thoroughly studied in graph machine learning, leading to a rich landscape of graph neural network architectures with successful applications. Nonetheless...
WIP: Exploring differences in student sense of belonging inside and outside the engineering classroom
Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo et al.
Year: 2024
This Work-in-Progress (WIP) aims to explore differences in engineering students' sense of belonging. By sense of belonging, researchers have referred to the feeling of mattering to a communit...
The Distributional Uncertainty of the SHAP score in Explainable Machine Learning
Authors: Leopoldo Bertossi, Nina Pardal, Santiago Cifuentes et al.
Year: 2024Source: arXiv (Cornell University)
Attribution scores reflect how important the feature values in an input entity are for the output of a machine learning model. One of the most popular attribution scores is the SHAP score, which is an...
The problem of estimation and forecasting of obesity prevalence using sparsely collected data
Authors: Jocelyn Dunstan, Cristóbal Cuadrado, Luis Rojo-González et al.
Year: 2024Source: Engineering Applications of Artificial Intelligence
The Meso News-Space as a Framework for Studying Mobile Instant Messaging Services
Authors: Sebastián Valenzuela, Marcelo Santos
Year: 2024Source: Digital Journalism
Automatic Detection of Distant Metastasis Mentions in Radiology Reports in Spanish
Authors: Jocelyn Dunstan, Matías Rojas, Pablo Báez et al.
Year: 2024Source: JCO Clinical Cancer Informatics
A critical task in oncology is extracting information related to cancer metastasis from electronic health records. Metastasis-related information is crucial for planning treatment, evaluating patient ...
A pseudonymized corpus of occupational health narratives for clinical entity recognition in Spanish
Authors: Jocelyn Dunstan, Víctor Rocco, Fabián Villena et al.
Year: 2024Source: Research Square (Research Square)
Despite the high creation cost, annotated corpora are indispensable for robust natural language processing systems. In the clinical field, apart from annotating medical entitie...
Securing Verified IO Programs Against Unverified Code in F*
Authors: Eric Tanter, Cătălin Hriţcu, Ştefan Ciobâcă et al.
Year: 2024Source: Proceedings of the ACM on Programming Languages
We introduce SCIO*, a formally secure compilation framework for statically verified programs performing input-output (IO). The source language is an F* subset in which a verified program interacts wit...
Responsible AI in Farming: A Multi-Criteria Framework for Sustainable Technology Design
Authors: Ricardo Baeza-Yates, Kevin Mallinger
Year: 2024Source: Applied Sciences
The continuous fusion of artificial intelligence (AI) and autonomous farming machinery (e.g., drones and field robots) provides a significant shift in the daily work experience of farmers. Faced with ...
A Transforming Digital Journalism Editorial Team Calls for a Tribute and a Welcome
Authors: Magdalena Saldaña, Oscar Westlund
Year: 2024Source: Digital Journalism
Toward an AI Knowledge Assistant for Context-Aware Learning Experiences in Software Capstone Project Development
Authors: Marcelo Mendoza, Andrés Neyem, Juan Pablo Sandoval Alcocer et al.
Year: 2024Source: IEEE Transactions on Learning Technologies
Software assistants have significantly impacted software development for both practitioners and students, particularly in capstone projects. The effectiveness of these tools varies based on their know...
Cross-Lingual Cross-Domain Transfer Learning for Rumor Detection
Authors: Marcelo Mendoza, Mauricio Solar, Eliana Providel
Year: 2024
Chile: La deriva del sistema político y el fracaso del nuevo proceso constitucional
Authors: Sergio Toro, Agustina Noguera
Year: 2024Source: Revista de ciencia política
Enumeration and Updates for Conjunctive Linear Algebra Queries Through Expressibility
Authors: Thomas Muñoz, Cristian Riveros, Stijn Vansummeren
Year: 2024Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
Due to the importance of linear algebra and matrix operations in data analytics, there is significant interest in using relational query optimization and processing techniques for evaluating (sparse) ...
All Models are Wrong, But Some are Deadly: Inconsistencies in Emotion Detection in Suicide-related Tweets
Authors: Ricardo Baeza-Yates, Resmi Ramachandranpillai, Annika Marie Schoene et al.
Year: 2024
A Credibility Divide? Discerning Truth From Misinformation in Chile
Authors: Sebastián Valenzuela, Ingrid Bachmann, Daniel Halpern et al.
Year: 2024Source: International Journal of Public Opinion Research
Studies on misinformation often overlook people’s assessment of true information, focusing instead on beliefs in and sharing of false content. This is problematic, as it limits scholars’ ...
Extracting and Encoding: Leveraging Large Language Models and Medical Knowledge to Enhance Radiological Text Representation
Authors: Denis Parra, Pablo Messina, Álvaro Soto et al.
Year: 2024Source: Findings of the Association for Computational Linguistics: ACL 2022
Overconfidence is Key: Verbalized Uncertainty Evaluation in Large Language and Vision-Language Models
Authors: Matías Toro, Tobias Groot
Year: 2024
Post-processing of Medical Image for Neurosurgical Planning with Academic Purposes
Authors: Pablo Barceló, Rocío Buenamaizón, Ricardo Berjano et al.
Year: 2024Source: IFMBE proceedings
Post-Processing Applied to Brain Tumor Surgery: Case studies
Authors: Pablo Barceló, Rocío Buenamaizón, Ricardo Berjano et al.
Year: 2024Source: IFMBE proceedings
An optimized relational database for querying structural patterns in proteins
Authors: Renzo Angles, Roberto García, Mauricio Arenas‐Salinas et al.
Year: 2024Source: Database
A database is an essential component in almost any software system, and its creation involves more than just data modeling and schema design. It also includes query optimization and tuning. T...
YARS-PG: Property Graphs Representation for Publication and Exchange
Authors: Renzo Angles, Dominik Tomaszuk, Łukasz Szeremeta
Year: 2024Source: IEEE Access
Graph serialization is a critical aspect of advancing graph-oriented systems and applications. Despite the importance of standardized serialization for property graphs, there is a lack of a universal ...
Path Querying in Graph Databases: A Systematic Mapping Study
Authors: Renzo Angles, Roberto García
Year: 2024Source: IEEE Access
Path querying refers to the evaluation of path queries in a graph database. New research in this topic is crucial for the development of graph database systems as path queries are associated with rele...
The Property Graph Data Format (PGDF)
Authors: Renzo Angles, Sebastián Ferrada, Ignacio Burgos
Year: 2024Source: IEEE Access
Property graphs are popular in both industry and academia due to their versatility in modeling complex data across diverse application domains, ranging from social networks to knowledge graphs. Despit...
Responsible AI: An Urgent Mandate
Authors: Ricardo Baeza-Yates, Usama M. Fayyad
Year: 2024Source: IEEE Intelligent Systems
AI is rapidly becoming essential in various industries, raising societal expectations. AI's societal consequences include impacts on mental health; misinformation; workforce displacement; and economic...
iHealth-Chile-3&2 at RRG24: Template Based Report Generation
Authors: Denis Parra, Pablo Messina, Álvaro Soto et al.
Year: 2024
iHealth-Chile-1 at RRG24: In-context Learning and Finetuning of a Large Multimodal Model for Radiology Report Generation
Authors: Denis Parra, Pablo Messina, Rafael Elberg et al.
Year: 2024
Speedy Gonzales: A Collection of Fast Task-Specific Models for Spanish
Authors: Felipe Bravo-Márquez, José Cañete
Year: 2024
Wheeler Maps
Authors: Gonzalo Navarro, Travis Gagie, Jouni Sirén et al.
Year: 2024Source: Lecture notes in computer science
Iterated Straight-Line Programs
Authors: Gonzalo Navarro, C. Urbina
Year: 2024Source: Lecture notes in computer science
Space-Efficient Conversions from SLPs
Authors: Gonzalo Navarro, Travis Gagie, Adrián Goga et al.
Year: 2024Source: Lecture notes in computer science
News Gathering: Leveraging Transformers to Rank News
Authors: Hans Löbel, Maximiliano Ojeda, María José Apolo et al.
Year: 2024Source: Lecture notes in computer science
A Privacy-Preserving Corpus for Occupational Health in Spanish: Evaluation for NER and Classification Tasks
Authors: Jocelyn Dunstan, Víctor Rocco, Fabián Villena et al.
Year: 2024
Claudio Aracena, Luis Miranda, Thomas Vakili, Fabián Villena, Tamara Quiroga, Fredy Núñez-Torres, Victor Rocco, Jocelyn Dunstan. Proceedings of the 6th Clinical Natural Language Processing Workshop...
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
Authors: Toqeer Ehsan, Jiahui Geng, Tiago Timponi Torrent et al.
Year: 2024
Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual a...
How Could Be Used Student Comments for Delivering Feedback to Instructors in Higher Education?
Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo
Year: 2024Source: Communications in computer and information science
Geospatial Raster Data Processing Applying Neural Networks
Authors: Magdalena Saldaña, Carlos Guzmán Sanchéz-Mejorada, Rolando Quintero et al.
Year: 2024Source: Communications in computer and information science
Publications for 2023
Displaying 159 publication(s) for 2023
ACHS-Privacy Corpus
Authors: Tamara Quiroga, Thomas Vakili, Claudio Aracena et al.
Year: 2023Source: Zenodo (CERN European Organization for Nuclear Research)
A Panel Study on the Dynamics of Social Media Use and Conspiracy Thinking
Authors: Sebastián Valenzuela, Daniel Halpern, Sangwon Lee et al.
Year: 2023Source: Media Psychology
Studies exploring the association between social media use and belief in conspiracy theories have yielded mixed evidence. To address this inconsistency, we focus on conspiracy thinking – a predispos...
K-Focal Search for Slow Learned Heuristics
Authors: Jorge Baier, Carlos Hernández, Jorge Toro et al.
Year: 2023Source: IEEE Access
Bounded suboptimal heuristic search is a family of search algorithms capable of solving hard combinatorial problems, returning suboptimal solutions within a given bound. Recent machine learning approac...
Unveiling Backbone Effects in CLIP: Exploring Representational Synergies and Variances
Authors: Felipe Bravo-Márquez, Edison Marrese-Taylor, I. Jara et al.
Year: 2023Source: arXiv (Cornell University)
Contrastive Language-Image Pretraining (CLIP) stands out as a prominent method for image representation learning. Various neural architectures, spanning Transformer-based models like Vision Transforme...
Predicting disease severity in multiple sclerosis using multimodal data and machine learning
Authors: Susanna Asseyer, Synne Brune-Ingebretse, Tone Berge et al.
Year: 2023Source: Journal of Neurology
Multiple sclerosis patients would benefit from machine learning algorithms that integrate clinical, imaging and multimodal biomarkers to define the risk of disease activity.
Ciencias, golpe de Estado y Dictadura en Chile
Authors: Claudio Gutiérrez
Year: 2023Source: Anales de la Universidad de Chile
of physical "cleansing". In the third, we address the ideological and disciplinary "cleansing". In the fourth, we deal with the
Bias and the Web
Authors: Ricardo Baeza-Yates, Leena Murgai
Year: 2023
Bias is everywhere, sometimes blatantly explicit, but most of the time it’s hidden, as it often arises from that which is missing, the gaps in our knowledge or data. In this chapter, we cov...
Measuring Bias
Authors: Ricardo Baeza-Yates, Aida Sharif Rohani
Year: 2023Source: 2021 IEEE International Conference on Big Data (Big Data)
The extensive use of machine learning (ML) for supporting or making major decisions such as employment, credit card approval, or juridical decisions has resulted in rising concerns over the widespread...
Differential privacy and SPARQL
Authors: Federico Olmedo, Carlos Buil-Aranda, Jorge Lobo
Year: 2023Source: Semantic Web
Differential privacy is a framework that provides formal tools to develop algorithms to access databases and answer statistical queries with quantifiable accuracy and privacy guarantees. The notions o...
Sherlock-wannabes or when the audience fact-checks. How ideology, education, and alternative media use explain fact-checking behaviors
Authors: Magdalena Saldaña, Marcelo Santos
Year: 2023Source: Estudios sobre el Mensaje Periodístico
When confronted with suspicious information, the most common advice is to rely on trusted, well-known news media outlets to verify it. However, in a high-choice, fragmented media ecosystem, news reade...
Local Government, Social Media and Management of COVID-19: The Case of Chilean Mayoral Communication
Authors: Daniel Alcatruz, Fernando Rosenblatt, Cristian Pérez Muñóz et al.
Year: 2023Source: Political Communication
Most research on governments' use of social media focuses on the national or federal level. We therefore know little about the way local authorities harness social media platforms to communicate with ...
Report on the 46th ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023): Reflections from the Program Co-Chairs
Authors: Bárbara Poblete, Josiane Mothe, Makoto P. Kato
Year: 2023Source: ACM SIGIR Forum
The ACM SIGIR Conference on Research and Development in Information Retrieval has been experiencing significant growth over the past few years. In 2023, SIGIR received a total of 822 full paper submis...
Politics and Media in Journalism & Mass Communication Quarterly: A Centennial Research Retrospective
Authors: Sebastián Valenzuela, Homero Gil de Zúñiga, Ingrid Bachmann et al.
Year: 2023Source: Journalism & Mass Communication Quarterly
Based on computerized and manual content analyses, we examined the theories, methods, topics, and authors’ backgrounds of the empirical articles revolving around politics and media published by Jour...
Report on the 13th Workshop on Temporal Web Analytics (TempWeb 2023) at WWW 2023
Authors: Ricardo Baeza-Yates, Omar Alonso, Marc Spaniol
Year: 2023Source: ACM SIGIR Forum
TempWeb is an established Workshop (series) with a long-standing tradition as a co-located event at The Web Conference. Considering the constantly evolving Web as a primary object of research, TempWeb...
FairXAI - A Taxonomy and Framework for Fairness and Explainability Synergy in Machine Learning
Authors: Ricardo Baeza-Yates, Fredrik Heintz, Resmi Ramachandranpillai
Year: 2023
Explainable Artificial Intelligence (XAI) and Fair Learning have made significant strides in various application domains, including criminal recidivism predictions, healthcare settings, toxic...
Near-Optimal Search Time in $\delta$-Optimal Space, and Vice Versa
Authors: Gonzalo Navarro, Tomasz Kociumaka, Francisco Javier Vidal Olivares
Year: 2023Source: Algorithmica
Generative AI models should include detection mechanisms as a condition for public release
Authors: Ricardo Baeza-Yates, Raja Chatila, David Eyers et al.
Year: 2023Source: Ethics and Information Technology
The new wave of ‘foundation models’, general-purpose generative AI models for production of text (e.g., ChatGPT) or images (e.g., MidJourney), represent a dramatic advance in the state...
A comparative dataset: Bridging COVID-19 and other diseases through epistemonikos and CORD-19 evidence
Authors: Denis Parra, Hans Löbel, Andrés Carvallo et al.
Year: 2023Source: Data in Brief
The COVID-19 pandemic has underlined the need for reliable information for clinical decision-making and public health policies. As such, evidence-based medicine (EBM) is essential in identifying and e...
Bias Invariant Approaches for Improving Word Embedding Fairness
Authors: Bárbara Poblete, Vanessa Murdock, Rongting Zhang et al.
Year: 2023
Many public pre-trained word embeddings have been shown to encode different types of biases. Embeddings are often obtained from training on large pre-existing corpora, and therefore resulting biases c...
A Uniform Language to Explain Decision Trees
Authors: Pablo Barceló, Marcelo Arenas, Bernardo Subercaseaux et al.
Year: 2023Source: arXiv (Cornell University)
The recent development of formal explainable AI has disputed the folklore claim that "decision trees are readily interpretable models", showing different interpretability queries that are computationa...
A neuro-symbolic framework for answering conjunctive queries
Authors: Pablo Barceló, Juan Reutter, Floris Geerts et al.
Year: 2023Source: arXiv (Cornell University)
The challenge of answering graph queries over incomplete knowledge graphs is gaining significant attention in the machine learning community. Neuro-symbolic models have emerged as a promising approach...
Logical Languages Accepted by Transformer Encoders with Hard Attention
Authors: Pablo Barceló, Anthony W. Lin, Alexander Kozachinskiy et al.
Year: 2023Source: arXiv (Cornell University)
We contribute to the study of formal languages that can be recognized by transformer encoders. We focus on two self-attention mechanisms: (1) UHAT (Unique Hard Attention Transformers) and (2) AHAT (Av...
Natural language processing analysis of the psychosocial stressors of mental health disorders during the pandemic
Authors: Susana Eyheramendy, Maria Paz Hermosilla, Isidora Paiva-Mack et al.
Year: 2023Source: npj Mental Health Research
Over the past few years, the COVID-19 pandemic has exerted various impacts on the world, notably concerning mental health. Nevertheless, the precise influence of psychosocial stressors on thi...
Evaluation of 3D Reconstruction for Cultural Heritage Applications
Authors: Benjamín Bustos, Ivan Sipiran, Cristián Llull et al.
Year: 2023
In recent years, we have seen the emergence of methods for creating 3D digital reproductions of objects using photos. These techniques, particularly when combined with handheld video devices like smar...
An empirical study of the effect of video encoders on Temporal Video Grounding
Authors: Felipe Bravo-Márquez, Edison Marrese-Taylor, I. Jara et al.
Year: 2023
Temporal video grounding is a fundamental task in computer vision, aiming to localize a natural language query in a long, untrimmed video. It has a key role in the scientific community, in part due to...
On the Power of the Weisfeiler-Leman Test for Graph Motif Parameters
Authors: Pablo Barceló, Matthias Lanzinger
Year: 2023Source: arXiv (Cornell University)
Seminal research in the field of graph neural networks (GNNs) has revealed a direct correspondence between the expressive capabilities of GNNs and the $k$-dimensional Weisfeiler-Leman ($k$WL) test, a ...
No Agreement Without Loss: Learning and Social Choice in Peer Review
Authors: Pablo Barceló, Tomasz Steifer, Cristóbal Rojas et al.
Year: 2023Source: Frontiers in artificial intelligence and applications
In peer review systems, reviewers are often asked to evaluate various features of submissions, such as technical quality or novelty. A score is given to each of the predefined features and based on th...
Uncovering Bias in Personal Informatics
Authors: Ricardo Baeza-Yates, Athena Vakali, Pavlos Sermpezis et al.
Year: 2023Source: Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies
Personal informatics (PI) systems, powered by smartphones and wearables, enable people to lead healthier lifestyles by providing meaningful and actionable insights that break down barriers between use...
2.2 kW single-mode narrow-linewidth laser delivery through a hollow-core fiber
Authors: Denis Parra, Matthew Cooper, Joseph Wahlen et al.
Year: 2023Source: Optica
Antiresonant hollow-core fibers (AR-HCFs) have opened up exciting possibilities for high-energy and high-power laser delivery because of their exceptionally low nonlinearities and high damage threshol...
Los presidencialismos y la inestabilidad política en América Latina: Contención e incorporación del conflicto durante el siglo XIX
Authors: Sergio Toro, Juan Carlos Arellano González, Alejandro Olivares
Year: 2023Source: Revista Chilena de Derecho y Ciencia Política
One of the main characteristics of Latin American presidential systems is that, throughout history, they have shown various moments of instability. In search of some expl...
Optimizing RPQs over a compact graph representation
Authors: Aidan Hogan, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2023Source: The VLDB Journal
SparqLog: A System for Efficient Evaluation of SPARQL 1.1 Queries via Datalog
Authors: Renzo Angles, Georg Gottlob, Reinhard Pichler et al.
Year: 2023Source: Proceedings of the VLDB Endowment
Over the past decade, Knowledge Graphs have received enormous interest both from industry and from academia. Research in this area has been driven, above all, by the Database (DB) community and the Se...
Truth be told: How “true” and “false” labels influence user engagement with fact-checks
Authors: Ernesto Calvo, Sebastián Valenzuela, Ingrid Bachmann et al.
Year: 2023Source: New Media & Society
When do users share fact-checks on social media? We describe a survey experiment conducted during the 2019 election in Argentina measuring the propensity of voters to share corrections to political mi...
‘Does she know how to read?’ An intersectional perspective to explore Twitter users’ portrayal of women Mapuche leaders
Authors: Magdalena Saldaña, Ximena Orchard, Isabel Pavez et al.
Year: 2023Source: Information Communication & Society
Social media offer new opportunities for women in politics, but also new ground for the expression of bias and stereotypes. Drawing upon literature about mediated representations of women in p...
Trends in the Global Information Environment: 2023 Expert Survey Results
Authors: Sebastián Valenzuela, Wendy Hui Kyong Chun, Philip N. Howard et al.
Year: 2023
The information environment is rapidly evolving, with algorithmic bias, manipulation and misinformation having a significant impact on public life. The global network of researchers is an important so...
Expert Survey on the Global Information Environment 2023: Lessons for Technology Policy and Design
Authors: Sebastián Valenzuela, Wendy Hui Kyong Chun, Philip N. Howard et al.
Year: 2023
The global information environment is impacted by both technology design and public policy. This Summary for Policymakers summarizes Trends in the Global Information Environment: 2023 Expert Survey Re...
Efficient construction of the BWT for repetitive text using string compression
Authors: Gonzalo Navarro, Diego Díaz-Domínguez
Year: 2023Source: Information and Computation
We present a new semi-external algorithm that builds the Burrows–Wheeler transform variant of Bauer et al. (a.k.a., BCR BWT) in linear expected time. Our method uses compression techniques to reduce...
Artificial intelligence-based decision-making: can ChatGPT replace a multidisciplinary tumour board?
Authors: Sebastián Valenzuela, Javier Vela Ulloa, Christophe Riquoir Altamirano et al.
Year: 2023Source: British journal of surgery
Artificial intelligence (AI) has been around for a while. Recent reports [1,2] have evaluated its role in assisting clinical decision-making, with promising results. After its launch in 2022 by OpenAI (Sa...
DIVERGÊNCIA KULLBACK-LEIBLER APLICADA A FWI
Authors: Juan Pablo Luna, Gilberto Barbosa Neto Carvalho, Virgílio José Martins Ferreira Filho
Year: 2023Source: Revista Contemporânea
FWI (Full-Waveform Inversion) is one of the most robust methods for extracting seismic information. However, the L2 norm used to measure the difference between seismic data is not always the best op...
The Shapley Value in Database Management
Authors: Leopoldo Bertossi, Mikaël Monet, Ester Livshits et al.
Year: 2023Source: ACM SIGMOD Record
Attribution scores can be applied in data management to quantify the contribution of individual items to conclusions from the data, as part of the explanation of what led to these conclusions. In Arti...
Fair Multilingual Vandalism Detection System for Wikipedia
Authors: Ricardo Baeza-Yates, Diego Sáez-Trumper, Mykola Trokhymovych et al.
Year: 2023Source: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
This paper presents a novel design of the system aimed at supporting the Wikipedia community in addressing vandalism on the platform. To achieve this, we collected a massive dataset of 47 languages, a...
Gradual Sensitivity Typing
Authors: Matías Toro, Eric Tanter, Damián Árquez et al.
Year: 2023Source: arXiv (Cornell University)
Reasoning about the sensitivity of functions with respect to their inputs has interesting applications in various areas, such as differential privacy. In order to check and enforce sensitivity, severa...
Physics-informed neural networks for blood flow inverse problems
Authors: Jeremías Garay, Francisco Sahli Costabal, Jocelyn Dunstan et al.
Year: 2023Source: arXiv (Cornell University)
Physics-informed neural networks (PINNs) have emerged as a powerful tool for solving inverse problems, especially in cases where no complete information about the system is known and scatter measureme...
LECTURE HELD AT THE ACADEMIA EUROPAEA BUILDING BRIDGES CONFERENCE 2022
Authors: Ricardo Baeza-Yates
Year: 2023Source: European Review
Artificial intelligence (AI) has finally reached most people on our planet thanks to generative AI tools for text and other media. This has started a controversy about the possible benefits and risks,...
Towards a Comprehensive Human-Centred Evaluation Framework for Explainable AI
Authors: Denis Parra, Katrien Verbert, Ivania Donoso-Guzmán et al.
Year: 2023Source: arXiv (Cornell University)
While research on explainable AI (XAI) is booming and explanation techniques have proven promising in many application domains, standardised human-centred evaluation procedures are still missing. In a...
Attribution-Scores in Data Management and Explainable Machine Learning
Authors: Leopoldo Bertossi
Year: 2023Source: arXiv (Cornell University)
We describe recent research on the use of actual causality in the definition of responsibility scores as explanations for query answers in databases, and for outcomes from classification models in mac...
Influence of quality of reduction using radiological criteria on kinematics and kinetics in ankle fractures with unstable syndesmotic injury
Authors: Aidan Hogan, Ursula Trinler, Paul Alfred Grützner et al.
Year: 2023Source: Clinical Biomechanics
Although the data did not show that radiological reduction criteria have a statistically significant effect on active functional outcome after a mean follow-up time of 5.7 years, tendencies for a bet...
How are AI assistants changing higher education?
Authors: Ricardo Baeza-Yates, Michael Neumann, Eva-Maria Schön et al.
Year: 2023Source: Frontiers in Computer Science
Context: Higher education is changing at an accelerating pace due to the widespread use of digital teaching and emerging technologies. In particular, AI assistants such as ChatGPT pose significant chal...
Evaluating Regular Path Queries on Compressed Adjacency Matrices
Authors: Gonzalo Navarro, Diego Arroyuelo, Adrián Gómez-Brandón
Year: 2023Source: arXiv (Cornell University)
Regular Path Queries (RPQs), which are essentially regular expressions to be matched against the labels of paths in labeled graphs, are at the core of graph database query languages like SPARQL. A way...
Wikipedia Multilingual Vandalism Detection Dataset
Authors: Ricardo Baeza-Yates, Diego Sáez-Trumper, Mykola Trokhymovych et al.
Year: 2023Source: Zenodo (CERN European Organization for Nuclear Research)
This dataset accompanies a research paper that introduces a novel system designed to support the Wikipedia community in combating vandalism on the platform. The dataset has been prepared to enhance th...
A transcription and information extraction system to facilitate EHR documentation in Spanish
Authors: Fredy Núñez Torres, M. M. Rojas Fernández, Jocelyn Dunstan et al.
Year: 2023Source: Research Square (Research Square)
The large and diverse access to data sources in healthcare has boosted the application of novel computer techniques that can extract meaningful information to improve patients'...
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
Authors: Bárbara Poblete, Hsin‐Hsi Chen, Josiane Mothe et al.
Year: 2023
RiverText: A Python Library for Training and Evaluating Incremental Word Embeddings from Text Data Streams
Authors: Felipe Bravo-Márquez, Gabriel Iturra-Bocaz
Year: 2023Source: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
Word embeddings have become essential components in various information retrieval and natural language processing tasks, such as ranking, document classification, and question answering. However, desp...
Criminal Politics and Botched Development in Contemporary Latin America
Authors: Andreas Emil Feldmann, Juan Pablo Luna
Year: 2023Source: Cambridge University Press eBooks
This Element investigates the relationship between the narcotics industry and politics and assesses how it influences domestic political dynamics, including economic development prospects in Latin Ame...
SparqLog: A System for Efficient Evaluation of SPARQL 1.1 Queries via Datalog [Experiment, Analysis and Benchmark]
Authors: Renzo Angles, Georg Gottlob, Reinhard Pichler et al.
Year: 2023Source: arXiv (Cornell University)
Over the past decade, Knowledge Graphs have received enormous interest both from industry and from academia. Research in this area has been driven, above all, by the Database (DB) community and the Se...
State Responses to Autonomy Demands: Indigenous Movements and Regional Threats in Bolivia and Ecuador
Authors: Carla Alberti, Shannan Mattiace
Year: 2023Source: Journal of Politics in Latin America
In this paper, we examine the political factors that explain state responses to demands for indigenous territorial autonomy in Ecuador and Bolivia. Specifically, we aim to explain why the 2009 Bolivia...
Automatic Coding at Scale: Design and Deployment of a Nationwide System for Normalizing Referrals in the Chilean Public Healthcare System
Authors: Felipe Van Der Huck Arias, Jorge E. Pacheco, Paulina Vera et al.
Year: 2023Source: arXiv (Cornell University)
The disease coding task involves assigning a unique identifier from a controlled vocabulary to each disease mentioned in a clinical document. This task is relevant since it allows information extracti...
The Impact of the Web on Information Retrieval
Authors: Ricardo Baeza-Yates, Peter Mika
Year: 2023Source: ACM eBooks
Uncovering Bias in Personal Informatics
Authors: Ricardo Baeza-Yates, Athena Vakali, Pavlos Sermpezis et al.
Year: 2023Source: Zenodo (CERN European Organization for Nuclear Research)
REmatch: A Novel Regex Engine for Finding All Matches
Authors: Cristian Riveros, Domagoj Vrgoč, Nicolás Van Sint Jan
Year: 2023Source: Proceedings of the VLDB Endowment
In this paper, we present the REmatch system for information extraction. REmatch is based on a recently proposed enumeration algorithm for evaluating regular expressions with capture variables support...
Visual Exploration of Repetitive Patterns on Ancient Peruvian Pottery
Authors: Benjamín Bustos, Ivan Sipiran, Tobias Schreck et al.
Year: 2023Source: Journal of WSCG
The analysis and understanding of artefact properties and their relationships is a key goal in archaeological analysis of cultural heritage objects. There are many aspects of concern, including shape ...
Strategies for Improving the Global Information Environment: Results from a Systematic Review and Meta-Analysis
Authors: Sebastián Valenzuela, Wendy Hui Kyong Chun, Philip N. Howard et al.
Year: 2023
Which design solutions mitigate the impact of misinformation on social media platforms, according to the latest scientific research? This IPIE Summary for Policymakers presents the main findings of tw...
Joint models reveal genetic architecture of transitions between pubertal stages and their association with BMI in a Latino population
Authors: Susana Eyheramendy, Lucas Vicuña, Verónica Mericq et al.
Year: 2023Source: medRxiv (Cold Spring Harbor Laboratory)
Early or late pubertal onset can lead to disease in adulthood, including cancer, obesity, type 2 diabetes, metabolic disorders, bone fractures and psychopathologies. Thus, knowing the age at ...
Special Issue on “Artificial Intelligence‐Driven Decision Making in Health and Medicine”
Authors: Leopoldo Bertossi, Herb Kunze, Marc Poulin et al.
Year: 2023Source: International Transactions in Operational Research
Artificial Intelligence (AI) refers to an interdisciplinary area which embraces computer science, robotics, engineering, mathematics, and statistics, and is largely based on the ability of a machine t...
Radiografía de un mito: la representación estereotipada de los videojugadores puesta a prueba en Chile
Authors: Magdalena Saldaña, Marco Jaramillo
Year: 2023Source: Comunicación y Medios
Video game players have been portrayed in the media and in cultural products as boys or young men with little social life, with no participation in community life...
Human-AI Coevolution
Authors: Ricardo Baeza-Yates, Dino Pedreschi, Alistair Knott et al.
Year: 2023Source: arXiv (Cornell University)
The rise of large-scale socio-technical systems in which humans interact with artificial intelligence (AI) systems (including assistants and recommenders, in short AIs) multiplies the opportunity for ...
Evaluating Pre-training Strategies for Collaborative Filtering
Authors: Denis Parra, Leandro Balby Marinho, Rodrygo L. T. Santos et al.
Year: 2023
Pre-training is essential for effective representation learning models, especially in natural language processing and computer vision-related tasks. The core idea is to learn representations, usually ...
From Database Repairs to Causality in Databases and Beyond
Authors: Leopoldo Bertossi
Year: 2023Source: arXiv (Cornell University)
We describe some recent approaches to score-based explanations for query answers in databases. The focus is on work done by the author and collaborators. Special emphasis is placed on the use of count...
A Copernican Revolution in Data
Authors: Claudio Gutiérrez
Year: 2023Source: arXiv (Cornell University)
Half a century ago, Charles Bachman foresaw the significance and centrality of data in the digital world. In this short paper, we delve into the evolution of these ideas within the database community ...
MillenniumDB: An Open-Source Graph Database System
Authors: Cristian Riveros, Aidan Hogan, Carlos Buil-Aranda et al.
Year: 2023Source: Data Intelligence
In this systems paper, we present MillenniumDB: a novel graph database engine that is modular, persistent, and open source. MillenniumDB is based on a graph data model, which we call domain g...
PG-Schema: Schemas for Property Graphs
Authors: Renzo Angles, Domagoj Vrgoč, Juan Sequeda et al.
Year: 2023Source: Proceedings of the ACM on Management of Data
Property graphs have reached a high level of maturity, witnessed by multiple robust graph database systems as well as the ongoing ISO standardization effort aiming at creating a new standard Graph Que...
Do Fiscal Transfers Affect Local Democracy? Lessons from Chilean Municipalities
Authors: Carla Alberti, Diego Díaz Rioseco, Ignacio Riveros
Year: 2023Source: Latin American Politics and Society
Extant literature concurs that fiscal transfers affect local democracy when they grant subnational governments nontax revenue. Yet there is nonetheless a mismatch between this concept and exi...
Evaluating Regular Path Queries in GQL and SQL/PGQ: How Far Can The Classical Algorithms Take Us?
Authors: Domagoj Vrgoč, Carlos Rojas, Benjamín Farías
Year: 2023Source: arXiv (Cornell University)
Path queries are a core feature of modern graph query languages such as Cypher, SQL/PGQ, and GQL. These languages provide a rich set of features for matching paths, such as restricting to certain path...
Cross-Lingual and Cross-Domain Crisis Classification for Low-Resource Scenarios
Authors: Bárbara Poblete, Hernán Sarmiento, Andrés Abeliuk et al.
Year: 2023Source: Proceedings of the International AAAI Conference on Web and Social Media
Social media data has emerged as a useful source of timely information about real-world crisis events. One of the main tasks related to the use of social media for disaster management is the automatic...
Characterizing and Identifying Socially Shared Self-Descriptions in Product Reviews
Authors: Bárbara Poblete, Vanessa Murdock, Chia-Jung Lee et al.
Year: 2023Source: Proceedings of the International AAAI Conference on Web and Social Media
Online e-commerce product reviews can be highly influential in a customer's decision-making processes. Reviews often describe personal experiences with a product and provide candid opinions about a pr...
GPC: A Pattern Calculus for Property Graphs
Authors: Domagoj Vrgoč, Leonid Libkin, Wim Martens et al.
Year: 2023
Fair multilingual vandalism detection system for Wikipedia
Authors: Ricardo Baeza-Yates, Diego Sáez-Trumper, Mykola Trokhymovych et al.
Year: 2023Source: arXiv (Cornell University)
This paper presents a novel design of the system aimed at supporting the Wikipedia community in addressing vandalism on the platform. To achieve this, we collected a massive dataset of 47 languages, a...
The ACM PODS Alberto O. Mendelzon Test-of-Time Award 2023
Authors: Marcelo Arenas, Wenfei Fan, Frank Neven
Year: 2023
Citations for the ACM PODS Alberto O. Mendelzon Test-of-Time Award 2023
Data Stories of Water: Studying the Communicative Role of Data Visualizations within Long‐form Journalism
Authors: Denis Parra, Manuela Garretón, Francesca Morini et al.
Year: 2023Source: Computer Graphics Forum
We present a methodology for making sense of the communicative role of data visualizations in journalistic storytelling and share findings from surveying water‐related data stories. Data st...
Framing school choice and merit: news media coverage of an education policy in Chile
Authors: Magdalena Saldaña, Cristian Cabalín, M. Beatriz Fernández
Year: 2023Source: Discourse Studies in the Cultural Politics of Education
School choice is a controversial issue in the public discussion of education. In Chile, the new School Admission System (SAE) was recently implemented to gradually reverse the country’s high educati...
The value of mathematical modelling approaches in epidemiology for public health decision making
Authors: Ricardo Baeza-Yates, Martha Ospina, Oscar H. Franco et al.
Year: 2023Source: Colombian Journal of Anesthesiology
This article discusses the relevance of quantitative approaches, specifically mathematical modelling in epidemiology, in the public health decision-making process. This topic is discussed here based on the e...
Engineering Rank/Select Data Structures for Big-Alphabet Strings
Authors: Diego Arroyuelo, Erick Sepúlveda, Francisco Riveros et al.
Year: 2023Source: arXiv (Cornell University)
Big-alphabet strings are common in several scenarios such as information retrieval and natural-language processing. The efficient storage and processing of such strings usually introduces several chal...
Separating Automatic Relations
Authors: Pablo Barceló, Diego Figueira, Rémi Morvan
Year: 2023Source: arXiv (Cornell University)
We study the separability problem for automatic relations (i.e., relations on finite words definable by synchronous automata) in terms of recognizable relations (i.e., finite unions of products of reg...
MUSIB: musical score inpainting benchmark
Authors: Denis Parra, Felipe Bravo-Márquez, Rodrigo F. Cádiz et al.
Year: 2023Source: EURASIP Journal on Audio Speech and Music Processing
Music inpainting is a sub-task of automated music generation that aims to infill incomplete musical pieces to help musicians in their musical composition process. Many methods have been devel...
Uneven States, Unequal Societies, and Democracy’s Unfulfilled Promises: Citizenship Rights in Chile and Contemporary Latin America
Authors: Rodrigo M. Medel, Juan Pablo Luna
Year: 2023Source: Latin American Politics and Society
In contemporary Latin America, deep-seated social discontent with political elites and institutions has been, paradoxically, the counterpart of democratic stability and resilience. This parad...
RDF Playground: An Online Tool for Learning about the Semantic Web
Authors: Aidan Hogan, Raúl Cid, Bastián Inostroza
Year: 2023
We present RDF Playground: a web-based tool to assist those who wish to learn or teach about the Semantic Web. The tool integrates functionalities relating to the key features of RDF, allowing users t...
Templet: A Collaborative System for Knowledge Graph Question Answering over Wikidata
Authors: Aidan Hogan, Francisca Suárez
Year: 2023
We present Templet: an online question answering (QA) system for Wikidata. Templet is based on the collaboratively-edited repository QAWiki, which collects questions in multiple natural languages alon...
Wikidata Atlas: Putting Wikidata on the Map
Authors: Aidan Hogan, Benjamín Del Pino
Year: 2023
Wikidata Atlas is an online system that allows users to explore Wikidata items on an interactive global map; for example, users can explore the global distribution of all lighthouses described by Wiki...
A convolutional architecture for 3D model embedding using image views
Authors: Benjamín Bustos, Ivan Sipiran, Arniel Labrada
Year: 2023Source: The Visual Computer
13th Temporal Web Analytics Workshop (TempWeb) Overview
Authors: Ricardo Baeza-Yates, Omar Alonso, Marc Spaniol
Year: 2023
Lacking time: A case study of student and faculty perceptions of academic workload in the COVID‐19 pandemic
Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo
Year: 2023Source: Journal of Engineering Education
Background: To avoid the spread of COVID‐19, most engineering programs rapidly shifted to emergency online education, and prior research has associated online education with academic overloa...
Using diversity as a source of scientific innovation for the Web
Authors: Bárbara Poblete
Year: 2023Source: Proceedings of the ACM Web Conference 2022
The Web has become a resource that allows us to make sense of social phenomena around the world. This started the moment users became content creators, and has grown with the emergence of social platf...
A Study on Information Disorders on Social Networks during the Chilean Social Outbreak and COVID-19 Pandemic
Authors: Marcelo Mendoza, Sebastián Valenzuela, Claudia López et al.
Year: 2023Source: Applied Sciences
Information disorders on social media can have a significant impact on citizens’ participation in democratic processes. To better understand the spread of false and inaccurate information online, th...
10 Years of Digital Journalism (Studies): The Past, the Present, the Future
Authors: Magdalena Saldaña, Oscar Westlund, Edson C. Tandoc et al.
Year: 2023Source: Digital Journalism
The Digital Journalism editorial team is thrilled to introduce this 10th anniversary special issue. At the beginning of 2022, we invited our international editorial board to contribute to this importa...
Digital Journalism: The Journal and the Path that Brought us Here
Authors: Magdalena Saldaña, Oscar Westlund, Edson C. Tandoc et al.
Year: 2023Source: Digital Journalism
Human-Centered Responsible Artificial Intelligence: Current & Future Trends
Authors: Jessica Vitak, Mohammad Tahaei, Seán Kennedy et al.
Year: 2023
In recent years, the CHI community has seen significant growth in research on Human-Centered Responsible Artificial Intelligence. While different research communities may use different terminology t...
The long memory of the land: Pre-colonial origins of Mapuche mobilization in Chile
Authors: Sergio Toro, Carla Alberti, Juan Pablo Luna et al.
Year: 2023Source: Political Geography
Contextual Linear Types for Differential Privacy
Authors: Matías Toro, Eric Tanter, Federico Olmedo et al.
Year: 2023Source: ACM Transactions on Programming Languages and Systems
Language support for differentially private programming is both crucial and delicate. While elaborate program logics can be very expressive, type-system-based approaches using linear types tend to be ...
A Gradual Probabilistic Lambda Calculus
Authors: Matías Toro, Federico Olmedo, Wenjia Ye
Year: 2023Source: Proceedings of the ACM on Programming Languages
Probabilistic programming languages have recently gained a lot of attention, in particular due to their applications in domains such as machine learning and differential privacy. To establish invarian...
GenoVi, an open-source automated circular genome visualizer for bacteria and archaea
Authors: Carlos Buil-Aranda, Mauricio Araya, Nicolás Jara et al.
Year: 2023Source: PLoS Computational Biology
The increase in microbial sequenced genomes from pure cultures and metagenomic samples reflects the current attainability of whole-genome and shotgun sequencing methods. However, software for genome v...
Studying the Downstream Effects of Fact-Checking on Social Media: Experiments on Correction Formats, Belief Accuracy, and Media Trust
Authors: Sebastián Valenzuela, Ingrid Bachmann
Year: 2023Source: Social Media + Society
Repeated exposure to misinformation not only reduces the accuracy of people’s beliefs, but it also decreases confidence in institutions such as the news media. Can fact-checking—journalism’s mai...
Compact representations of spatial hierarchical structures with support for topological queries
Authors: Gonzalo Navarro, José Fuentes‐Sepúlveda, Diego Seco et al.
Year: 2023Source: Information and Computation
A Researcher's Digest of GQL
Authors: Filip Murlak, Nadime Francis, Victor Marsault et al.
Year: 2023Source: HAL (Le Centre pour la Communication Scientifique Directe)
GQL (Graph Query Language) is being developed as a new ISO standard for graph query languages to play the same role for graph databases as SQL plays for relational. In parallel, an extension of SQL fo...
Three iterations of $(d-1)$-WL test distinguish non isometric clouds of $d$-dimensional points
Authors: Pablo Barceló, Mircea Petrache, Alexander Kozachinskiy et al.
Year: 2023Source: arXiv (Cornell University)
The Weisfeiler-Lehman (WL) test is a fundamental iterative algorithm for checking isomorphism of graphs. It has also been observed that it underlies the design of several graph neural network archite...
The Chilean Waiting List sub-Corpus with medical entities normalized to UMLS terminology
Authors: Leonardo Campillos Llanos, Jocelyn Dunstan, Pablo Báez
Year: 2023Source: Zenodo (CERN European Organization for Nuclear Research)
A collection of 2000 medical referrals from the Chilean Waiting List Corpus, manually annotated with six entity types (Finding, Procedure, Disease, Family Member, Body Part, and Medication) and manual...
Efficient Computation of Shap Explanation Scores for Neural Network Classifiers via Knowledge Compilation
Authors: Leopoldo Bertossi, Jorge E. Leon
Year: 2023Source: arXiv (Cornell University)
The use of Shap scores has become widespread in Explainable AI. However, their computation is in general intractable, in particular when done with a black-box classifier, such as a neural network. Recen...
A named entity recognition framework using transformers to identify relevant clinical findings from mammographic radiological reports
Authors: Denis Parra, Eduardo Godoy, Alejandro Veloz et al.
Year: 2023
Detecting and extracting findings in a radiological report is crucial for text mining tasks in several applications. In this case, a labeled process for the image associated with the radiological repo...
Attribution-Scores and Causal Counterfactuals as Explanations in Artificial Intelligence
Authors: Leopoldo Bertossi
Year: 2023Source: arXiv (Cornell University)
In this expository article we highlight the relevance of explanations for artificial intelligence, in general, and for the newer developments in explainable AI, referring to origins and connecti...
Representing Paths in Graph Database Pattern Matching
Authors: Domagoj Vrgoč, Carlos Rojas, Stijn Vansummeren et al.
Year: 2023Source: Proceedings of the VLDB Endowment
Modern graph database query languages such as GQL, SQL/PGQ, and their academic predecessor G-Core promote paths to first-class citizens in the sense that their pattern matching facility can return pat...
Attitudinal effects of data visualizations and illustrations in data stories
Authors: Denis Parra, Manuela Garretón, Francesca Morini et al.
Year: 2023Source: IEEE Transactions on Visualization and Computer Graphics
Journalism has become more data-driven and inherently visual in recent years. Photographs, illustrations, infographics, data visualizations, and general images help convey complex topics to a wide aud...
Influence of surgical reduction on dynamic balance in patients after unstable ankle fracture.
Authors: Aidan Hogan, Ursula Trinler, Sven Y. Vetter et al.
Year: 2023Source: Gait & Posture
A Theory of Link Prediction via Relational Weisfeiler-Leman on Knowledge Graphs
Authors: Pablo Barceló, Miguel Romero Orth, Xingyue Huang et al.
Year: 2023Source: arXiv (Cornell University)
Graph neural networks are prominent models for representation learning over graph-structured data. While the capabilities and limitations of these models are well-understood for simple graphs, our und...
New insights from GWAS on BMI-related growth traits in a longitudinal cohort of admixed children with Native American and European ancestry
Authors: Susana Eyheramendy, Lucas Vicuña, Tomás Norambuena et al.
Year: 2023Source: iScience
Body-mass index (BMI) is a hallmark of adiposity. In contrast with adulthood, the genetic architecture of BMI during childhood is poorly understood. The few genome-wide association studies (GWAS) on c...
Online estimation methods for irregular autoregressive models
Authors: Susana Eyheramendy, Wilfredo Palma, Felipe Elorrieta et al.
Year: 2023Source: arXiv (Cornell University)
In the last decades, due to the huge technological growth observed, it has become increasingly common that a collection of temporal data rapidly accumulates in vast amounts. This provides an opportuni...
Predicting no-show appointments in a pediatric hospital in Chile using machine learning
Authors: Juan Peypouquet, Víctor Riquelme, Héctor Ramírez et al.
Year: 2023Source: Health Care Management Science
The Chilean public health system serves 74% of the country's population, and 19% of medical appointments are missed on average because of no-shows. The national goal is 15%, which coincides with the a...
The Personal Is the Political? What Do WhatsApp Users Share and How It Matters for News Knowledge, Polarization and Participation in Chile
Authors: Sebastián Valenzuela, Matías Bargsted, Ingrid Bachmann
Year: 2023Source: Routledge eBooks
Stronger and Safer Together
Authors: Magdalena Saldaña, Lourdes M. Cueva Chacón
Year: 2023Source: Routledge eBooks
Indigenous autonomy and Latin American state security in contexts of criminal violence: the cases of Cauca in Colombia and Guerrero in Mexico
Authors: Carla Alberti, Shannan Mattiace
Year: 2023Source: Latin American and Caribbean Ethnic Studies
Scholars writing on Indigenous autonomy in the Americas have focused mainly on social movement demands and on the implementation of laws that enshrine autonomy rights. The motives of state officials i...
Predicting disease severity in Multiple Sclerosis using multimodal data and machine learning
Authors: Nicole Kerlero de Rosbo, Janina Behrens, Susanna Asseyer et al.
Year: 2023Source: Research Square (Research Square)
Background: Multiple Sclerosis patients would benefit from machine learning algorithms that integrate clinical, imaging, and multimodal biomarkers to define the risk of disease activity. Meth...
Enumeration and updates for conjunctive linear algebra queries through expressibility
Authors: Cristian Riveros, Stijn Vansummeren, Thomas Muñoz
Year: 2023Source: arXiv (Cornell University)
Due to the importance of linear algebra and matrix operations in data analytics, there is significant interest in using relational query optimization and processing techniques for evaluating (sparse) ...
Towards a Comprehensive Human-Centred Evaluation Framework for Explainable AI
Authors: Denis Parra, Katrien Verbert, Ivania Donoso-Guzmán et al.
Year: 2023Source: Communications in computer and information science
Constant Time and Space Updates for the Sigma-Tau Problem
Authors: Gonzalo Navarro, Aaron Williams, Zsuzsanna Lipták et al.
Year: 2023Source: Lecture notes in computer science
Dynamic Compact Data Structure for Temporal Reachability with Unsorted Contact Insertions
Authors: Gonzalo Navarro, Bruno Augusto Nassif Travençolo, Marcelo Keese Albertini et al.
Year: 2023Source: arXiv (Cornell University)
Temporal graphs represent interactions between entities over time. Deciding whether entities can reach each other through temporal paths is useful for various applications such as in communication net...
Wheeler maps
Authors: Gonzalo Navarro, Travis Gagie, Jouni Sirén et al.
Year: 2023Source: arXiv (Cornell University)
Motivated by challenges in pangenomic read alignment, we propose a generalization of Wheeler graphs that we call Wheeler maps. A Wheeler map stores a text $T[1..n]$ and an assignment of tags to the ch...
Maintaining the cycle structure of dynamic permutations
Authors: Gonzalo Navarro, Zsuzsanna Lipták, Francesco Masillo
Year: 2023Source: arXiv (Cornell University)
We present a new data structure for maintaining dynamic permutations, which we call a forest of splay trees (FST). The FST allows one to efficiently maintain the cycle structure of a permut...
A Simple Grammar-Based Index for Finding Approximately Longest Common Substrings
Authors: Gonzalo Navarro, Travis Gagie, Sana Kashgouli
Year: 2023Source: Lecture notes in computer science
Faster Maximal Exact Matches with Lazy LCP Evaluation
Authors: Gonzalo Navarro, Travis Gagie, Adrián Goga et al.
Year: 2023Source: arXiv (Cornell University)
MONI (Rossi et al., JCB 2022) is a BWT-based compressed index for computing the matching statistics and maximal exact matches (MEMs) of a pattern (usually a DNA read) with respect to a highly re...
Bimodal Neural Style Transfer for Image Generation Based on Text Prompts
Authors: Marcelo Mendoza, Diego Gutiérrez
Year: 2023Source: Lecture notes in computer science
Supporting Users in Refining and Comparing Topic Models: An Experimental Study
Authors: Marcelo Mendoza, Evangelos Milios, Fernando V. Paulovich et al.
Year: 2023
Topic modeling is a statistical approach for extracting themes from high volumes of textual data. Humans are needed to interpret its outputs, which include sets of terms and scores. Lately, visualizat...
Bimodal Style Transference from Musical Composition to Image Using Deep Generative Models
Authors: Marcelo Mendoza, María José Apolo
Year: 2023Source: Lecture notes in computer science
Work-in-Progress: Decision Support System for the Process of Student Academic Registration
Authors: Renzo Angles, Luís Silvestre, Fabian Olivares et al.
Year: 2023Source: Lecture notes in networks and systems
Evaluating Regular Path Queries on Compressed Adjacency Matrices
Authors: Gonzalo Navarro, Diego Arroyuelo, Adrián Gómez-Brandón et al.
Year: 2023Source: Lecture notes in computer science
A Comprehensive and Curated Dataset of Covid-19 and Epistemonikos Evidence
Authors: Denis Parra, Hans Löbel, Andrés Carvallo et al.
Year: 2023Source: SSRN Electronic Journal
The emergence of COVID-19 has highlighted the importance of reliable information for clinical decision-making and public health policies. Evidence-based medicine (EBM) seeks to identify and evaluate s...
Globalization & The Challenging Political Economy of Governing (and Researching) Islands in Contemporary Times
Authors: Juan Pablo Luna
Year: 2023Source: Social and ecological interactions in the Galapagos Islands
Uncovering Bias in Personal Informatics
Authors: Ricardo Baeza-Yates, Athena Vakali, Pavlos Sermpezis et al.
Year: 2023Source: arXiv (Cornell University)
Personal informatics (PI) systems, powered by smartphones and wearables, enable people to lead healthier lifestyles by providing meaningful and actionable insights that break down barriers between use...
Understanding Search Behavior Bias in Wikipedia
Authors: Ricardo Baeza-Yates, Bruno Scarone, Erik Bernhardson
Year: 2023Source: Communications in computer and information science
Attribution-Scores and Causal Counterfactuals as Explanations in Artificial Intelligence
Authors: Leopoldo Bertossi
Year: 2023Source: Lecture notes in computer science
From Database Repairs to Causality in Databases and Beyond
Authors: Leopoldo Bertossi
Year: 2023Source: Lecture notes in computer science
Reasoning Web. Causality, Explanations and Declarative Knowledge
Authors: Leopoldo Bertossi, Guohui Xiao
Year: 2023Source: Lecture notes in computer science
Efficient Computation of Shap Explanation Scores for Neural Network Classifiers via Knowledge Compilation
Authors: Leopoldo Bertossi, Jorge Esquiche León
Year: 2023Source: Lecture notes in computer science
Attribution-Scores in Data Management and Explainable Machine Learning
Authors: Leopoldo Bertossi
Year: 2023Source: Lecture notes in computer science
Countermeasures for Mitigating Digital Misinformation: A Systematic Review
Authors: Sebastián Valenzuela, Wendy Hui Kyong Chun, Philip N. Howard et al.
Year: 2023
This Synthesis Report provides a formal systematic review of scientific literature on countermeasures for mitigating digital misinformation. We focus on 588 peer-reviewed publications, drawn from arou...
Platform Responses to Misinformation: A Meta-Analysis of Data
Authors: Sebastián Valenzuela, Wendy Hui Kyong Chun, Philip N. Howard et al.
Year: 2023
Digital misinformation is a critical issue affecting the global information environment. Countering misinformation and its effects is a major objective of governments, international organizations, con...
A Novel First-Order Autoregressive Moving Average Model to Analyze Discrete-Time Series Irregularly Observed
Authors: Susana Eyheramendy, Wilfredo Palma, César Ojeda et al.
Year: 2023Source: Contributions to statistics
A novel first-order autoregressive moving average model for analyzing discrete-time series observed at irregularly spaced times is introduced. Under Gaussianity, it is established that the model is st...
Extending time-series models for irregular observational gaps with a moving average structure for astronomical sequences
Authors: Susana Eyheramendy, Wilfredo Palma, César Ojeda et al.
Year: 2023Source: RAS Techniques and Instruments
In this study, we introduce a novel moving-average model for analyzing stationary time-series observed irregularly in time. The process is strictly stationary and ergodic under normality and ...
r-indexing without backward searching
Authors: Ben Langmead, Omar Ahmed, Mohsen Zakeri et al.
Year: 2023Source: arXiv (Cornell University)
Suppose we are given a text $T$ of length $n$ and a straight-line program for $T$ with $g$ rules. Let $\bar{r}$ be the number of runs in the Burrows-Wheeler Transform of the reverse of $T$. We can ind...
Pre-trained language models in Spanish for health insurance coverage
Authors: Claudio Aracena, Nicolás Rodríguez, Jocelyn Dunstan et al.
Year: 2023
The field of clinical natural language processing (NLP) can extract useful information from clinical text. Since 2017, the NLP field has shifted towards using pre-trained language models (PLMs), impro...
Development of pre-trained language models for clinical NLP in Spanish
Authors: Claudio Aracena, Jocelyn Dunstan
Year: 2023
Clinical natural language processing aims to tackle language and prediction tasks using text from medical practice, such as clinical notes, prescriptions, and discharge summaries. Several approaches h...
Automatic Coding at Scale: Design and Deployment of a Nationwide System for Normalizing Referrals in the Chilean Public Healthcare System
Authors: Felipe Van Der Huck Arias, Jorge E. Pacheco, Paulina Vera et al.
Year: 2023
The disease coding task involves assigning a unique identifier from a controlled vocabulary to each disease mentioned in a clinical document. This task is relevant since it allows information extracti...
Online Estimation Methods for Irregular Autoregressive Models
Authors: Susana Eyheramendy, Wilfredo Palma, Felipe Elorrieta et al.
Year: 2023Source: Contributions to statistics
In the last decades, due to the huge technological growth observed, it has become increasingly common that a collection of temporal data rapidly accumulates in vast amounts. This provides an opportuni...
Size Bounds and Algorithms for Conjunctive Regular Path Queries
Authors: Aidan Hogan, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2023Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
Conjunctive regular path queries (CRPQs) are one of the core classes of queries over graph databases. They are join intensive, inheriting their structure from the relational setting, but they also all...
How Do Centrality Measures Choose the Root of Trees?
Authors: Cristian Riveros, Jorge Salas, Oskar Skibski
Year: 2023Source: arXiv (Cornell University)
Centrality measures are widely used to assign importance to graph-structured data. Recently, understanding the principles of such measures has attracted a lot of attention. Given that measures are div...
Separating Automatic Relations
Authors: Pablo Barceló, Diego Figueira, Rémi Morvan
Year: 2023Source: Leibniz-Zentrum für Informatik (Schloss Dagstuhl)
We study the separability problem for automatic relations (i.e., relations on finite words definable by synchronous automata) in terms of recognizable relations (i.e., finite unions of products of reg...
Compact Data Structures Meet Databases (Invited Talk)
Authors: Cristian Riveros, Aidan Hogan, Carlos Buil-Aranda et al.
Year: 2023Source: arXiv (Cornell University)
We describe two success stories on the application of compact data structures (cds) to solve the problem of the excessively redundant space requirements posed by worst-case-optimal (wco) algorithms fo...
Publications for 2022
Displaying 210 publication(s) for 2022
Medios de comunicación y confianza política en América Latina: análisis individual y contextual del rol de las noticias en la confianza en el gobierno y el Estado
Authors: Sebastián Valenzuela, Ingrid Bachmann, Daniela Grassau et al.
Year: 2022Source: Revista Internacional de Sociología
What is the association between news exposure and political trust in Latin America? Are there differences depending on the freedom of the media system and the levels of political polarization? ...
Practical Random Access to SLP-Compressed Texts
Authors: Gonzalo Navarro, Travis Gagie, Giovanni Manzini et al.
Year: 2022Source: Lecture notes in computer science
Grammar-based compression is a popular and powerful approach to compressing repetitive texts but until recently its relatively poor time-space trade-offs during real-life construction made it impracti...
On Dynamic Succinct Graph Representations
Authors: Gonzalo Navarro, Guillermo de Bernardo, Susana Ladra et al.
Year: 2022
We address the problem of representing dynamic graphs using $k^2$-trees. The $k$...
Approximating Optimal Bidirectional Macro Schemes
Authors: Gonzalo Navarro, Luís M. S. Russo, Alexandre P. Francisco et al.
Year: 2022
Lempel-Ziv is an easy-to-compute member of a wide family of so-called macro schemes; it restricts pointers to go in one direction only. Optimal bidirectional macro schemes are NP-complete to find, but...
Two-Dimensional Block Trees
Authors: Gonzalo Navarro, Travis Gagie, Adrián Gómez-Brandón et al.
Year: 2022Source: The Computer Journal
The Block Tree is a data structure for representing repetitive sequences in compressed space, which reaches space comparable with that of Lempel–Ziv compression while retaining fast direct ...
Learning to cluster urban areas: two competitive approaches and an empirical validation
Authors: Marcelo Mendoza, Sergio Toro, Hans Löbel et al.
Year: 2022Source: EPJ Data Science
Urban clustering detects geographical units that are internally homogeneous and distinct from their surroundings. It has applications in urban planning, but few studies compare the effectiven...
Exploration Trade-offs in Web Recommender Systems
Authors: Ricardo Baeza-Yates, Giovanni Delnevo
Year: 2022Source: 2021 IEEE International Conference on Big Data (Big Data)
One of the main problems of web recommender systems is exposure bias, due to the fact that the web system itself is partly generating its own future, as users can only click on items shown to them. Th...
A scalable and energy efficient GPU thread map for m-simplex domains
Authors: Benjamín Bustos, Felipe A. Quezada, Cristóbal A. Navarro et al.
Year: 2022Source: Future Generation Computer Systems
Test datasets for GenoVi: draft and complete genomes
Authors: Carlos Buil-Aranda, Mauricio Araya, Nicolás Jara et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Test dataset for GenoVi (Genome Visualizer). All the genomes available in this repository were used in the analyses done by Cumsille et al., 2022 for the publication of GenoVi.
Extending Sticky-Datalog+/- via Finite-Position Selection Functions: Tractability, Algorithms, and Optimization
Authors: Leopoldo Bertossi, Mostafa Milani
Year: 2022Source: Information Systems
Trie-Compressed Intersectable Sets
Authors: Diego Arroyuelo, Juan P. Castillo
Year: 2022Source: arXiv (Cornell University)
We introduce space- and time-efficient algorithms and data structures for the offline set intersection problem. We show that a sorted integer set $S \subseteq [0{..}u)$ of $n$ elements can be represen...
Human vs. Artificial Intelligence
Authors: Ricardo Baeza-Yates, Pablo Villoslada
Year: 2022
In this essay we compare human and artificial intelligence from two points of view: computational and neuroscience. We discuss the differences and limitations of AI with respect to our intelligence, e...
Report on the 12th Temporal Web Analytics Workshop (TempWeb 2022) at WWW 2022
Authors: Ricardo Baeza-Yates, Omar Alonso, Marc Spaniol
Year: 2022Source: ACM SIGIR Forum
TempWeb focuses on investigating infrastructures, scalable methods, and innovative software for aggregating, querying, and analyzing heterogeneous data at Web scale. Emphasis is given to data analysis...
Using Automated Planning to Provide Feedback during Collaborative Problem-Solving
Authors: Jorge Baier, Miguel Nussbaum, María Fernanda Rodríguez et al.
Year: 2022Source: International Journal of Artificial Intelligence in Education
Weisfeiler and Leman Go Relational
Authors: Pablo Barceló, Mikhail Galkin, Christopher G. Morris et al.
Year: 2022Source: arXiv (Cornell University)
Knowledge graphs, modeling multi-relational data, improve numerous applications such as question answering or graph logical reasoning. Many graph neural networks for such data emerged recently, often ...
LSQ 2.0: A linked dataset of SPARQL query logs
Authors: Aidan Hogan, Carlos Buil-Aranda, Axel-Cyrille Ngonga Ngomo et al.
Year: 2022Source: Semantic Web
We present the Linked SPARQL Queries (LSQ) dataset, which currently describes 43.95 million executions of 11.56 million unique SPARQL queries extracted from the logs of 27 different endpoints. The LSQ...
Gradual System F
Authors: Elizabeth Labrada, Matías Toro, Eric Tanter et al.
Year: 2022Source: Journal of the ACM
Bringing the benefits of gradual typing to a language with parametric polymorphism like System F, while preserving relational parametricity, has proven extremely challenging: first attempts were formu...
Fiscal Origins of Subnational Democracy: Evidence from Argentina
Authors: Carla Alberti, Diego Díaz Rioseco
Year: 2022Source: Politics & Society
Subnational governments are generally funded by fiscal rents, that is, transfers of centrally levied taxes. Existing literature concurs that fiscal federalism breeds rentierism and, consequently, hind...
Toward a Definitive Compressibility Measure for Repetitive Sequences
Authors: Gonzalo Navarro, Nicola Prezza, Tomasz Kociumaka
Year: 2022Source: IEEE Transactions on Information Theory
While the kth order empirical entropy is an accepted measure of the compressibility of individual sequences on classical text collections, it is useful only for small values of k and thus fails to ca...
Counting the Answers to a Query
Authors: Cristian Riveros, Marcelo Arenas, Rajesh Jayaram et al.
Year: 2022Source: ACM SIGMOD Record
Counting the answers to a query is a fundamental problem in databases, with several applications in the evaluation, optimization, and visualization of queries. Unfortunately, counting query answers is...
PG-Schema: Schemas for Property Graphs
Authors: Renzo Angles, Domagoj Vrgoč, Juan Sequeda et al.
Year: 2022Source: arXiv (Cornell University)
Property graphs have reached a high level of maturity, witnessed by multiple robust graph database systems as well as the ongoing ISO standardization effort aiming at creating a new standard Graph Que...
Issue Information
Authors: Gonzalo Navarro, Christoph Lange, Han‐Na Kim et al.
Year: 2022Source: European Journal Of Haematology
“What a nasty girl!” incivility and gendered symbolic violence in news discussions
Authors: Magdalena Saldaña, Valentina Proust
Year: 2022Source: Feminist Media Studies
This study examines conversations developed in the virtual public sphere to identify if a user’s gender affects the presence of incivility in news comment sections. By relying on a mixed-method anal...
No Agreement Without Loss: Learning and Social Choice in Peer Review
Authors: Pablo Barceló, Tomasz Steifer, Cristóbal Rojas et al.
Year: 2022Source: arXiv (Cornell University)
In peer review systems, reviewers are often asked to evaluate various features of submissions, such as technical quality or novelty. A score is given to each of the predefined features and based on th...
La salud en la era digital
Authors: Claudio Gutiérrez, Mercedes López
Year: 2022Source: Revista Médica Clínica Las Condes
What changes does the digital world bring to the way we approach health? How are digital technologies affecting medicine? This article presents an overview of these topics,...
Aplicaciones de aprendizaje automático en salud
Authors: Jocelyn Dunstan, Fabián Villena, Claudio Aracena et al.
Year: 2022Source: Revista Médica Clínica Las Condes
This work aims to show some recent applications of machine learning in the area of health. Machine learning is a branch of ...
Procesamiento de lenguaje natural para texto clínico en español: el caso de las listas de espera en Chile
Authors: Jocelyn Dunstan, Pablo Báez, Fredy Núñez Torres et al.
Year: 2022Source: Revista Médica Clínica Las Condes
The waiting lists not covered by the Explicit Health Guarantee Plan for new specialty consultation in Chile increased due to the effects of the SARS-CoV-2 coronavirus (COVID-19) pandemic. This represe...
On Computing Probabilistic Explanations for Decision Trees
Authors: Pablo Barceló, Bernardo Subercaseaux, Marcelo Arenas et al.
Year: 2022Source: Conference on Neural Information Processing Systems (NeurIPS 2022)
On the expressiveness of Lara: A proposal for unifying linear and relational algebra
Authors: Pablo Barceló, Nelson Higuera, Jorge Pérez et al.
Year: 2022Source: Theoretical Computer Science
GPC: A Pattern Calculus for Property Graphs
Authors: Domagoj Vrgoč, Leonid Libkin, Wim Martens et al.
Year: 2022Source: arXiv (Cornell University)
The development of practical query languages for graph databases runs well ahead of the underlying theory. The ISO committee in charge of database query languages is currently developing a new standar...
Datasets of Time- and Space-Efficient Regular Path Queries
Authors: Aidan Hogan, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Datasets that were used in the experiments of our work "Time- and Space-Efficient Regular Path Queries."
Space/time-efficient RDF stores based on circular suffix sorting
Authors: Gonzalo Navarro, Guillermo de Bernardo, Nieves R. Brisaboa et al.
Year: 2022Source: The Journal of Supercomputing
CLNews: The First Dataset of the Chilean Social Outbreak for Disinformation Analysis
Authors: Marcelo Mendoza, Eliana Providel, Daniel Toro-González et al.
Year: 2022Source: Proceedings of the 31st ACM International Conference on Information & Knowledge Management
Disinformation is one of the main threats that loom on social networks. Detecting disinformation is not trivial and requires training and maintaining fact-checking teams, which is labor-intensive. Rec...
Simple and efficient bi-objective search algorithms via fast dominance checks
Authors: Jorge Baier, Carlos Hernández, Luis Suazo et al.
Year: 2022Source: Artificial Intelligence
Another Violent Protest? New Perspectives to Understand Protest Coverage
Authors: Magdalena Saldaña, Valentina Proust
Year: 2022Source: Media and Communication
This study assesses the relationship between two well-established sets of frames to better understand the news coverage of massive political protests. By relying on Semetko and Valkenburg’s generic ...
Gradual C0: Symbolic Execution for Gradual Verification
Authors: Eric Tanter, Joshua Sunshine, Jonathan Aldrich et al.
Year: 2022Source: arXiv (Cornell University)
Current static verification techniques support a wide range of programs. However, such techniques only support complete and detailed specifications, which places an undue burden on users. To solve thi...
An automatic methodology to measure drivers’ behavior in public transport
Authors: Hans Löbel, Juan Carlos Herrera, Hernan F. Catalan
Year: 2022Source: Journal of Intelligent Transportation Systems
The way in which public transport buses are driven influences users’ perception of and satisfaction with the service. Bus drivers’ behavior is usually obtained by surveying passengers and/or usi...
Multi-Agent Path Finding: A New Boolean Encoding
Authors: Roberto Asin Acha, Rodrigo Lopez, Sebastian Hagedorn et al.
Year: 2022Source: Journal of Artificial Intelligence Research
Multi-agent pathfinding (MAPF) is an NP-hard problem. As such, dense maps may be very hard to solve optimally. In such scenarios, compilation-based approaches, via Boolean satisfiability (SAT) and ans...
Constant-delay enumeration for SLP-compressed documents
Authors: Cristian Riveros, Martín Muñoz
Year: 2022Source: arXiv (Cornell University)
We study the problem of enumerating results from a query over a compressed document. The model we use for compression are straight-line programs (SLPs), which are defined by a context-free grammar tha...
Answer-Set Programs for Repair Updates and Counterfactual Interventions
Authors: Leopoldo Bertossi
Year: 2022Source: arXiv (Cornell University)
We briefly describe -- mainly through very simple examples -- different kinds of answer-set programs with annotations that have been proposed for specifying: database repairs and consistent query answ...
Explainable neural image recommendation using Network Dissection visual concepts
Authors: Denis Parra, Hans Löbel, Antonio Ossa-Guerra
Year: 2022
Training and intrinsic evaluation of lightweight word embeddings for the clinical domain in Spanish
Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Frontiers in Artificial Intelligence
Resources for Natural Language Processing (NLP) are less numerous for languages different from English. In the clinical domain, where these resources are vital for obtaining new knowledge about human ...
Propositional Equality for Gradual Dependently Typed Programming
Authors: Ronald Garcia, Joseph Eremondi, Éric Tanter et al.
Year: 2022Source: Proceedings of the ACM on Programming Languages-PACMPL
Gradual dependent types can help with the incremental adoption of dependently typed code by providing a principled semantics for imprecise types and proofs, where some parts have been omitted. Current...
A Reasonably Gradual Type Theory
Authors: Eric Tanter, Kenji Maillard, Meven Lennon-Bertrand et al.
Year: 2022Source: Proceedings of the ACM on Programming Languages
Gradualizing the Calculus of Inductive Constructions (CIC) involves dealing with subtle tensions between normalization, graduality, and conservativity with respect to CIC. Recently, GCIC has been prop...
Faster compressed quadtrees
Authors: Gonzalo Navarro, Travis Gagie, Guillermo de Bernardo et al.
Year: 2022Source: Journal of Computer and System Sciences
Can political alignment reduce crime? Evidence from Chile
Authors: Carla Alberti, Diego Díaz Rioseco, Giancarlo Visconti
Year: 2022Source: Political Science Research and Methods
Research has shown that presidents tend to benefit local level copartisans when distributing resources, which can improve the provision of public goods, such as security. Considering that fea...
UMLS Heading Sequences in Spanish
Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
UMLS Heading Sequences in Spanish used to compute Word embeddings for the Spanish clinical language
Medical Journals in Spanish
Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Medical Journals in Spanish used to compute Word embeddings for the Spanish clinical language
Chilean waiting list corpus
Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
The Chilean waiting list corpus used to compute Word embeddings for the Spanish clinical language
People are more engaged on Facebook as they get older, especially in politics: evidence from users in 46 countries
Authors: Sergio Toro, Juan Pablo Luna, Gabriel Vommaro et al.
Year: 2022Source: Journal of Quantitative Description: Digital Media
A growing body of literature has noted an age pattern in the sharing of false news on social media, with older people sharing misinformation more often than younger users. In this article we supplemen...
Cross-Lingual and Cross-Domain Crisis Classification for Low-Resource Scenarios
Authors: Jorge Pérez, Bárbara Poblete, Hernán Sarmiento et al.
Year: 2022Source: arXiv (Cornell University)
Social media data has emerged as a useful source of timely information about real-world crisis events. One of the main tasks related to the use of social media for disaster management is the automatic...
Dynamic Data Structures for Timed Automata Acceptance
Authors: Cristian Riveros, Alejandro Grez, Filip Mazowiecki et al.
Year: 2022Source: Algorithmica
We study a variant of the classical membership problem in automata theory, which consists of deciding whether a given input word is accepted by a given automaton. We do so through the lenses ...
A Reasonably Gradual Type Theory
Authors: Kenji Maillard, Meven Lennon-Bertrand, Nicolas Tabareau et al.
Year: 2022Source: HAL (Le Centre pour la Communication Scientifique Directe)
Gradualizing the Calculus of Inductive Constructions (CIC) involves dealing with subtle tensions between normalization, graduality, and conservativity with respect to CIC. Recently, GCIC has been prop...
Actitudes políticas y solicitudes de ayuda directa a los gobiernos locales en América Latina
Authors: Sergio Toro, Danytza González-Ceballos
Year: 2022Source: AMÉRICA LATINA HOY
This article studies the relationship between direct aid from local governments and authorities and the political attitudes of citizens. It analyzes data from the survey Barómetro de las ...
Interactive annotation of geometric ornamentation on painted pottery assisted by deep learning
Authors: Ivan Sipiran, Tobias Schreck, Reinhold Preiner et al.
Year: 2022Source: it - Information Technology
In Greek art, the phase from 900 to 700 BCE is referred to as the Geometric period due to the characteristically simple geometry-like ornamentations appearing on painted pottery surfaces duri...
Grammar Compression by Induced Suffix Sorting
Authors: Gonzalo Navarro, Simon Gog, Maurício Ayala-Rincón et al.
Year: 2022Source: ACM Journal of Experimental Algorithmics
A grammar compression algorithm, called GCIS, is introduced in this work. GCIS is based on the induced suffix sorting algorithm SAIS, presented by Nong et al. in 2009. The proposed solution builds on ...
Biomechanical comparison of a 3D-printed prosthetic foot with conventional feet in people with transtibial amputation: A prospective cohort study
Authors: Aidan Hogan, Ursula Trinler, Mathias Rehg et al.
Year: 2022Source: Prosthetics and Orthotics International
The method of 3D printing is increasingly gaining utilization in clinical applications and may support prosthetic fitting. The aim was to compare biomechanical outcomes of people with a transtibial am...
WIP: Exploring differences in student sense of belonging inside and outside the engineering classroom
Authors: Jorge Baier, Isabel Hilliger, Maria Javiera de los Rios et al.
Year: 2022Source: ASEE Annual Conference and Exposition, Conference Proceedings
Total mutational load and clinical features as predictors of the metastatic status in lung adenocarcinoma and squamous cell carcinoma patients
Authors: Gonzalo Navarro, Karen Oróstica, Álvaro Olivera‐Nappa et al.
Year: 2022Source: Journal of Translational Medicine
Background: Recently, extensive cancer genomic studies have revealed mutational and clinical data of large cohorts of cancer patients. For example, the Pan-Lung Cancer 2016 dataset (part of Th...
Changing Media Landscapes and Political Participation
Authors: Sebastián Valenzuela, Marcelo Santos
Year: 2022Source: Oxford University Press eBooks
This chapter discusses how a constantly changing media landscape affects political participation. After pointing out the affordances brought forward by digital media and communication technol...
Semantics and canonicalisation of SPARQL 1.1
Authors: Aidan Hogan, Jaime Salas
Year: 2022Source: Semantic Web
Navigating planar topologies in near-optimal space and time
Authors: Gonzalo Navarro, José Fuentes‐Sepúlveda, Diego Seco
Year: 2022Source: Computational Geometry
Gradualizing the Calculus of Inductive Constructions
Authors: Eric Tanter, Kenji Maillard, Meven Lennon-Bertrand et al.
Year: 2022Source: ACM Transactions on Programming Languages and Systems
We investigate gradual variations on the Calculus of Inductive Construction (CIC) for swifter prototyping with imprecise types and terms. We observe, with a no-go theorem, a crucial trade-off between ...
Representing Paths in Graph Database Pattern Matching
Authors: Domagoj Vrgoč, Stijn Vansummeren, Wim Martens et al.
Year: 2022Source: arXiv (Cornell University)
Modern graph database query languages such as GQL, SQL/PGQ, and their academic predecessor G-Core promote paths to first-class citizens in the sense that paths that match regular path queries can be r...
Real-Time Heuristic Search with LTLf Goals
Authors: Jorge Baier, Jaime Middleton, Rodrigo Toro
Year: 2022Source: IJCAI International Joint Conference on Artificial Intelligence
The structure of political conflict. The oligarchs and the bourgeoisie in the Chilean Congress, 1834–1894
Authors: Naim Bro
Year: 2022Source: Theory and Society
Focal Discrepancy Search for Learned Heuristics
Authors: Jorge Baier, Matias Greco, Pablo Araneda
Year: 2022Source: Proceedings of the International Symposium on Combinatorial Search
Machine learning allows learning accurate but inadmissible heuristics for hard combinatorial puzzles like the 15-puzzle, the 24-puzzle, and Rubik's cube. In this paper, we investigate how to exploit t...
Avoiding Errors in Learned Heuristics in Bounded-Suboptimal Search
Authors: Jorge Baier, Matias Greco
Year: 2022Source: Proceedings of the International Symposium on Combinatorial Search
Despite being very effective, learned heuristics in bounded-suboptimal search can produce heuristic plateaus or move the search to zones of the state space that do not lead to a solution. In addition,...
K-Focal Search for Slow Learned Heuristics (Extended Abstract)
Authors: Jorge Baier, Matias Greco, Jorge Toro et al.
Year: 2022Source: Proceedings of the International Symposium on Combinatorial Search
Learned heuristics, though inadmissible, can provide very good guidance for bounded-suboptimal search. Given a single search state s and a learned heuristic h, evaluating h(s) is typically very slow r...
Subset Approximation of Pareto Regions with Bi-Objective A* (Extended Abstract)
Authors: Jorge Baier, Nicolás Rivera, Carlos Hernández Ulloa
Year: 2022Source: Proceedings of the International Symposium on Combinatorial Search
In bi-objective search, we are given a graph in which each directed arc is associated with a pair of non-negative weights, and the objective is to find the Pareto-optimal solution set. Unfortunately, ...
Data from the paper "Learning to clusterize urban areas: two competitive approaches and an empirical validation"
Authors: Marcelo Mendoza, Sergio Toro, Hans Löbel et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Data for urban clustering used in the paper "Learning to clusterize urban areas: two competitive approaches and an empirical validation". We release two datasets for urban clustering based on data acq...
Hierarchical Transformers for Group-Aware Sequential Recommendation: Application in MOBA Games
Authors: Denis Parra, Vladimir Araujo, Andrés Villa et al.
Year: 2022
In recent years, several recommendation systems have been introduced to improve the user experience of players in video games. In Multiplayer Online Battle Arena (MOBA) games, a popular game genre, th...
Reflections on a Legacy: Thoughts from Scholars about Agenda-Setting Past and Future
Authors: Sebastián Valenzuela, Maxwell McCombs, Лэй Гуо et al.
Year: 2022Source: Mass Communication & Society
In response to Perloff's (this issue) essay examining the development and future of agenda setting, a series of scholars offer their own reactions to the essay and the broader issues it raises.
Real-Time Heuristic Search with LTLf Goals
Authors: Jorge Baier, Rodrigo Toro Icarte, Jaime Middleton
Year: 2022Source: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence
In Real-Time Heuristic Search (RTHS) we are given a search graph G, a heuristic, and the objective is to find a path from a given start node to a given goal node in G. As such, one does not impose any...
On Computing Probabilistic Explanations for Decision Trees
Authors: Pablo Barceló, Marcelo Arenas, Bernardo Subercaseaux et al.
Year: 2022Source: arXiv (Cornell University)
Formal XAI (explainable AI) is a growing area that focuses on computing explanations with mathematical guarantees for the decisions made by ML models. Inside formal XAI, one of the most studied cases ...
Replication Data for: Corruption and Political Knowledge Erosion. A Cautionary Tale from Latin America
Authors: Sebastián Valenzuela, Matías Bargsted, Ingrid Bachmann
Year: 2022Source: Harvard Dataverse
This study employs data from the two-wave face-to-face panel survey conducted by the authors of this study. The survey employed a probability-based sample representative of all adults (18 years or old...
Subset Approximation of Pareto Regions with Bi-objective A*
Authors: Jorge Baier, Nicolás Rivera, Carlos Hernández
Year: 2022Source: Proceedings of the AAAI Conference on Artificial Intelligence
In bi-objective search, we are given a graph in which each directed arc is associated with a pair of non-negative weights, and the objective is to find the Pareto-optimal solution set. Unfortunately, ...
Replication Data for Local Government, Social Media and Management of COVID-19: The Case of Chilean Mayoral Communication
Authors: Fernando Rosenblatt, Cristian Pérez Muñoz, Juan Pablo Luna et al.
Year: 2022Source: Harvard Dataverse
Code and data of the analysis and result of the paper
Bots don’t Vote, but They Surely Bother!
Authors: Ricardo Baeza-Yates, Eduardo Graells-Garrido
Year: 2022
Paper presented at the 14th ACM Web Science Conference 2022 (WebSci '22), held from 26 to 29 June 2022 in Barcelona, Spain.
PromoterLCNN: A Light CNN-Based Promoter Prediction and Classification Model
Authors: Dary Hernández, Nicolás Jara, Mauricio Araya et al.
Year: 2022Source: GENES
Promoter identification is a fundamental step in understanding bacterial gene regulation mechanisms. However, accurate and fast classification of bacterial promoters continues to be challenging. New m...
Word embeddings for the Spanish clinical language
Authors: Jocelyn Dunstan, Cecilia Besa, Fabián Villena et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Word embeddings for the Spanish clinical language Corpora used to compute the embeddings: Chilean waiting list corpus - https://zenodo.org/record/7072314 Medical Journal in Spanish - https://zenodo.or...
Efficient Enumeration for Annotated Grammars
Authors: Cristian Riveros, Martín Muñoz, Antoine Amarilli et al.
Year: 2022
Graph Pattern Matching in GQL and SQL/PGQ
Authors: Domagoj Vrgoč, Oskar van Rest, Stefan Plantikow et al.
Year: 2022Source: Proceedings of the 2022 International Conference on Management of Data
A Reasonably Gradual Type Theory – Artifact
Authors: Eric Tanter, Meven Lennon-Bertrand, Kenji Maillard et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
Accompanying artifact to the article "A Reasonably Gradual Type Theory." It consists of two parts: - a Coq formalization of the model described in the article, - a proof of concept using rewrit...
Improving matrix-vector multiplication via lossless grammar-compressed matrices
Authors: Gonzalo Navarro, Travis Gagie, Dominik Köppl et al.
Year: 2022Source: Proceedings of the VLDB Endowment
As nowadays Machine Learning (ML) techniques are generating huge data collections, the problem of how to efficiently engineer their storage and operations is becoming of paramount importance. In this ...
Corruption and Political Knowledge Erosion. A Cautionary Tale from Latin America
Authors: Sebastián Valenzuela, Matías Bargsted, Ingrid Bachmann
Year: 2022Source: International Journal of Public Opinion Research
Previous research has shown that corruption diminishes citizens’ level of political support and engagement. We extend this line of reasoning and evaluate whether previous levels of perceive...
The Next Generation Virgo Cluster Survey. XXXIII. Stellar Population Gradients in the Virgo Cluster Core Globular Cluster System
Authors: Susana Eyheramendy, Andrés Jordán, Laura Ferrarese et al.
Year: 2022Source: The Astrophysical Journal
We present a study of the stellar populations of globular clusters (GCs) in the Virgo Cluster core with a homogeneous spectroscopic catalog of 692 GCs within a major-axis distance R maj = 840...
Identifying and Characterizing New Expressions of Community Framing during Polarization
Authors: Bárbara Poblete, Hernán Sarmiento, Felipe Bravo-Márquez et al.
Year: 2022Source: Proceedings of the International AAAI Conference on Web and Social Media
Chile experienced a series of important protests between October and December 2019. This social unrest, as it was called, was fueled by social inequity and radically affected the nation's status quo. ...
Technical Perspective - No PANE, No Gain
Authors: Aidan Hogan
Year: 2022Source: ACM SIGMOD Record
The machine learning community has traditionally been proactive in developing techniques for diverse types of data, such as text, audio, images, videos, time series, and, of course, matrices, tensors,...
Reevaluando los diseños institucionales: El efecto del presidencialismo sobre la corrupción
Authors: Sergio Toro
Year: 2022Source: Revista Chilena de Derecho y Ciencia Política
This research note analyzes the effect of institutional variables on the Corruption Perceptions Index (CPI). Using the pairs methodology on a base of...
Knowledge graphs
Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: ACM Computing Surveys
In this article, we provide a comprehensive introduction to knowledge graphs, which have recently garnered significant attention from both industry and academia in scenarios that require exploiting di...
Multilayer graphs
Authors: Aidan Hogan, Renzo Angles, Domagoj Vrgoč et al.
Year: 2022
In this short position paper, we argue that there is a need for a unifying data model that can support popular graph formats such as RDF, RDF* and property graphs, while at the same time being powerfu...
Technical perspective: The compression power of the BWT
Authors: Gonzalo Navarro
Year: 2022Source: Communications of the ACM
Educational Tools for Mapuzugun
Authors: Claudio Gutiérrez, Antonios Anastasopoulos, Cristian Ahumada
Year: 2022Source: arXiv (Cornell University)
Mapuzugun is the language of the Mapuche people. Due to political and historical reasons, its number of speakers has decreased and the language has been excluded from the educational system in Chile a...
Slicing of Probabilistic Programs based on Specifications
Authors: Marcelo Navarro, Federico Olmedo
Year: 2022Source: Science of Computer Programming
Probabilistic Automata of Bounded Ambiguity
Authors: Cristian Riveros, Nathanaël Fijalkow, James Worrell
Year: 2022Source: HAL (Le Centre pour la Communication Scientifique Directe)
Probabilistic automata are an extension of nondeterministic finite automata in which transitions are annotated with probabilities. Despite its simplicity, this model is very expressive and many of the...
Plausible sealing for gradual parametricity
Authors: Elizabeth Labrada, Matías Toro, Eric Tanter et al.
Year: 2022Source: Proceedings of the ACM on Programming Languages-PACMPL
Graduality and parametricity have proven to be extremely challenging notions to bring together. Intuitively, enforcing parametricity gradually requires possibly sealing values in order to detect viola...
LSCDiscovery: A shared task on semantic change discovery and detection in Spanish
Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2022Source: arXiv (Cornell University)
We present the first shared task on semantic change discovery and detection in Spanish and create the first dataset of Spanish words manually annotated for semantic change using the DURel framework (S...
Towards Effective Blended Learning Through the Eyes of Students: A Survey Study in Transition into Face-to-Face Education
Authors: Jorge Baier, Isabel Hilliger, Gabriel Astudillo et al.
Year: 2022Source: Educating for a new future: Making sense of technology-enhanced learning adoption. EC-TEL 2022
CORE: a Complex Event Recognition Engine
Authors: Marco Bucchi, Alejandro Grez, Andrés Quintana et al.
Year: 2022Source: VLDB 2022
Complex Event Recognition (CER) systems are a prominent technology for finding user-defined query patterns over large data streams in real time. CER query evaluation is known to be computationally cha...
Slicing of Probabilistic Programs based on Specifications
Authors: Federico Olmedo, Marcelo Navarro
Year: 2022Source: arXiv (Cornell University)
This paper presents the first slicing approach for probabilistic programs based on specifications. We show that when probabilistic programs are accompanied by their specifications in the form of pre- ...
Squeeze: Efficient compact fractals for tensor core GPUs
Authors: Benjamín Bustos, Felipe A. Quezada, Cristóbal A. Navarro et al.
Year: 2022Source: Future Generation Computer Systems
Propositional Equality for Gradual Dependently Typed Programming
Authors: Eric Tanter, Joseph Eremondi, Ronald G. García
Year: 2022Source: arXiv (Cornell University)
Gradual dependent types can help with the incremental adoption of dependently typed code by providing a principled semantics for imprecise types and proofs, where some parts have been omitted. Current...
Time- and Space-Efficient Regular Path Queries
Authors: Aidan Hogan, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2022Source: 2022 IEEE 38th International Conference on Data Engineering (ICDE)
We introduce a time- and space-efficient technique to solve regular path queries over labeled (RDF) graphs. We combine a bit-parallel simulation of the Glushkov automaton of the regular expression wit...
Exploration of Knowledge Graphs via Online Aggregation
Authors: Aidan Hogan, Benny Kimelfeld, Oren Kalinsky et al.
Year: 2022Source: 2022 IEEE 38th International Conference on Data Engineering (ICDE)
Exploration systems over large-scale RDF knowledge graphs often rely on aggregate count queries to indicate how many results the user can expect for the possible next steps of exploration. Such syste...
Temporal Regular Path Queries
Authors: Marcelo Arenas, Julia Stoyanovich, Pedro Bahamondes et al.
Year: 2022Source: 2022 IEEE 38th International Conference on Data Engineering (ICDE)
In the last decade, substantial progress has been made towards standardizing the syntax of graph query languages, and towards understanding their semantics and complexity of evaluation. In this paper,...
12th Temporal Web Analytics Workshop (TempWeb) Overview
Authors: Ricardo Baeza-Yates, Marc Spaniol, Omar Alonso
Year: 2022Source: Companion Proceedings of The Web Conference 2018
TempWeb focuses on investigating infrastructures, scalable methods, and innovative software for aggregating, querying, and analyzing heterogeneous data at Web scale. Emphasis is given to data analysis...
Evaluating regular path queries under the all-shortest paths semantics
Authors: Domagoj Vrgoč
Year: 2022Source: arXiv (Cornell University)
The purpose of this report is to explain how the textbook breadth-first search algorithm (BFS) can be modified in order to also create a compact representation of all shortest paths connecting a singl...
ALBETO and DistilBETO: Lightweight Spanish Language Models
Authors: Felipe Bravo-Márquez, Andrés Carvallo, Vladimir Araujo et al.
Year: 2022Source: arXiv (Cornell University)
In recent years there have been considerable advances in pre-trained language models, where non-English language versions have also been made available. Due to their increasing use, many lightweight v...
Correction to: Graph Compression for Adjacency-Matrix Multiplication
Authors: Gonzalo Navarro, Travis Gagie, Dominik Köppl et al.
Year: 2022Source: SN Computer Science
Evaluation Benchmarks for Spanish Sentence Representations
Authors: Marcelo Mendoza, Felipe Bravo-Márquez, Álvaro Soto et al.
Year: 2022Source: arXiv (Cornell University)
Due to the success of pre-trained language models, versions of languages other than English have been released in recent years. This fact implies the need for resources to evaluate these models. In th...
Gobernanza Criminal y la Crisis de los Estados Latinoamericanos Contemporáneos
Authors: Juan Pablo Luna, Andreas Feldmann
Year: 2022Source: Annual Review of Sociology
Latin American societies increasingly face the emergence of new orders in which state officials and political authorities share power with criminal organizations...
Efficient Construction of the BWT for Repetitive Text Using String Compression
Authors: Gonzalo Navarro, Diego Díaz-Domínguez
Year: 2022Source: arXiv (Cornell University)
We present a new semi-external algorithm that builds the Burrows-Wheeler transform variant of Bauer et al. (a.k.a. BCR BWT) in linear expected time. Our method uses compression techniques to reduce ...
Expressiveness and Approximation Properties of Graph Neural Networks
Authors: Juan Reutter, Floris Geerts
Year: 2022Source: arXiv (Cornell University)
Characterizing the separation power of graph neural networks (GNNs) provides an understanding of their limitations for graph learning tasks. Results regarding separation power are, however, usually ge...
DWUG ES: Diachronic Word Usage Graphs for Spanish
Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
This data collection contains diachronic Word Usage Graphs (WUGs) for Spanish. Find a description of the data format, code to process the data and further datasets on the WUGsite. Please find more inf...
Automatic Extraction of Nested Entities in Clinical Referrals in Spanish
Authors: Felipe Bravo-Márquez, Jocelyn Dunstan, Fabián Villena et al.
Year: 2022Source: ACM Transactions on Computing for Healthcare
Here we describe a new clinical corpus rich in nested entities and a series of neural models to identify them. The corpus comprises de-identified referrals from the waiting list in Chilean public hosp...
Criminal Governance and the Crisis of Contemporary Latin American States
Authors: Juan Pablo Luna, Andreas Feldmann
Year: 2022Source: Annual Review of Sociology
Across Latin America, societies are confronting the rise of novel orders in which state officials and political authorities share power with criminal organizations. Criminal governance (i.e., the crea...
A Novel First-Order Autoregressive Moving Average Model to Analyze Discrete-Time Series Irregularly Observed
Authors: Susana Eyheramendy, Wilfredo Palma, César Ojeda et al.
Year: 2022Source: arXiv (Cornell University)
A novel first-order autoregressive moving average model for analyzing discrete-time series observed at irregularly spaced times is introduced. Under Gaussianity, it is established that the model is st...
Social Media and Belief in Misinformation in Mexico: A Case of Maximal Panic, Minimal Effects?
Authors: Sebastián Valenzuela, Marcelo Santos, Carlos Muñiz
Year: 2022Source: The International Journal of Press/Politics
Contrary to popular narratives, it is not clear whether using social media for news increases belief in political misinformation. Several of the most methodologically sound studies find small to nonex...
Similarity-Based Explanations meet Matrix Factorization via Structure-Preserving Embeddings
Authors: Denis Parra, Leandro Balby Marinho, Rodrygo L. T. Santos et al.
Year: 2022
Embeddings are core components of modern model-based Collaborative Filtering (CF) methods, such as Matrix Factorization (MF) and Deep Learning variations. In essence, embeddings are mappings of the or...
Graph Compression for Adjacency-Matrix Multiplication
Authors: Gonzalo Navarro, Travis Gagie, Dominik Köppl et al.
Year: 2022Source: SN Computer Science
Computing the product of the (binary) adjacency matrix of a large graph with a real-valued vector is an important operation that lies at the heart of various graph analysis tasks, such as com...
HOLZ: High-Order Entropy Encoding of Lempel-Ziv Factor Distances
Authors: Gonzalo Navarro, Dominik Köppl, Nicola Prezza
Year: 2022
We propose a new representation of the offsets of the Lempel-Ziv (LZ) factorization based on the co-lexicographic order of the text's prefixes. The selected offsets tend to approach the k-th order emp...
Answer-Set Programs for Reasoning about Counterfactual Interventions and Responsibility Scores for Classification
Authors: Leopoldo Bertossi, Gabriela Reyes, Gabriela de los Ángeles Díaz Reyes
Year: 2022Source: Inductive Logic Programming (ILP 2021)
Optimal Joins Using Compressed Quadtrees
Authors: Juan Reutter, Gonzalo Navarro, Diego Arroyuelo et al.
Year: 2022Source: ACM Transactions on Database Systems
Worst-case optimal join algorithms have gained a lot of attention in the database literature. We now count several algorithms that are optimal in the worst case, and many of them have been implemented...
Language Modeling on Location-Based Social Networks
Authors: Bárbara Poblete, Felipe Bravo-Márquez, Juglar Diaz
Year: 2022Source: ISPRS International Journal of Geo-Information
The popularity of mobile devices with GPS capabilities, along with the worldwide adoption of social media, have created a rich source of text data combined with spatio-temporal information. Text data ...
Ethical Challenges in AI
Authors: Ricardo Baeza-Yates
Year: 2022Source: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining
In the first part we address four current specific challenges through examples: (1) discrimination (e.g., facial recognition, justice, sharing economy, language models); (2) stupid models (e.g., lack ...
Natural Language Processing Of Helpline Chat Data Before And During The Pandemic Revealed Significant Decrease In Self-image Appreciation And Changes In Other Traits
Authors: Susana Eyheramendy, Fernanda Barriga, María P. Raveau et al.
Year: 2022Source: Preprints.org
During the last two years the COVID-19 pandemic has affected the world population in several ways. An important increase in mental health problems is a consequence of this pandemic that is ubiquitous ...
A Survey on Deep Learning and Explainability for Automatic Report Generation from Medical Images
Authors: Denis Parra, Pablo Messina, Álvaro Soto et al.
Year: 2022Source: ACM Computing Surveys
Every year physicians face an increasing demand of image-based diagnosis from patients, a problem that can be addressed with recent artificial intelligence methods. In this context, we survey works in...
For better and for worse: A panel survey of how mobile-only and hybrid Internet use affects digital skills over time
Authors: Sebastián Valenzuela, Teresa Correa, Isabel Pavez
Year: 2022Source: New Media & Society
Public policies across the world are tackling Internet access inequality through mobile connections, which has led to an increase in mobile-only use. However, digital skills remain as a stumbling bloc...
Spanish SciELO Crawled Biomedical Corpus
Authors: Jocelyn Dunstan, Fabián Villena, Carolina Chiu
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
We present a corpus of Spanish medical articles extracted from the SciELO website (https://scielo.cl/). The corpus was constructed using web scraping extraction techniques and consists of 5694 article...
Semantics and canonicalisation of SPARQL 1.1
Authors: Aidan Hogan, Jaime Salas
Year: 2022Source: Semantic Web
We define a procedure for canonicalising SPARQL 1.1 queries. Specifically, given two input queries that return the same solutions modulo variable names over any RDF graph (which we call congruent quer...
Cultural, scientific and technical antecedents of the Cybersyn project in Chile
Authors: Claudio Gutiérrez, Juan David Ortega-Alvarez
Year: 2022Source: AI & Society
Morbimortality assessment in abdominal surgery: are we predicting or overreacting?
Authors: Sebastián Valenzuela, Daniel Rosselló-Jiménez, Ricardo Nassar et al.
Year: 2022Source: BMC Surgery
Background: High-risk surgical procedures represent a fundamental part of general surgery practice due to their significant rates of morbidity and mortality. Different predictive tools have been...
A comprehensive review of the video-to-text problem
Authors: Jorge Pérez, Benjamín Bustos, Ivan Sipiran et al.
Year: 2022Source: Artificial Intelligence Review
CLNews19-20: A new dataset for rumor detection in Spanish
Authors: Marcelo Mendoza, Eliana Providel, Daniel Toro-González et al.
Year: 2022Source: Zenodo (CERN European Organization for Nuclear Research)
We create CLNews, a dataset for rumor detection in Spanish. Based on fact-checking agencies' data, we mapped related tweets to verify news into four categories: non-rumor, true rumor, false rumor, and...
Efficient and compact representations of some non-canonical prefix-free codes
Authors: Gonzalo Navarro, Travis Gagie, Antonio Fariña et al.
Year: 2022Source: Theoretical Computer Science
Reactive and Asymmetric Communication Flows: Social Media Discourse and Partisan News Framing in the Wake of Mass Shootings
Authors: Sebastián Valenzuela, Dhavan V. Shah, Jon Pevehouse et al.
Year: 2022Source: The International Journal of Press/Politics
Marked by both deep interconnectedness and polarization, the contemporary media system in the United States features news outlets and social media that are bound together, yet deeply divided along par...
Universal coding and prediction on ergodic random points
Authors: Lukasz Debowski, Tomasz Steifer
Year: 2022Source: Bulletin of Symbolic Logic
A Universal Screening Tool for Dyslexia by a Web-Game and Machine Learning
Authors: Ricardo Baeza-Yates, Luz Rello, Maria Rauschenberger et al.
Year: 2022Source: Frontiers in Computer Science
Children with dyslexia have difficulties learning how to read and write. They are often diagnosed after they fail school even if dyslexia is not related to general intelligence. Early screening of dys...
Efficient Enumeration Algorithms for Annotated Grammars
Authors: Cristian Riveros, Martín Muñoz, Antoine Amarilli et al.
Year: 2022Source: arXiv (Cornell University)
We introduce annotated grammars, an extension of context-free grammars which allows annotations on terminals. Our model extends the standard notion of regular spanners, and is more expressive than the...
Knowledge-based programs as building blocks for planning
Authors: Jorge Baier, Sheila A. McIlraith
Year: 2022Source: Artificial Intelligence
SHAP Explanations with Boolean Circuit Classifiers
Authors: Leopoldo Bertossi
Year: 2022Source: Dagstuhl Reports
Score-Based Explanations in Data Management and Machine Learning: An Answer-Set Programming Approach to Counterfactual Analysis
Authors: Leopoldo Bertossi
Year: 2022Source: Reasoning Web. Declarative Artificial Intelligence. Reasoning Web 2021
Data Graphs
Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
At the foundation of any knowledge graph is the principle of first applying a graph abstraction to data, resulting in an initial data graph. We now discuss a selection of graph-structured data models ...
Introduction
Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
Though the phrase "knowledge graph" has been used in the literature since at least 1972 [Schneider, 1973], the modern incarnation of the phrase stems from the 2012 announcement of the Google Knowledge...
Quality Assessment
Authors: Eva Blomqvist, Lukas Schmelzeisen, José Emilio Labra Gayo et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
Independent of the (kinds of) source(s) from which a knowledge graph is created, the resulting initial knowledge graph will usually be incomplete, and will often contain duplicate, contradictory or ev...
Publication
Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
While it may not always be desirable to publish knowledge graphs (for example, those that offer a competitive advantage to a company [Noy et al., 2019]), it may be desirable or even required to publish...
Deductive Knowledge
Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
As humans, we can deduce more from the data graph of Figure 2.1 than what the edges explicitly indicate. We may deduce, for example, that the Ñam festival (EID15) will be located ...
Squeeze: Efficient Compact Fractals for Tensor Core GPUs
Authors: Benjamín Bustos, Felipe A. Quezada, Cristóbal A. Navarro et al.
Year: 2022Source: SSRN Electronic Journal
This work presents Squeeze, an efficient compact fractal processing scheme for tensor core GPUs. By combining discrete-space transformations between compact and expanded forms, one can do data-paralle...
A Scalable and Energy Efficient GPU Thread Map for m-Simplex Domains
Authors: Benjamín Bustos, Felipe A. Quezada, Cristóbal A. Navarro et al.
Year: 2022Source: arXiv (Cornell University)
This work proposes a new GPU thread map for $m$-simplex domains, that scales its speedup with dimension and is energy efficient compared to other state of the art approaches. The main contributions of...
Squeeze: Efficient Compact Fractals for Tensor Core GPUs
Authors: Benjamín Bustos, Felipe A. Quezada, Cristóbal A. Navarro et al.
Year: 2022Source: arXiv (Cornell University)
This work presents Squeeze, an efficient compact fractal processing scheme for tensor core GPUs. By combining discrete-space transformations between compact and expanded forms, one can do data-paralle...
Resources for Multilingual Hate Speech Detection
Authors: Jorge Pérez, Bárbara Poblete, Magdalena Saldaña et al.
Year: 2022
Most of the published approaches and resources for hate speech detection are tailored for the English language. In consequence, cross-lingual and cross-cultural perspectives lack some essential resour...
DockerPedia: A Knowledge Graph of Software Images and Their Metadata
Authors: Carlos Buil-Aranda, Daniel Garijo, Maximiliano Osorio et al.
Year: 2022Source: International Journal of Software Engineering and Knowledge Engineering
An increasing amount of researchers use software images to capture the requirements and code dependencies needed to carry out computational experiments. Software images preserve the computational envi...
Knowledge Graphs in Practice
Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
In this chapter, we discuss some of the most prominent knowledge graphs that have emerged in the past years. We begin by discussing open knowledge graphs, most of which have been published on the Web ...
Conclusions
Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
We have provided a comprehensive introduction to knowledge graphs, which have been receiving more and more attention in recent years. Under the definition of a knowledge graph as a graph of data intend...
Knowledge Graph Compression for Big Semantic Data
Authors: Claudio Gutiérrez, Miguel A. Martínez‐Prieto, Javier D. Fernández et al.
Year: 2022Source: Encyclopedia of Big Data Technologies
Educational Tools for Mapuzugun
Authors: Claudio Gutiérrez, Antonios Anastasopoulos, Cristian Ahumada
Year: 2022
Mapuzugun is the language of the Mapuche people. Due to political and historical reasons, its number of speakers has decreased and the language has been excluded from the educational system in Chile a...
Schema, Identity, and Context
Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
In this chapter we describe extensions of the data graph–relating to schema, identity, and context–that provide additional structures for accumulating knowledge. Henceforth, we refer to a data gra...
Creation and Enrichment
Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
In this chapter, we discuss the principal techniques by which knowledge graphs can be created and subsequently enriched from diverse sources of legacy data that range from plain text to structured for...
Inductive Knowledge
Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
While deductive knowledge is characterized by precise logical consequences, inductively acquiring knowledge involves generalizing patterns from a given set of input observations, which can then be use...
Refinement
Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
Beyond assessing the quality of a knowledge graph, there exist techniques to refine the knowledge graph, in particular to (semi-)automatically complete and correct the knowledge graph [Paulheim, 2017...
Knowledge Graphs
Authors: Aidan Hogan, Claudio Gutiérrez, Eva Blomqvist et al.
Year: 2022Source: Synthesis lectures on data, semantics and knowledge
This book provides a comprehensive and accessible introduction to knowledge graphs, which have recently garnered notable attention from both industry and academia. Knowledge graphs are founded on the...
WDBench: A Wikidata Graph Query Benchmark
Authors: Aidan Hogan, Carlos Buil-Aranda, Renzo Angles et al.
Year: 2022Source: Lecture notes in computer science
The Semantic Web – ISWC 2022
Authors: Aidan Hogan, Claudia d’Amato, Giuseppe Pirró et al.
Year: 2022Source: Lecture notes in computer science
The ISWC 2022 proceedings detail advances in research, technology, and applications of the semantic web, linked data, and knowledge graphs on the web.
truthy_direct_properties.nt.bz2
Authors: Aidan Hogan, Renzo Angles, Domagoj Vrgoč et al.
Year: 2022Source: Figshare
Wikidata truthy direct properties
A Critical Analysis of NLP and Clinical Correctness Metrics to Measure Progress on X-Ray Report Generation
Authors: Denis Parra, Jocelyn Dunstan, Cecilia Besa et al.
Year: 2022Source: SSRN Electronic Journal
Background: Radiologists face an increasing demand for image-based diagnosis from patients every year, and computer-aided diagnosis systems seem like a promising way to alleviate their workload. Many a...
LSCDiscovery: A shared task on semantic change discovery and detection in Spanish
Authors: Felipe Bravo-Márquez, Frank D. Zamora-Reina, Dominik Schlechtweg
Year: 2022
We present the first shared task on semantic change discovery and detection in Spanish and create the first dataset of Spanish words manually annotated for semantic change using the DURel framework Th...
DockerPedia: A Knowledge Graph of Software Images and Their Metadata
Authors: Maximiliano Osorio, Carlos Buil-Aranda, Idafen Santana-Perez et al.
Year: 2022Source: International Journal of Software Engineering and Knowledge Engineering
Clinical Flair: A Pre-Trained Language Model for Spanish Clinical Natural Language Processing
Authors: Jocelyn Dunstan, Fabián Villena, Matías Rojas
Year: 2022
Word embeddings have been widely used in Natural Language Processing (NLP) tasks. Although these representations can capture the semantic information of words, they cannot learn the sequence-level sem...
Divide and Conquer: An Extreme Multi-Label Classification Approach for Coding Diseases and Procedures in Spanish
Authors: Jocelyn Dunstan, Andrés Abeliuk, Matías Rojas et al.
Year: 2022
Clinical coding is the task of transforming medical documents into structured codes following a standard ontology. Since these terminologies are composed of hundreds of codes, this problem can be cons...
A Knowledge-Graph-Based Intrinsic Test for Benchmarking Medical Concept Embeddings and Pretrained Language Models
Authors: Jocelyn Dunstan, Fabián Villena, Matías Rojas et al.
Year: 2022
Using language models created from large data sources has improved the performance of several deep learning-based architectures, obtaining state-of-the-art results in several NLP extrinsic tasks. Howe...
Assessing the Limits of Straightforward Models for Nested Named Entity Recognition in Spanish Clinical Narratives
Authors: Jocelyn Dunstan, Matías Rojas, Aitor Gonzalez‐Agirre et al.
Year: 2022
Nested Named Entity Recognition (NER) is an information extraction task that aims to identify entities that may be nested within other entity mentions. Despite the availability of several corpora with...
Space-efficient conversions from SLPs
Authors: Gonzalo Navarro, Travis Gagie
Year: 2022Source: arXiv (Cornell University)
We give algorithms that, given a straight-line program (SLP) with $g$ rules that generates (only) a text $T [1..n]$, builds within $O(g)$ space the Lempel-Ziv (LZ) parse of $T$ (of $z$ phrases) in tim...
L-systems for Measuring Repetitiveness
Authors: Gonzalo Navarro, C. Urbina
Year: 2022Source: arXiv (Cornell University)
An L-system (for lossless compression) is a CPD0L-system extended with two parameters $d$ and $n$, which determines unambiguously a string $w = \tau(\varphi^d(s))[1:n]$, where $\varphi$ is the morphis...
Compact data structures to represent spatial hierarchical structures
Authors: Gonzalo Navarro, Diego Seco, M. Andrea Rodríguez et al.
Year: 2022Source: Figshare
Implementation of three compact data structures to represent spatial hierarchical structures, with applications on the topological model. Datasets are also included.
Compact data structures to represent spatial hierarchical structures
Authors: Gonzalo Navarro, Diego Seco, M. Andrea Rodríguez et al.
Year: 2022Source: Figshare
Implementation of three compact data structures to represent spatial hierarchical structures, with applications on the topological model. For replication of the experiments, please, check the file ...
Near-Optimal Search Time in $\delta$-Optimal Space
Authors: Gonzalo Navarro, Tomasz Kociumaka, Francisco Javier Vidal Olivares
Year: 2022Source: Lecture notes in computer science
Balancing Run-Length Straight-Line Programs
Authors: Gonzalo Navarro, Francisco Javier Vidal Olivares, C. Urbina
Year: 2022Source: Lecture notes in computer science
Balancing Run-Length Straight-Line Programs
Authors: Gonzalo Navarro, Francisco Javier Vidal Olivares, C. Urbina
Year: 2022Source: arXiv (Cornell University)
It was recently proved that any SLP generating a given string $w$ can be transformed in linear time into an equivalent balanced SLP of the same asymptotic size. We show that this result also holds for...
Computing MEMs and Relatives on Repetitive Text Collections
Authors: Gonzalo Navarro
Year: 2022Source: arXiv (Cornell University)
We consider the problem of computing the Maximal Exact Matches (MEMs) of a given pattern $P[1 .. m]$ on a large repetitive text collection $T[1 .. n]$, which is represented as a (hopefully much smalle...
Near-Optimal Search Time in $δ$-Optimal Space, and Vice Versa
Authors: Tomasz Kociumaka, Francisco Javier Vidal Olivares, Gonzalo Navarro
Year: 2022Source: arXiv (Cornell University)
Two recent lower bounds on the compressibility of repetitive sequences, $\delta \le \gamma$, have received much attention. It has been shown that a length-$n$ string $S$ over an alphabet of size $\sig...
Empirical Evaluation of Machine Learning Ensembles for Rumor Detection
Authors: Eliana Providel, Andrés Zapata, Marcelo Mendoza
Year: 2022Source: Lecture notes in computer science
An Algebra for Path Manipulation in Graph Databases
Authors: Renzo Angles, Roberto García
Year: 2022Source: Lecture notes in computer science
Lenguajes y modelos subyacentes a los grafos de conocimiento
Authors: Renzo Angles
Year: 2022Source: Actas del Congreso Internacional de Ingeniería de Sistemas
A knowledge graph is a large database that integrates information from different data sources, with the goal of extracting knowledge and turning it into value for users. ...
Bots don't Vote, but They Surely Bother! A Study of Anomalous Accounts in a National Referendum
Authors: Ricardo Baeza-Yates, Eduardo Graells-Garrido
Year: 2022
Source: arXiv (Cornell University)
The Web contains several social media platforms for discussion, exchange of ideas, and content publishing. These platforms are used by people, but also by distributed agents known as bots. Although bo...
Improving Matrix-vector Multiplication via Lossless Grammar-Compressed Matrices
Authors: Gonzalo Navarro, Travis Gagie, Dominik Köppl et al.
Year: 2022
Source: arXiv (Cornell University)
As nowadays Machine Learning (ML) techniques are generating huge data collections, the problem of how to efficiently engineer their storage and operations is becoming of paramount importance. In this ...
Replication Data for: Corruption and Political Knowledge Erosion. A Cautionary Tale from Latin America
Authors: Sebastián Valenzuela, Matías Bargsted, Ingrid Bachmann
Year: 2022
Source: Harvard Dataverse
This study employs two-wave face-to-face panel survey data collected by the authors in Santiago, Chile. The survey employed a probability-based sample and is representative of all adul...
An Empirical Evaluation of k-Means Coresets
Authors: Gonzalo Navarro, Eva Rotenberg, Grzegorz Herman et al.
Year: 2022
Source: Research Portal Denmark
Coresets are among the most popular paradigms for summarizing data. In particular, there exist many high performance coresets for clustering problems such as k-means in both theory and practice. Curio...
A Local Search Algorithm for Large Maximum Weight Independent Set Problems
Authors: Gonzalo Navarro, Nikos Parotsidis, Yuanyuan Dong et al.
Year: 2022
Source: Research Portal Denmark
Motivated by a real-world vehicle routing application, we consider the maximum-weight independent set problem: Given a node-weighted graph, find a set of independent (mutually nonadjacent) nodes whose...
Streaming Enumeration on Nested Documents
Authors: Cristian Riveros, Martín Muñoz
Year: 2022
Source: arXiv (Cornell University)
Some of the most relevant document schemas used online, such as XML and JSON, have a nested format. In the last decade, the task of extracting data from nested documents over streams has become especi...
Scaling up ML-based Black-box Planning with Partial STRIPS Models
Authors: Jorge Baier, Matias Greco, Hector H. Palacios et al.
Year: 2022
Source: arXiv (Cornell University)
A popular approach for sequential decision-making is to perform simulator-based search guided with Machine Learning (ML) methods like policy learning. On the other hand, model-relaxation heuristics ca...
Amplifying Counter-Public Spheres on Social Media: News Sharing of Alternative Versus Traditional Media After the 2019 Chilean Uprising
Authors: Sergio Toro, Sebastián Valenzuela, Juan Pablo Luna
Year: 2022
Source: Social Media + Society
While much research exists on the role of digital media use in protest movements, few studies compare the long-term impact of protests on online use of alternative and mainstream digital media. This h...
The Impact of United States Engagement with Chile: 2000-2020
Authors: Juan Pablo Luna, Bruna Fonseca de Barros
Year: 2022
Source: SSRN Electronic Journal
This study examines the diversity, scale, and impacts of efforts undertaken by the U.S. government and civil society to boost prosperity in Chile. It provides quantitative assessments of resource flow...
Graph Path Navigation
Authors: Pablo Barceló, Marcelo Arenas, Leonid Libkin
Year: 2022
Source: Encyclopedia of Big Data Technologies








