LINES OF RESEARCH

Developing new models to integrate and describe complex, heterogeneous data poses new challenges for efficient, secure and correct implementation. This area is therefore devoted to the study of the algorithms, data structures and software solutions needed to implement effective storage schemes for complex data repositories.

We will focus on:

  • Flexible querying and manipulation in highly complex data scenarios: We will investigate how to represent, index and query legacy data stored in different formats. This problem has not been well explored, yet it is crucial for handling complex and heterogeneous data.
  • Efficient management of high-volume data repositories: We will exploit the growing gap between access times at different levels of the memory hierarchy by developing compressed data structures that operate within entropy-bounded space. In addition, to support data mining operations, we will investigate indexes that break the entropy barrier by discarding data and supporting only aggregate queries.
  • Enforcement and verification of correct access and privacy: We will build on the body of techniques developed in the areas of programming languages and program verification, with a view to developing techniques for verifying expressive properties of data-centric systems, such as functional correctness, differential privacy and confidentiality.
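
As a rough illustration of what "entropy-bounded space" means in the second item, the sketch below compares a fixed-width encoding of a sequence with its zero-order empirical entropy bound, n * H0 bits. It is only a minimal example under our own assumptions: the function names are hypothetical, zero-order entropy is just one of several entropy measures used for compressed data structures, and the code computes the bound rather than building a structure that achieves it.

```python
import math
from collections import Counter


def zero_order_entropy(seq):
    """Empirical zero-order entropy H0 of seq, in bits per symbol."""
    n = len(seq)
    if n == 0:
        return 0.0
    counts = Counter(seq)
    # H0 = sum over symbols of (c/n) * log2(n/c)
    return sum((c / n) * math.log2(n / c) for c in counts.values())


def plain_bits(seq):
    """Bits used by a fixed-width encoding: n * ceil(log2(sigma)), at least 1 bit per symbol."""
    sigma = len(set(seq))
    if sigma == 0:
        return 0
    width = max(1, math.ceil(math.log2(sigma)))
    return len(seq) * width


def entropy_bound_bits(seq):
    """Zero-order entropy bound n * H0, in bits (hypothetical helper for illustration)."""
    return len(seq) * zero_order_entropy(seq)


if __name__ == "__main__":
    skewed = "aaaaaaab"  # highly skewed symbol distribution
    print(plain_bits(skewed), entropy_bound_bits(skewed))
```

The more skewed the symbol distribution, the further the entropy bound falls below the fixed-width size, which is the space that compressed data structures aim to approach while still answering queries.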

Partner universities