Published in SPIRE, 2021
Approximate membership query (AMQ) structures, such as Cuckoo filters or Bloom filters, are widely used to represent large sets of elements. Their success stems from their small memory footprint: they are often the only practical way to scale to hundreds of billions or trillions of elements. However, they inherently produce unavoidable false positives, which bias the downstream analyses of methods that rely on these data structures.
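As an illustration of the false positives mentioned above, here is a minimal Bloom filter sketch in Python. It is not the implementation used in the paper: the class name, the parameters (num_bits, num_hashes) and the choice of salted BLAKE2b hashing are assumptions made only for this example.

```python
import hashlib
from typing import Iterable, Iterator


class BloomFilter:
    """Illustrative Bloom filter (hypothetical names, not the paper's code)."""

    def __init__(self, num_bits: int, num_hashes: int) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        # One byte per bit keeps the sketch readable; a real filter packs bits.
        self.bits = bytearray(num_bits)

    def _positions(self, item: str) -> Iterator[int]:
        # Derive num_hashes positions by salting one hash function.
        for seed in range(self.num_hashes):
            digest = hashlib.blake2b(
                item.encode(), salt=seed.to_bytes(8, "little")
            ).digest()
            yield int.from_bytes(digest[:8], "little") % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos] = 1

    def __contains__(self, item: str) -> bool:
        # True if every probed bit is set: may be a false positive,
        # but is never a false negative.
        return all(self.bits[pos] for pos in self._positions(item))


def false_positive_rate(bf: BloomFilter, absent_items: Iterable[str]) -> float:
    """Fraction of items known to be absent that the filter still reports."""
    absent = list(absent_items)
    return sum(1 for item in absent if item in bf) / len(absent)
```

Inserted items are always reported as present. Items that were never inserted may also be reported as present when all of their probed bits happen to be set by other items, and this becomes more likely as the filter fills up.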
Published in Bioinformatics, 2023
High-throughput sequencing technologies generate massive amounts of biological sequence data as costs fall. One of the current algorithmic challenges in exploiting these data on a global scale is to provide efficient query engines for petabyte-scale datasets. Most methods that index such datasets rely on indexing words of fixed length k, called k-mers. Many applications, such as metagenomics, require the abundance of indexed k-mers in addition to their mere presence or absence, but no method scales to petabyte-scale datasets. This limitation arises primarily because storing abundance requires explicit storage of the k-mers in order to associate them with their counts. Counting Approximate Membership Query (cAMQ) data structures, such as counting Bloom filters, provide a way to index large numbers of k-mers with their abundance, but at the expense of a significant false positive rate.
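To make the trade-off described above concrete, the following Python sketch shows a simple counting Bloom filter that stores k-mer abundances. It is an illustration under stated assumptions, not the structure used in the paper: the names CountingBloomFilter, kmers, num_counters and num_hashes are hypothetical, and the hashing scheme is chosen only for simplicity.

```python
import hashlib
from typing import Iterator, Set


def kmers(sequence: str, k: int) -> Iterator[str]:
    """Yield every word of length k in the sequence, in order."""
    for i in range(len(sequence) - k + 1):
        yield sequence[i:i + k]


class CountingBloomFilter:
    """Illustrative counting Bloom filter (hypothetical names)."""

    def __init__(self, num_counters: int, num_hashes: int) -> None:
        self.num_counters = num_counters
        self.num_hashes = num_hashes
        self.counters = [0] * num_counters

    def _positions(self, item: str) -> Set[int]:
        # A set avoids incrementing one counter twice for the same item.
        positions = set()
        for seed in range(self.num_hashes):
            digest = hashlib.blake2b(
                item.encode(), salt=seed.to_bytes(8, "little")
            ).digest()
            positions.add(int.from_bytes(digest[:8], "little") % self.num_counters)
        return positions

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.counters[pos] += 1

    def estimate(self, item: str) -> int:
        # The minimum counter is an upper bound on the true count:
        # collisions can only inflate counters, never decrease them.
        return min(self.counters[pos] for pos in self._positions(item))


def index_sequence(cbf: CountingBloomFilter, sequence: str, k: int) -> None:
    """Record the abundance of every k-mer of the sequence in the filter."""
    for kmer in kmers(sequence, k):
        cbf.add(kmer)
```

The k-mers themselves are never stored, which is what keeps the structure small. The cost is that an estimate can exceed the true abundance, and a k-mer that was never indexed can receive a non-zero count.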
Published:
This talk presented the proof of concept of a “query time filtration” algorithm which, when applied to an AMQ, reduces its false positive rate.
Published:
This talk was about findere, our published paper at the SPIRE conference.
Published:
This talk was about findere and the work in progress on fimpera.
Published:
This talk was about findere, our first published paper.
Published:
This talk was about fimpera, an algorithm that wraps data structures recording the abundance of k-mers in biological sequences. fimpera reduces the overestimations caused by the underlying data structure it wraps.
Published:
This talk was about findere, our first published paper.
Published:
This talk was about fimpera.
course, Rennes 1, ISTIC, 2021
INF1 is the Java course taught during the first semester to undergraduate students at Rennes University.
course, Rennes 1, ISTIC, 2022
INF2 is an IT course taught during the second semester to undergraduate students at Rennes University.
course, ENSAI, 2022
I taught a C++ course during a semester of practical work sessions.
course, ENSAI, 2023
I taught an introduction to LaTeX during a semester of practical work sessions.