I led Coveo’s A.I. and MLOps roadmap from scale-up to IPO, and built out Coveo Labs, an applied R&D practice rooted in word-class collaborations (Stanford, Bocconi, Outerbounds, Uber, Microsoft, NVIDIA), open source and open science.
I talk a lot, and I’m often invited to do so by folks in industry (BBC, Walmart, Pinterest, eBay, Farfetch) and academia (SIRIP, CiE, KDD, Stanford).
I’m currently a proud NLP advisor for Civic Eagle, and also Professor of ML at NYU, which is mostly notable because it is the only job I ever had that my parents (sort of) understand.
Selected talks, papers and datasets are highlighted (for the brave reader) at the very end of this page.
RecList is a testing library for recommender systems: RecList spawned a popular open source package, a CIKM competition, and a paper at WWW 2022. RecList successfully raised funds from MLOps companies to sponsor its open development.
Having built end-to-end systems at garage, scale-up and IPO scale, I had the privilege of making a lot of mistakes in DataOps and MLops. To share my learnings, I introduced the Reasonable Scale ML in “You don’t need a bigger boat” (and related articles).
Current interests: ML testing, developing in the Modern Data Stack.
My recent research is in (mostly) applied and (sometimes) theoretical topics at the intersection of language, learning and retrieval.
I am co-organizer of SIGIR eCom, Industry Sponsorship Chair for CIKM 2022, and I’ve been involved in organizing many NLP events (COLING, ECONLP, ECNLP, EMNLP). My work has been presented in venues such as NAACL, WWW, RecSys: our work on cognitively-inspired query embeddings won the Best Paper Award at NAACL 21.
As a true SFI alumnus, I am an old-fashioned generalist, and I gave small contributions as papers, projects or reviews to a bunch of topics outside “traditional A.I.”: computational social sciences, agent-based models, urban studies, philosophy of mind.
Current interests: multi-modal representations.
In previous lives, I managed to get a Ph.D., work for a professional basketball team, simulate a pre-Columbian civilization and give an academic talk on videogames (among others improbable “achievements”).
Last update: August 2022.
Quick links to selected talks, papers, datasets: if there’s a paper, talk, slide deck you know I have, but you can’t find it (here or elsewhere), please do get in touch directly.
Aside from research and tutorials, our datasets have been successfully used by dozens of master students to defend their thesis at Tillburg University and Politecnico in Milan.