Marian-Andrei Rizoiu

Professional Webpage

Presenting at the conference ICTAI 2012, November, Athens


My research revolves around artificial intelligence, machine learning and data mining. More specifically, I am interested in Social Media Analysis, Information Diffusion Models, Popularity Modelling and Prediction.

My research project aims to link the dynamics of collective human attention to the individual actions of the users of online platforms. The benefit of my research is to understand novel societal, such as the spread of misinformation and the role of social bots in recent political elections.

See more about my research.


2018-06: Just returned from ICWSM 2018 in Palo Alto, California, where our team presented three papers. I also visited and gave invited talks at Netflix Research and Facebook Core Research. See more details here.

2018-05: After the WWW 2018 conference in Lyon, France, I have visited for a week the Max Planck Institute for Software Systems in Kaiserslautern, Germany, hosted by the team of Manuel Gomez Rodriguez, one of the top groups world-wide in stochastic modeling.

Back to Top

My Publications.

Conference proceedings


Book chapters

PhD thesis

Masters thesis

Back to Top

Research topics.

Online social media is increasingly prevalent in shaping “offline” events, ranging from Twitter’s role in the 2016 US presidential campaign to the recent allegations that Facebook was used to convey hateful and racist messages towards the Rohingya minority in Myanmar. My research aims to tackle the grand challenge of assessing and mitigating the impact of online social media on the fundamental processes of our society. By successfully modeling and predicting the dynamics of online human attention, the benefit of my research is to understand novel societal and economic “offline” phenomena, like the spread of misinformation and the role of social bots in recent political elections, the link between the spread of hateful messages and violences towards minorities, or the emergence of disruptive business models like Uber or Airbnb.

A little more details

My current research interest is to model theoretically popularity on online media, as well as estimate the influence of media content and network characteristics on online attention. We established a generative model that predicts online attention, based on an exogenously-driven Hawkes self-exciting processes. We also examine the geographical diffusion of media content over time and the goal is to generate statistical descriptions of content diffusion over time and geographical areas. We are handling very large Twitter datasets (the network), which relate to Youtube videos (the content).

My previous work dealt with how partial expert information can be leveraged into a non-supervised learning algorithm that treats complex data. This complex data is of different natures (text, image), it is temporal and structured, linked to knowledge repository (e.g. ontology) and/or labeled. Semi-supervised clustering is used to model the additional information (structure, labels, time) and to inject the heterogeneous information into the learning algorithm. A series of application emerge from the theoretical research: using the temporal dimension to detect temporal patters and typical evolutions, using the image labels to improve image numerical representation and an automatic topic evaluation using concept trees.

Science slam

A science slam is the challenge to describe a research topic to a non-expert, with a twist of humor. At the ICWSM'17 science slam I talked about how to link exogenous stimuli and endogenous reactions to explain online popularity.

Back to Top

Grants & funding, projects and invited talks

Competitive grants and funding

Research projects

Seminars and invited talks

Back to Top


Advanced Databases
and Data Mining

ANU Undergraduate

This course is for the third year of undergraduates in the Research School of Computer Science. It presents relational theory and conceptual modelling; privacy and security; statistical databases; distributed databases; data warehousing; data cleaning and integration; and data mining concepts and techniques. I give lectures concerning databases and data warehousing and data cleaning and integration.

Document Analysis

ANU Postgraduate

This course is for the third year of undergraduates in the Research School of Computer Science, as well as Honnors students. It presents techniques related to processing online document, such as (A) information retrieval, (B) natural language processing, (C) machine learning for documents, and (D) relevant tools for the Web. I give lectures concerning the machine learning part and the social media and sentiment analysis part.

Numerical Machine Learning

European M2 DMKM

This course is for the second year in the Excellence European Master DMKM. It presents advanced machine learning techniques. Together with S. Lallich , we present association rules mining and class rules mining, ensemble methods (bagging, boosting) and statistical testing procedures (cross-validation, student t-test, etc.).

I give practical lectures for this course.

Back to Top


Since March 2016, I am a Lecturer (previously Research Fellow) with the College of Engineering and Computer Science at the Australian National University in Canberra. I am equally affiliated with the Data61 unit of CSIRO, in the Decision Sciences team.


Between May 2014 and February 2016,I was a researcher within the National ICT Australia in Canberra Australia, working in the Optimization Research Group. I was equally an adjunct lecturer with the College of Engineering and Computer Science at the Australian National University in Canberra.

Between September 2013 and May 2014, I was a PostDoctoral researcher with the ERIC Laboratory, financed by the ImagiWeb Research Project. I was equally an assistant professor with the DIS Department, at the University Lumière in Lyon.

Between 2009 and 2013, I was a PhD student at the ERIC Laboratory with the University Lumière Lyon2, under the supervision of Stéphane Lallich and Julien Velcin. I defended my PhD thesis on June 24th 2013, with honors "Très Honorable".

In July 2009, I obtained my MSc (graduating first of promotion, with honors.) in Data Mining and Knowledge Management from the Polytechnic School of the University of Nantes, France and wrote my Master’s Thesis on "Textual Data Clustering and Cluster Naming" after an internship at the ERIC Laboratory.

Between 2004 and 2009, I did my undergrad and obtained in September 2009 my Engineer Diploma (double diploma, in parallel with the French Master's) in System and Computer Engineering from the Faculty of Automatic Control and Computers of the Polytechnic University Bucharest, Romania.

In 2004, I did my Scientific Baccalaureate in Mathematics and Computer Science, Romania.

Back to Top