Who


Francisco Pereira. If you would like to contact me, try the most obvious address at the most obvious commercial provider.
There are other places where you can find me online: LinkedIn, Twitter, Bluesky, Mastodon. I may not have much to say, though!

Where

I lead the Machine Learning Core at the National Institute of Mental Health (NIMH).

Prior to this, I was a researcher at Medical Imaging Technologies, Siemens Healthcare, where I managed the Computational Neuroscience program. I was also the PI of a team in the IARPA Knowledge Representation in Neural Systems program. I was a postdoc in the Botvinick Lab (and a frequent lurker in the Computational Memory Lab) at the Princeton Neuroscience Institute. I got my Ph.D. in the Computer Science Department at CMU, working with Tom Mitchell and Geoff Gordon. I was also a student in the graduate training program of the Center for the Neural Basis of Cognition, working in collaboration with the Center for Cognitive Brain Imaging. I got my undergraduate degree at the Computer Science Department of the University of Porto, in Portugal.

What

My primary role is leading the NIMH Machine Learning Core, a research group that I started in 2018. The mission of the Machine Learning Core is to support researchers in the NIMH and NIDA intramural research programs who want to address research problems in psychology, psychiatry, and neuroscience, using statistics and machine learning approaches. We do this by consulting with individual researchers and guiding them in the use of the appropriate tools and methods, or by taking on the analysis process ourselves, if this is more expedient. In parallel, we are a machine learning research group and, as such, develop new methods and analysis approaches, motivated by the needs of researchers or by the practical possibilities arising from advances in the field. In general, I am very interested in the use of artificial intelligence and machine learning to augment human capability, in domains such as scientific discovery or decision making.

My main interest within cognitive (neuro)science is in the question of how semantic knowledge is represented in the brain, and is evoked and transformed by language. I have worked on methods to extract semantic information from text corpora and combine it with structured knowledge databases, in order to build models of human performance on various semantic tasks. I also worked on methods to relate models of semantic mental processes to brain imaging data, validating those models through brain decoding tasks.

Publications, preprints, and software from our research group

(please see my Google Scholar for more citation details, or MLC for other publications by members of the group)

preprints under review software packages
papers about methods and tools papers about applications

Publications prior to NIMH

(from my time in industry, postdoc, or graduate school)