José Carlos Almeida Santos

Contact
Mobile: +44 7794244865 / +351 919172725
E-mail: [email protected]/[email protected]
WWW: www.doc.ic.ac.uk/~jcs06

Education
Imperial College London, United Kingdom
Ph.D., Computer Science, 2010
• Thesis: Efficient Learning and Evaluation of Complex Concepts in Inductive Logic Programming
• Area: Machine Learning
• Supervisor: Professor Stephen H. Muggleton
• Examiners: Professor Peter Flach and Professor Marek Sergot
M.Sc. (1st year of the 4-year PhD programme), Bioinformatics, 2007
• Project: Predicting anti-cancer molecule activity using machine learning algorithms
• Supervisor: Professor Michael J. E. Sternberg
• Examiners: Professor Simon Colton and Professor Michael Stumpf

Universidade Nova de Lisboa, Portugal
M.Sc., Artificial Intelligence, 2006
• Thesis: Mining Protein Structure Data
• Supervisor: Professor Pedro Barahona
• Co-supervisor: Professor Ludwig Krippahl
• Examiner: Professor Arlindo Oliveira
• Grade: 18/20
Licenciatura (5-year degree), Informatics Engineering, 2004
• The "Licenciatura" degree consisted of ≈ 50 semester-long courses providing a solid background in the main areas of Computer Engineering and Mathematics.
• Grade: 16/20 (best student out of around 130 in the 1999-2004 Licenciatura)

Awards/Honours
• September 2006 - Wellcome Trust Scholarship for the 4-year PhD programme at Imperial College.
• June 2004 - Most Valuable Student award given by Microsoft.

Refereed Journal Papers
José C. A. Santos, Houssam Nassif, David Page, Stephen H. Muggleton and Michael J. E. Sternberg. Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study. BMC Bioinformatics, 13:162, 2012 (open access article).
Jianzhong Chen, Stephen Muggleton, and José Carlos Almeida Santos. Learning probabilistic logic models from probabilistic examples. Machine Learning, 73(1):55-85, 2008 (UKPMC pdf).
Industry Experience
Microsoft London Search Technology Center
Software Development Engineer, January 2012 - present
• Development and improvement of Bing's query rewriting module for the main European markets, leading to more relevant search results. Optimization of the Combined Alterations (CAL) building pipeline. Mining of search logs.

Microsoft Portugal
Post-doctoral researcher at MLDC, January 2011 - December 2011
• Post-doctoral researcher at the Microsoft Language Development Center (MLDC), working with the Query Rewriting group of the Munich Search Technology Center. The main work was on improving the relevance of Bing's Combined Alterations (CAL) module for Portuguese. The CAL module is responsible for expanding a query so that it conveys more meanings.

Department of Computing, Imperial College, London
Teaching assistant of Prolog, Introduction to Artificial Intelligence and Introduction to Bioinformatics, 2007-2010
• Prolog: 2009/2010
• Introduction to Artificial Intelligence: 2007/2008
• Introduction to Bioinformatics: 2007/2008
• Helped students solve the course exercises. Marked coursework and exams.

Microsoft USA, Redmond
Software Design Engineer intern, July 2005 - September 2005
• Software Design Engineer summer intern at Microsoft in Redmond, Washington, USA. Worked in the Windows Server Clusters - High Availability team, helping to develop the Cluster Management GUI for Longhorn Server. The main work was the development of controls, an error reporting infrastructure with Watson integration, and product stabilization. The code developed shipped with Longhorn Server (Windows Server 2008).

Novabase Business Intelligence
Junior consultant, August 2004 - June 2005
• Developed a tool for Caixa Geral de Depósitos (the largest Portuguese bank) to automate the monthly processing of hundreds of Excel spreadsheets. This C# tool programmatically called the Excel API and an external OLAP plugin to execute a set of complex operations on the workbooks.
  The tool required no human intervention and, as of 2010, was still in production.
• Development of a Java framework to extend the HTML rendering framework of MicroStrategy.
• Development of a Data Quality tool (for cleaning data, matching similar records, etc.) to compete with QualityStage. Responsible for the whole GUI, which was written from scratch: about 15,000 lines of C# and 1,000 lines of C++.
Intern, April 2004 - July 2004
• Intern doing the final undergraduate project, titled "Decision Support System for Stock Market Investment using Data Mining", which earned a grade of 19/20. The project covered many aspects: automatic retrieval of data from the web into a database (SQL Server 2000), computation of financial indicators, statistical treatment, data mining with SPSS Clementine, and development of a GUI and simulation tool in the .NET Framework (C#).

Portuguese Competition Authority
Programmer, July 2003 - August 2003
• Development of a company merger and acquisition simulator implementing the Cournot and Perry-Porter models. The simulator was developed under the guidance of Economics Professor Duarte Brito. As of 2010 it was still in use at the Competition Authority.

Department of Computer Science, FCT, Universidade Nova de Lisboa
Teaching assistant of Programming I and Programming II courses, 2002-2006
• Programming I (C++): 2002/2003, 2003/2004, 2004/2005
• Programming II (advanced C++): 2001/2002, 2002/2003, 2003/2004, 2005/2006
• Responsible for the practical component of the courses (3-6 hours per week).
• Helped students solve the course exercises. Marked coursework and exams.

Programming Contests
ACM International Programming Contests
Leader of the Caparica Lions team, 2001 - 2004
• Represented my university, FCT-UNL, in 4 editions of the Portuguese InterUniversities Programming Contest (PIUP), in 3 editions of the National Contest in Logic Programming (NCLP) and in 4 editions of the ACM Southwestern European Regional Contest (SWERC).
• The best results were 3rd place at PIUP 2002, 3rd place at NCLP 2004 and 14th place at SWERC 2001.
• Over 400 ACM programming problems solved individually (mainly in C) and verified to be correct through their automated judging system. These problems cover algorithms, data structures and mathematical concepts fundamental to Computer Science, such as graph search, sorting, primality tests, congruences, matrix operations, backtracking, permutations, geometry, binary trees, tries and hash tables.

Portuguese Informatics Olympiads
Member of the organization, 2001-2004
• Together with Professor Pedro Guerreiro and Pedro Ribeiro, I was responsible for the coaching and scientific preparation of the Portuguese team for the International Olympiad in Informatics.
• Member of the jury
Contestant, 1998
• Second place at the Portuguese Informatics Olympiads, representing Portugal at the 1998 International Olympiad in Informatics.

Misc.
• 7 peer-reviewed scientific articles, see here for full academic profile.
• Full member of the Portuguese Engineer's Guild since 2005.
• Native Portuguese speaker. International English Language Testing System (IELTS): 7.0/9.0.
• Author of several free games with some elements of Artificial Intelligence: Chess, Sudoku and Maze Trap. Maze Trap is an Android strategic action game I developed as a hobby. The game has over 5000 downloads in the Android Market.
• Interest in the stock market and statistical arbitrage.