2018 CODATA-RDA School of Research Data Science

December 3 – 14, 2018

São Paulo, Brazil

ICTP-SAIFR/NCC-UNESP

Home

The ever-accelerating volume and variety of data being generated is having a huge impact on a wide variety of research disciplines, from the sciences to the humanities. The international, collective ability to create, share and analyse vast quantities of data is having a profound, transformative effect. This ‘Data Revolution’ offers great opportunities for students with modern data skills, both in conducting their research and in entering a jobs market where those skills are in demand.

Contemporary research – particularly when addressing the most significant, transdisciplinary research challenges – cannot be done effectively without a range of skills relating to data. This includes the principles and practice of Open Science and research data management and curation, the use of a range of data platforms and infrastructures, large scale analysis, statistics, visualisation and modelling techniques, software development and annotation and more. We define ‘Research Data Science’ as the ensemble of these skills.

The International School on Data Science will focus on growing competence in accessing, analyzing, visualising, and publishing data. It is open to participants from all disciplines and/or background from the sciences to humanities. This activity will cover topics on principles and practice of Open Science, research data management and curation, use of a range of research compute infrastructures, large scale analysis, statistics, visualisation and modeling techniques, automation and scripting.

There is no registration fee. Limited funds are available to support accommodation, subsistence and travel support. For financial reasons, it is likely that priority will be given to travel support for students from the South America region, except in the case of particularly exceptional students.

Applicants from any region are strongly encouraged to look for alternative sources of funding to support their participation. For the recent CODATA-RDA School of Research Data Science in Trieste a large number of students were able to support their participation with funding from a variety of sources including their institutions, various grants, charities and sponsoring organisations etc

The training provided by the CODATA-RDA School of Research Data Science is primarily targeted at Early Career Researchers (advanced masters students, doctoral candidates, post-docs and young or early career academics). The data skills taught are also useful for (data) librarians and other research support staff, such as those who envisage a career as data steward or data analysts. Furthermore, people who are more advanced in their careers who would like to improve their data skills as a form of continuing professional development are also eligible.

Applicants are expected to have a baseline of data skills and these are tested by an online form. In addition, applicants should pay particular attention to their personal statement and communicate persuasively their reasons for wishing to attend the School: how do they intend to use these skills, how will it benefit their research or the institution in which they work? Finally, candidates should take pains to ensure that their application is well supported by references from their past or present tutors or line managers. This is particularly important so that the School directors have confidence in the candidate and that the skills learnt will have the maximum benefit and impact.

 

Announcement

Online application is now closed. 

Directors:

  • Marcela Alfaro Córdoba (University of Costa Rica)
  • Simon Hodson (CODATA, Committee on Data of the International Council for Science, France)
  • Sarah Jones (Digital Curation Centre, University of Glasgow, UK)
  • Robert E Quick (Indiana University, USA)
  • Hugh Shanahan (Royal Holloway, University of London, UK)

Local Organizers:

  • Nathan Berkovits (ICTP-SAIFR/IFT-UNESP, Brazil)
  • Raphael Cobe (NCC, Brazil)
  • Ivan Girotto (ICTP-Trieste, Italy)
  • Rogério Iope (NCC, Brazil)
  • Beraldo Leal (NCC, Brazil)
  • Sérgio Novaes (NCC and SPRACE, Brazil)

Lecturers

Lecturers:

  • Marcela Alfaro (Universidad de Costa Rica, Costa Rica)
  • Raphael Cobe (NCC, Brazil)
  • Steve Diggs (University of California, USA)
  • Filipe Fernandes (NOOA/IOOS & Software Carpentry, Brazil)
  • Robert E Quick (Indiana University, USA)
  • Natália da Silva (Universidad de La República, Uruguay)

Photos

2018 CODATA-RDA School of Research Data Science

Program

School program: PDF updated on Dec. 3, 2018

Link to school material: here

Monday, 3 December 2018

09:30 – 10:30 Registration, Administrative, and Financial formalities

11:00 – 12:30 Introduction and Open Science 1 (Marcela Alfaro Córdoba) Class

12:30 – 13:30 Lunch

13:30 – 17:30 The Unix Shell (SW Carpentry – Filipe Fernandes) Lab

15:00 – 15:30 Coffee break

15:30 – 17:30 The Unix Shell (SW Carpentry – Filipe Fernandes) Lab

 

Tuesday, 4 December 2018

9:30 – 11:00 Version Control with Git (SW Carpentry – Filipe Fernandes) Lab

11:00 – 11:30 Coffee break

11:30 – 12:30 Version Control with Git (SW Carpentry – Filipe Fernandes) Lab

12:30 – 13:30 Lunch

13:30 – 15:00 Introduction to R (R-ladies SãoPaulo) Lab

15:00 – 15:30 Coffee break

15:30 – 17:30 Introduction to R (R-ladies SãoPaulo) Lab

 

Wednesday, 5 December 2018

09:30 – 11:00 Introduction to  R (R-ladies SãoPaulo) Lab

11:00 – 11:30 Coffee break

11:30 – 12:30 Introduction to  R (R-ladies SãoPaulo) Lab

12:30 – 13:30 Lunch

13:30 – 15:00 Introduction to  R (R-ladies SãoPaulo) Lab

15:00 – 15:30 Coffee break

15:30 – 17:30 Introduction to  R (R-ladies SãoPaulo) Lab

 

Thursday, 6 December 2018

09:30 – 11:00 Data Visualisation (Natalia da Silva) Lab

11:00 – 11:30 Coffee break

11:30 – 12:30 Data Visualisation (Natalia da Silva) Lab

12:30 – 13:30 Lunch

13:30 – 15:00 Data Visualisation (Natalia da Silva) Lab

15:00 – 15:30 Coffee break

15:30 – 17:00 Data Visualisation (Natalia da Silva) Lab

17:00 – 17:30 Short Intro to RDM  (Steve Diggs) Lab

 

Friday, 7 December 2018

9:30 – 11:00 Research Data Management (Marcela Alfaro Córdoba) Class

11:00 – 11:30 Coffee break

11:30 – 12:30 Research Data Management (Steve Diggs) Lab

12:30 – 13:30 Lunch

13:30 – 15:00 Research Data Management (Steve Diggs) Lab

15:00 – 15:30 Coffee break

15:30 – 17:30 Research Data Management (Steve Diggs) Lab

 

Monday, 10 December 2018

9:30 – 10:30 Data Science in Industry: Itaú Bank – Class

10:30 – 11:00 CODATA-RDA intro session (Marcela Alfaro Córdoba/Rob Quick) Class

11:00 – 11:30 Coffee break

11:30 – 12:30 Open Science 2 (Marcela Alfaro Córdoba) Class

12:30 – 13:30 Lunch

13:30 – 15:00 Information Security (Raphael Cóbe/Rob Quick) Class + Lab

15:00 – 15:30 Coffee break

15:30 – 17:30 Information Security(Raphael Cóbe/Rob Quick) Class + Lab

 

Tuesday, 11 December 2018

09:30 – 11:00 Overview of Machine Learning (Raphael Cóbe) Lab

11:00 – 11:30 Coffee break

11:30 – 12:30 Overview of Machine Learning (Raphael Cóbe) Lab

12:30 – 13:30 Lunch

13:30 – 15:00 Overview of Machine Learning (Raphael Cóbe) Lab

15:00 – 15:30 Coffee break

15:30 – 17:30 Overview of Machine Learning (Raphael Cóbe) Lab

 

Wednesday, 12 December 2018

09:30 – 11:00 Artificial Neural Networks (Raphael Cóbe)  Lab

11:00 – 11:30 Coffee break

11:30 – 12:30 Artificial Neural Networks (Raphael Cóbe)  Lab

12:30 – 13:30 Lunch

13:30 – 15:00 Artificial Neural Networks (Raphael Cóbe)  Lab

15:00 – 15:30 Coffee break

15:30 – 17:30 Artificial Neural Networks (Raphael Cóbe)  Lab

 

Thursday, 13 December 2018

9:30 – 11:00 Research Computational Infrastructure (Rob Quick) Lab

11:00 – 11:30 Coffee break

11:30 – 12:30 Research Computational Infrastructure (Rob Quick) Lab

12:30 – 13:30 Lunch

13:30 – 15:00 Research Computational Infrastructure (Rob Quick) Lab

15:00 – 15:30 Coffee break

15:30 – 17:30 Research Computational Infrastructure (Rob Quick) Lab

 

Friday, 14 December 2018

9:30 – 11:00 Research Computational Infrastructure (Rob Quick) Lab

11:00 – 11:30 Coffee break

11:30 – 12:30 Research Computational Infrastructure (Rob Quick) Lab

12:30 – 13:30 Workshop wrapup (Rob Quick) Class

 

Additional Information

Registration: ALL participants should register. The registration will be on December 03 at the institute at 09:30 am.

List of Participants: Updated on November 26, 2018.

Accommodation: Participants, whose accommodation will be provided by the institute, will stay at The Universe Flat. Hotel recommendations are available  here

How to reach the Institute: The workshop will be held at ICTP South American Institute, located at IFT-UNESP, which is across the street from a major bus and subway terminal (Terminal Barra Funda). The address which is closer to the entrance of the IFT-UNESP building is R. Jornalista Aloysio Biondi, 120 – Barra Funda, São Paulo. The easiest way to reach us is by subway or bus, please find instructions here.

Yellow fever vaccination is recommended for travellers going to Brazil. Note that the vaccine needs to be taken at least ten days before the trip to be effective. Information: https://wwwnc.cdc.gov/travel/notices/alert/yellow-fever-brazil

Ground transportation instructions: 

Ground transportation from Guarulhos Airport to The Universe Flat

Ground transportation from Congonhas Airport to the Universe Flat

Ground transportation from The Universe Flat to the institute