Introduction to Spark for Machine Learning
Description
- These are the slides from the 2021 Workshop ‘Introduction to Spark for Machine Learning’, which was part of the series ‘Applying and deploying Artificial Intelligence (AI) in GLAMs’ organised by AI4LAM (Teaching and Learning Working Group) and co-hosted by LIBER and the BnF.
Apache Spark is a popular, open-source machine learning tool with many relevant applications for the GLAM community. With a local deployment and a modestly-sized dataset, it can be used as a teaching/training tool for machine learning. It can also be used as part of a production system or research project with very large datasets and can be deployed economically to cloud clusters. This workshop will demonstrate Spark’s machine learning capabilities, and help participants determine if it would be a good fit for their projects.
The aim of the workshop series is to provide training opportunities for those interested in applying and deploying Artificial Intelligence (AI) in Libraries, Galleries, Archives, and Museums. The series will bring together a diverse community of experts with subject and domain expertise, as well as technologists across GLAM institutions for a collaborative learning event to share tools and experiences and to reflect on the process of applying AI and its implications for GLAM institutions.
Files
Files
(13.1 MB)
Name | Size | Download all |
---|---|---|
md5:e7b244b6d1d8cac88324c971d92ddebc
|
13.1 MB | Download |