Journal article Open Access

Multi-Armed Bandits for Intelligent Tutoring Systems

Clement, Benjamin; Roy, Didier; Oudeyer, Pierre-Yves; Lopes, Manuel


JSON-LD (schema.org) Export

{
  "inLanguage": {
    "alternateName": "eng", 
    "@type": "Language", 
    "name": "English"
  }, 
  "description": "<p>We present an approach to Intelligent Tutoring Systems which adaptively personalizes sequences of learning activities to maximize skills acquired by students, taking into account the limited time and motivational resources. At a given point in time, the system proposes to the students the activity which makes them progress faster. We introduce two algorithms that rely on the empirical estimation of the learning progress, RiARiT that uses information about the difficulty of each exercise and ZPDES that uses much less knowledge about the problem. The system is based on the combination of three approaches. First, it leverages recent models of intrinsically motivated learning by transposing them to active teaching, relying on empirical estimation of learning progress provided by specific activities to particular students. Second, it uses state-of-the-art Multi-Arm Bandit (MAB) techniques to efficiently manage the exploration/exploitation challenge of this optimization process. Third, it leverages expert knowledge to constrain and bootstrap initial exploration of the MAB, while requiring only coarse guidance information of the expert and allowing the system to deal with didactic gaps in its knowledge. The system is evaluated in a scenario where 7-8 year old schoolchildren learn how to decompose numbers while manipulating money. Systematic experiments are presented with simulated students, followed by results of a user study across a population of 400 school children.</p>", 
  "license": "https://creativecommons.org/licenses/by-nc-nd/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "Inria, Bordeaux", 
      "@type": "Person", 
      "name": "Clement, Benjamin"
    }, 
    {
      "affiliation": "Inria, Bordeaux", 
      "@type": "Person", 
      "name": "Roy, Didier"
    }, 
    {
      "affiliation": "Inria, Bordeaux", 
      "@type": "Person", 
      "name": "Oudeyer, Pierre-Yves"
    }, 
    {
      "affiliation": "Inria, Bordeaux", 
      "@type": "Person", 
      "name": "Lopes, Manuel"
    }
  ], 
  "headline": "Multi-Armed Bandits for Intelligent Tutoring Systems", 
  "image": "https://zenodo.org/static/img/logos/zenodo-gradient-round.svg", 
  "datePublished": "2015-06-18", 
  "url": "https://zenodo.org/record/3554668", 
  "version": "1.0.0", 
  "keywords": [
    "intelligent tutoring systems", 
    "multi-armed bandits", 
    "personalization", 
    "intrinsic motivation", 
    "active teaching", 
    "active learning"
  ], 
  "@context": "https://schema.org/", 
  "identifier": "https://doi.org/10.5281/zenodo.3554668", 
  "@id": "https://doi.org/10.5281/zenodo.3554668", 
  "@type": "ScholarlyArticle", 
  "name": "Multi-Armed Bandits for Intelligent Tutoring Systems"
}
120
11
views
downloads
All versions This version
Views 120120
Downloads 1111
Data volume 88.6 MB88.6 MB
Unique views 107107
Unique downloads 1010

Share

Cite as