KRnet

INDEX

  • Àλ縻
  • Á¶Á÷Á¶Á÷
  • ÇÁ·Î±×·¥ÇÁ·Î±×·¥
  • µî·Ï/Çà»ç¾È³»µî·Ï/Çà»ç¾È³»
  • °Ô½ÃÆÇ °Ô½ÃÆÇ
  • Past KRnet
  • ¼¼ºÎÇÁ·Î±×·¥

     

    [B1: AI for Networking] Reinforcement Learning: An Introduction and Applications in Communication Networks
    °ü¸®ÀÚ (krnet) ÀÛ¼ºÀÏ : 2019-04-03 13:58:22 Á¶È¸¼ö : 530
    ÄÚµå¹øÈ£ : 11
    ¹ßÇ¥ÀÚ : ÀÌÁÖÇö
    ¼Ò¼Ó : ÇѾç´ëÇб³
    ºÎ¼­ : ÀüÀÚ°øÇкÎ
    Á÷À§ : ±³¼ö
    ¼¼¼Ç½Ã°£ : 6¿ù 24ÀÏ(¿ù) 09:00~10:50
    ¹ßÇ¥ÀÚ¾à·Â : 2018 – ÇöÀç, ÇѾç´ëÇб³ Á¶±³¼ö
    2014 – 2018, The Ohio State University, Postdoctoral Researcher
    2014 – 2014, KAIST, Postdoctoral Researcher
    2014, KAIST Àü±â¹×ÀüÀÚ°øÇаú °øÇйڻç
    2008, KAIST Àü±â¹×ÀüÀÚ°øÇаú °øÇлç
    °­¿¬¿ä¾à : °­È­ÇнÀ(Reinforcement Learning)Àº ÀüÅëÀûÀÎ ÀΰøÁö´É ¿¬±¸ÀÇ ÇÑ ÃàÀ¸·Î ¾ËÆÄ°í¿Í °°Àº ´ëÇ¥ÀûÀÎ ÀΰøÁö´É ¾Ë°í¸®Áò¿¡ ¾²ÀÌ´Â ÇÙ½ÉÀÌ·ÐÀÌ´Ù. ±âº»ÀûÀÎ °­È­ÇнÀÀÇ ¸ñÇ¥´Â ÇöÀçÀÇ »óÅÂ(State)¸¦ °üÂûÇÏ°í ¼±Åà °¡´ÉÇÑ Çൿ(Action)µé Áß ´©Àû º¸»ó(Cumulative Reward)À» ÃÖ´ëÈ­ÇÏ´Â ÇൿÀ» ¼±ÅÃÇÏ´Â °ÍÀÌ´Ù. ÃÖ±Ù ÀΰøÁö´É¿¡ ´ëÇÑ Æø¹ßÀûÀÎ °ü½É¿¡ µû¶ó, °ÔÀÓ, º¸ÇàµîÀÇ °íÀüÀûÀÎ ÀÀ¿ë»Ó¸¸ ¾Æ´Ï¶ó Åë½Å ³×Æ®¿öÅ©¿¡ °­È­ÇнÀÀ» Àû¿ëÇÑ »ç·Ê°¡ Áõ°¡ÇÏ°í ÀÖ´Ù. º» °­ÀÇ¿¡¼­´Â ´ÙÁß ½½·Ô¸Ó½Å(Multi-armed Bandits), ¸¶¸£ÄÚºê °áÁ¤ °úÁ¤(Markov Decision Process) µîÀÇ ±âÃÊÀûÀÎ °­È­ÇнÀ ¸ðµ¨°ú ´ëÇ¥ÀûÀÎ ¾Ë°í¸®ÁòÀ» ¼Ò°³ÇÏ°í Åë½Å ³×Æ®¿öÅ©¸¦ Áß½ÉÀ¸·Î ÀÀ¿ë »ç·ÊµéÀ» ¾Ë¾Æº»´Ù.
    ¿Â¶óÀÎÇà»çÀå :
    ¿Â¶óÀιßÇ¥Àå :
    .
    ¸ñ·Ïº¸±â

    TOP