
Python-Based Reinforcement Learning Algorithms - Learn DP, Q-Learning, AC, DQN, TRPO, PPO, DDPG, TD3, Imitation Learning, and ESBAS

List price: 30,000 won

Authors: Andrea Lonza (Author), Jeong Sabeom (Translator)

Publisher: Acorn Publishing

Publication date: 2021-08-25

ISBN : 9791161755571 / K292733231

Where to buy

  • Publisher
  • YES24
  • Aladin
  • Kyobo
  • Interpark
  • Bandi & Luni's
  • Youngpoong Bookstore

About the Book

Python-Based Reinforcement Learning Algorithms - Learn DP, Q-Learning, AC, DQN, TRPO, PPO, DDPG, TD3, Imitation Learning, and ESBAS



You and I, who say we are fine when we are not, and the gloom we all live with. And depression, the illness of the mind that this feeling brings. A much-discussed channel




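Table of Contents

Part 1. Algorithms and Environments



Chapter 1. Overview of Reinforcement Learning

__Introduction to Reinforcement Learning

____Comparing Reinforcement Learning and Supervised Learning

____History of Reinforcement Learning

____Deep Reinforcement Learning

__Elements of Reinforcement Learning

____Policy

____Value Function

____Reward

____Model

__Applications of Reinforcement Learning

____Games

____Robotics and Industry 4.0

____Machine Learning

____Economics and Finance

____Healthcare

____Intelligent Transportation Systems

____Energy Optimization and Smart Grids

__Summary

__Questions

__Further Reading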



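Chapter 2. Implementing the Reinforcement Learning Cycle and OpenAI Gym



__Setting Up the Environment

____Installing OpenAI Gym

____Installing Roboschool

__OpenAI Gym and the Reinforcement Learning Cycle

____Developing a Reinforcement Learning Cycle

____Getting Used to Spaces

____TensorFlow 2.X

________Eager Execution

________AutoGraph

__Developing Machine Learning Models with TensorFlow

____Tensors

________Constants

________Variables

________Creating a Graph

____A Simple Linear Regression Example

____Introducing TensorBoard

__Types of Reinforcement Learning Environments

____Why Different Environments?

____Open Source Environments

__Summary

__Questions

__Further Reading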



Chapter 3. Solving Problems with Dynamic Programming (DP)

__MDP

____Policy

____Discount Rate and Return

____Value Function

____Bellman Equation

__Categorizing Reinforcement Learning Algorithms

____Model-Free Algorithms

________Value-Based Algorithms

________Policy Gradient Algorithms

________Actor-Critic Algorithms

________Hybrid Algorithms

____Model-Based Reinforcement Learning

____Algorithm Diversity

__DP

____Policy Evaluation and Policy Improvement

____Policy Iteration

________Policy Iteration Applied to FrozenLake

____Value Iteration

________Value Iteration Applied to FrozenLake

__Summary

__Questions

__Further Reading



Part 2. Model-Free Reinforcement Learning Algorithms



Chapter 4. Q-Learning and SARSA Applications



__Learning Without a Model

____User Experience

____Policy Evaluation

____The Exploration Problem

________Why Explore?

________How to Explore

__Temporal Difference Learning

____Temporal Difference Update

____Policy Improvement

____Comparing Monte Carlo and Temporal Difference

__SARSA

____Algorithm

__Applying SARSA to Taxi-v2

__Q-Learning

____Theory

____Algorithm

__Applying Q-Learning to Taxi-v2

____Comparing SARSA and Q-Learning

__Summary

__Questions



Chapter 5. Deep Q-Network



__Deep Neural Networks and Q-Learning

____Function Approximation

____Q-Learning with Neural Networks

____Instabilities of Deep Q-Learning

__DQN

____The Solution

________Replay Memory

________Target Network

____The DQN Algorithm

________Loss Function

________Pseudocode

____Model Architecture

__Applying DQN to Pong

____Atari Games

____Preprocessing

____DQN Implementation

________DNN

________Experience Buffer

________Computational Graph and Training Loop

____Results

__DQN Variations

____Double DQN

________DDQN Implementation

________Results

____Dueling DQN

________Dueling DQN Implementation

________Results

____N-Step DQN

________Implementation

________Results

__Summary

__Questions

__Further Reading



Chapter 6. Learning Stochastic PG Optimization



__Policy Gradient Methods

____The Gradient of the Policy

____Policy Gradient Theorem

____Computing the Gradient

____Policy

____On-Policy PG

__Understanding the REINFORCE Algorithm

____Implementing REINFORCE

____Landing a Spacecraft Using REINFORCE

________Analyzing the Results

__REINFORCE with Baseline

____Implementing REINFORCE with Baseline

__Learning the AC Algorithm

____Using a Critic to Help the Actor Learn

____n-step AC Model

____AC Implementation

____Landing a Spacecraft Using AC

____Advanced AC Tips and Tricks

__Summary

__Questions

__Further Reading



Chapter 7. TRPO and PPO Implementation



__Roboschool

____Controlling a Continuous System

__Natural Policy Gradient

____The Idea Behind NPG

____Mathematical Concepts

________FIM and KL Divergence

____NG Problems

__TRPO

____The TRPO Algorithm

____Implementing the TRPO Algorithm

____TRPO Applications

__Proximal Policy Optimization

____Overview of PPO

____The PPO Algorithm

____PPO Implementation

____PPO Applications

__Summary

__Questions

__Further Reading



Chapter 8. DDPG and TD3 Applications



__Combining Policy Gradient Optimization with Q-Learning

____Deterministic Policy Gradient

____The DDPG Algorithm

____DDPG Implementation

____Applying DDPG to BipedalWalker-v2

__TD3 Policy Gradient

____Addressing Overestimation Bias

________TD3 Implementation

____Addressing Variance Reduction

________Delayed Policy Updates

________Target Regularization

____Applying TD3 to BipedalWalker

__Summary

__Questions

__Further Reading



Part 3. Beyond Model-Free Algorithms and Improvements



Chapter 9. Model-Based Reinforcement Learning



__Model-Based Methods

____A Broad Perspective on Model-Based Learning

________Known Model

________Unknown Model

____Advantages and Disadvantages

__Combining Model-Based and Model-Free Learning

____A Useful Combination of Model-Based and Model-Free Approaches

____Building a Model from Images

__ME-TRPO Model Applied to an Inverted Pendulum

____Understanding ME-TRPO

____Implementing ME-TRPO

____Experimenting with Roboschool

________Results of the Roboschool Inverted Pendulum Experiment

__Summary

__Questions

__Further Reading



Chapter 10. Imitation Learning with the DAgger Algorithm



__Technical Requirements

____Installing Flappy Bird

__The Imitation Approach

____The Driving Assistant Example

____Comparing IL and RL

____The Role of the Expert in Imitation Learning

____The IL Structure

________Comparing Passive and Active Imitation

__Playing Flappy Bird

____How to Use the Environment

__Understanding the Dataset Aggregation Algorithm

____The DAgger Algorithm

____DAgger Implementation

________Loading the Expert Inference Model

________Creating the Learner's Computational Graph

________Creating the DAgger Loop

____Analyzing the Flappy Bird Results

__IRL

__Summary

__Questions

__Further Reading



Chapter 11. Understanding Black-Box Optimization Algorithms



__Alternatives to Reinforcement Learning

____A Brief Recap of Reinforcement Learning

____The Alternative

________EAs

__The Core of EAs

____Genetic Algorithms (GA)

____Evolution Strategies

________CMA-ES

________ES versus RL

__Scalable Evolution Strategies

____The Core

________Parallelizing ES

________Other Tricks

________Pseudocode

____Scalable Implementation

________Main Function

________Workers

__Applying Scalable ES to LunarLander

__Summary

__Questions

__Further Reading



Chapter 12. Developing the ESBAS Algorithm



__Exploration versus Exploitation

____Multi-Armed Bandit

__Approaches to Exploration

____Greedy Strategy

____The UCB Algorithm

________UCB1

____Exploration Complexity

__ESBAS

____Exploring Algorithm Selection

____Inside ESBAS

____Implementation

____Running Acrobot

________Results

__Summary

__Questions

__Further Reading



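Chapter 13. Practical Implementation for Solving Reinforcement Learning Problems

__Best Practices of Deep Reinforcement Learning

____Choosing the Appropriate Algorithm

____Developing Reinforcement Learning Algorithms

__Challenges in Deep Reinforcement Learning

____Stability and Reproducibility

____Efficiency

____Generalization

__Advanced Techniques

____Unsupervised Reinforcement Learning

________Intrinsic Reward

____Transfer Learning

________Types of Transfer Learning

__Reinforcement Learning in the Real World

____Problems to Solve When Applying Reinforcement Learning in the Real World

____Bridging the Gap Between Simulation and the Real World

____Creating Your Own Environment

__The Future of Reinforcement Learning and Its Impact on Society

__Summary

__Questions

__Further Reading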

About the Authors

Andrea Lonza (Author)



Jeong Sabeom (Translator)
The translator has a strong interest in decision-making and optimization methodology, and works on solving the problems at hand using the wide variety of data that exists in the world. The translator is grateful for the knowledge of data collection, cleaning, analysis, and reporting gained through many books and hands-on field experience. For Acorn Publishing, the translator has translated "Catching Up with RStudio" (2013), "The R Book (Second Edition), Korean Edition" (2014), "Practical Techniques for Predictive Analytics Modeling" (2014), "Data Mining: Concepts and Techniques" (2015), "Solving Math with Python" (2016), "Data Storytelling" (2016), "Using Object-Oriented Programming in R" (2016), "Introduction to Python Programming" (2016), "Industry 4.0 with the Industrial Internet (IIoT)" (2017), "Mastering Django" (2017), "Deep Learning and Reinforcement Learning Implemented with TensorFlow" (2017), and "Machine Learning Algorithms" (2019).


Other Books by the Author

Python-Based Reinforcement Learning Algorithms - Learn DP, Q-Learning, AC, DQN, TRPO, PPO, DDPG, TD3, Imitation Learning, and ESBAS

Andrea Lonza (Author), Jeong Sabeom (Translator)
30,000 won

Acorn Publishing

Other Books from the Publisher

AI and Data Science APIs - Python Development Learned Through Practical Examples with FastAPI

Ryan Day (Author), Lee Han-woo, Kim Gyeong-hwan, Jang Gi-sik, Jeong In-hwa, Heo Ok (Translators)
33,000 won

Acorn Publishing

Python AI Application Development - Customized Intelligent Services Built with LLMs and Vector Databases

Rachelle Palmer, Ben Perlmutter, Ashwin Gangadhar, Nicholas Larew, Sigfrido Narváez, Thomas Rueckstiess, Henry Weller, Richmond Alake, Shubham Ranjan (Authors), Tech Trans Group T4 (
33,000 won

Acorn Publishing

C Programming for Those Who Gave Up on Programming


48,000 won

Acorn Publishing