Question

Please answer in typed text.

What is the idea behind MDPs and Reinforcement Learning being useful procedures in AI?

Use real-life examples and engage in self-reflection, both common practices among researchers developing new AI techniques.

Select a problem that may arise in the real world and that can be addressed using MDPs and/or Reinforcement Learning.

Solutions

Expert Solution

Reinforcement learning - Reinforcement learning is an artificial intelligence technique in which an agent learns what to do, and what not to do, in a given situation by trying actions repeatedly and observing the results. It has three main components: state, action (also called activity), and reward. As a real-world example, take a simple robot that is trying to walk on a flat surface.

State - The state is the current situation in the learning process. In our example, the state is the position of the robot's legs as it tries to walk without falling.

Action (activity) - An action is a decision about what to do in a particular state. In our example, the robot decides how long a step to take so that it does not fall, or how fast to walk without losing its balance.

Reward - The reward is the feedback the agent receives after taking a particular action in the current state. It can be positive or negative: a positive reward encourages the action, while a negative reward acts as a punishment. This feedback is used to adjust the agent's behavior before it takes further actions.

In complex real-world problems, the agent has to choose the action that is expected to yield the highest reward, based on the information currently available.

Reinforcement learning is mainly used in gaming, online advertising, robotics applications, etc.
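As an illustration of the state, action, and reward loop described above, here is a minimal sketch of tabular Q-learning with epsilon-greedy action selection. It is not taken from the original answer: the toy task (a robot stepping left or right along five positions to reach a goal at the far end), the function names `step`, `choose_action`, and `train`, and all parameter values are assumptions chosen only for demonstration.

```python
import random

N_STATES = 5          # hypothetical positions 0..4 on a line; position 4 is the goal
ACTIONS = (0, 1)      # 0 = step left, 1 = step right
GOAL = N_STATES - 1


def step(state, action):
    """Toy environment: return (next_state, reward, done) for one move."""
    next_state = max(0, state - 1) if action == 0 else min(GOAL, state + 1)
    done = next_state == GOAL
    reward = 1.0 if done else 0.0   # positive feedback only when the goal is reached
    return next_state, reward, done


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy: explore with probability epsilon, otherwise act greedily."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(q[state])
    # Break ties randomly so the agent does not get stuck before it has learned anything.
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn a table q[state][action] of estimated long-term rewards."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(200):  # cap on steps per episode, as a safety limit
            action = choose_action(q, state, epsilon, rng)
            next_state, reward, done = step(state, action)
            # Q-learning update: move the estimate toward reward + discounted best future value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


q_table = train()
# Greedy policy for every non-goal position.
policy = ["left" if q[0] > q[1] else "right" for q in q_table[:GOAL]]
print(policy)
```

The `epsilon` parameter controls how often the agent tries a random action instead of the one it currently believes is best; without some exploration it could settle on a poor choice early and never discover a better one.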

MDPs - A Markov decision process (MDP) is a tool for modeling decision problems under uncertainty, such as searching for an object.

For example, suppose a robot is walking through a maze to reach a destination. At each step it needs to decide which direction to take, left or right, so that it gets the maximum reward. An MDP lets us model this choice and find the best one.
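The maze example can be made concrete with a small sketch that solves a Markov decision process by value iteration. Everything specific here is an assumption made for illustration and does not come from the original answer: the 3x3 maze layout, the rewards (+10 for entering the goal, -1 for every other step), the 20% chance that the robot slips and stays in place, and the names `transitions`, `q_value`, and `value_iteration`.

```python
# Hypothetical maze: S = start, G = goal, # = wall, . = free cell.
GRID = ["S..",
        ".#.",
        "..G"]
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}
SLIP = 0.2    # assumed probability that a move fails and the robot stays put
GAMMA = 0.9   # discount factor for future rewards

ROWS, COLS = len(GRID), len(GRID[0])
STATES = [(r, c) for r in range(ROWS) for c in range(COLS) if GRID[r][c] != "#"]
GOAL = next(s for s in STATES if GRID[s[0]][s[1]] == "G")


def transitions(state, action):
    """Return a list of (probability, next_state, reward) for one move."""
    dr, dc = MOVES[action]
    cell = (state[0] + dr, state[1] + dc)
    target = cell if cell in STATES else state  # walls and edges block the move
    outcomes = []
    for prob, nxt in ((1 - SLIP, target), (SLIP, state)):
        reward = 10.0 if nxt == GOAL else -1.0
        outcomes.append((prob, nxt, reward))
    return outcomes


def q_value(values, state, action):
    """Expected reward of taking `action` in `state`, then following `values`."""
    return sum(p * (rew + GAMMA * values[nxt])
               for p, nxt, rew in transitions(state, action))


def value_iteration(tolerance=1e-6):
    """Repeat Bellman updates until the state values stop changing."""
    values = {s: 0.0 for s in STATES}
    while True:
        delta = 0.0
        for s in STATES:
            if s == GOAL:
                continue  # the goal is terminal; its value stays 0
            best = max(q_value(values, s, a) for a in MOVES)
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tolerance:
            return values


values = value_iteration()
# In each cell, pick the move with the highest expected long-term reward.
policy = {s: max(MOVES, key=lambda a: q_value(values, s, a))
          for s in STATES if s != GOAL}
for s in sorted(policy):
    print(s, policy[s], round(values[s], 2))
```

Unlike the trial-and-error setting of reinforcement learning, this sketch assumes the transition probabilities and rewards are known in advance, which is what allows the best move in each cell to be computed directly.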

