Course Content
Introduction to Artificial Intelligence
About this learning activity Less than a century old, artificial intelligence (AI) has already undergone three waves of transformative development. Today it gives humanity the most powerful tools for analyzing complex data, not only to find meaning but to learn without human intervention. In this course, you'll survey AI's history and explore ways that it can shed light on unstructured data. What you'll learn After completing this course, you should be able to: Define artificial intelligence Describe three levels of artificial intelligence Describe the history of AI from the past to the possible future Define and describe machine learning Differentiate between structured and unstructured data Describe how machine learning structures data Describe how machine learning structures unstructured data Describe how machine learning uses probabilistic calculation to solve problems Describe three methods by which machine learning analyzes data Describe an ideal relationship between humans and machine learning
0/41
Introduction to Large language models
Welcome to Introduction to Large Language Models! In this module, you'll learn what large language models are, how they work, and some typical business applications. Estimated duration 30 minutes
0/8
IBM – Getting Started with Artificial Intelligence

Reinforcement learning is a machine learning model similar to supervised learning, but the algorithm isn’t trained using sample data. This model learns as it goes by using trial and error. A sequence of successful outcomes is reinforced to develop the best recommendation for a given problem. The foundation of reinforcement learning is rewarding the “right” behavior and punishing the “wrong” behavior.

You might be wondering, what does it mean to “reward” a machine? Good question! Rewarding a machine means that you give your agent positive reinforcement for performing the “right” thing and negative reinforcement for performing the “wrong” things. 

 

As a machine learns through trial and error, it tries a prediction, then compares it with data in its corpus

  • Each time the comparison is positive, the machine receives positive numerical feedback, or a reward.
  • Each time the comparison is negative, the machine receives negative numerical feedback, or a penalty.

Over time, a machine’s predictions will grow to be more accurate. It accomplishes this automatically based on feedback, rather than through human intervention.

Scroll to Top