Close
Register
Close Window

Senior Algorithms

Chapter 6 Limits to Computing

Show Source |    | About   «  6.21. Unsolveable Problems   ::   Contents   ::   6.23. Turing Machines: Advanced Topics  »

6.22. Introduction to Turing Machines

6.22.1. A General Model of Computation

In this module we seek to define a general model of computation that is as simple as possible. The reason is that we want to be able to understand the limits of what is possible in computing, but that is rather hard to do with a complicated definition for a “computer” is. But then, we need to be confident that whatever model we do pick, that it actually represents all of the fundamental capabilities of a “computer”. Specifically, we want a model that is capable of computing any funtion that our “regular” computers can compute.

“State machines” are simple to understand. There are a number of different state machines, with a range of capabilities. We will discuss a particular one, called a Turing machine.

If you take a Formal Languages course, you will learn about a number of simple state machine types, like deterministic and non-deterministic finite automata, and push-down automata. These machines all have some similarities. They all take an input string. They all march across the string from left to right, one character at each step. They stop when they reach the end of the string and then make a simple decision: If the machine is in a final state, then it accepts the string, and otherwise it rejects the string. In other words, they are acceptors for some language. All that they can do is determine if a string is a member of the language (of course that is a pretty useful ability), and the “more powerful” machines can accept or reject strings from more “complicated” languages.

But there is a lot more that we typically expect when we talk about “computation”. Like, well, the ability to “compute” rather than simply accept. What does “compute” actually mean? For example, we would like to be able to take as input the arithmetic expression “\(2 + 3\)” and do more than decide if it is a syntactically correct arithmetic statement. We would like to be able to compute the answer: 5. This means that we need to be able to output that computed value. Or to be precise, we “compute” a value by taking the input string “\(2 + 3\)” and replacing it with the output string “5”.

In its most general form, we can think of everything that a computer does as taking some string as input, and then providing some string as output. Of course, modern peripheral devices like keyboards, mice, and computer monitors give us rich ways to express input strings (perhaps as button presses), and rich ways to interpret output strings (say, as pictures). But it’s not a huge stretch of the imagination to see that there is no loss of ability if we abstract all computation as converting an input string to an output string.

This concept of converting is far more powerful than simple accepting. A machine that takes an input and provides an output is called a transducer.

In the next section, we will introduce a simple machine, called the Turing Machine, that is a transducer. It is only slightly more complicated than other state machines. These differences might appear suprisingly small considering how significant they are. Ultimately, we will see that a Turing Machine can do any computation (in its fundamental abilty to convert one string into another) that even the most sophisticated modern computer can do.

The first difference is memory. DFAs (and NFAs) have no memory beyond whatever process took them to their current state. PDAs have a stack. This made a huge difference in the languages that can be accepted: DFAs only accept regular languages, while (nondeterministic) PDAs accept any CFL.

Consider these three languages:

  • \(L_1 = \{wcw^R\}\) for \(w\) in \(\Sigma^*, \Sigma = \{a, b\}\).

  • \(L_2 = \{ww^R\}\) for \(w\) in \(\Sigma^*, \Sigma = \{a, b\}\).

  • \(L_1 = \{ww\}\) for \(w\) in \(\Sigma^*, \Sigma = \{a, b\}\).

The differences seem pretty minor, but the outcomes are quite different. \(L_1\) is a deterministic CFL. \(L_2\) is a non-determistic CFL. \(L_3\) is not a CFL at all.

The difference between \(L_3\) and the others seems to relate to the limitiations of the stack. As we read the first \(w\) and load it into the stack, we don’t have a good way to get to the bottom of the stack to see the first letter to compare it against the first letter of the second \(w\). But what if we considered other memory models? For example, a queue would solve our problem nicely. But then, a queue would not be able to handle \(L_2\). Two stacks could simulate a queue (do you see how?), and so a machine with two stacks can handle both \(L_1\) and \(L_2\). Perhaps we should investigate what class of languages a two stack machine can handle? Perhaps we should consider other memory models?

As it turns out, none of these ideas are as effective as the simple memory model that the Turing Machine uses.

6.22.2. Turing Machines

Settings

Proficient Saving... Error Saving
Server Error
Resubmit

6.22.3. Interpreting Turing Machines

Next we will look at notation for discussing the concept of configurations and transistions between configurations for Turing machines. We will investigate more about the conventions of halting, accepting, and computing for Turing machines. Finally, we will present notation for doing real computation on numbers.

Settings

Proficient Saving... Error Saving
Server Error
Resubmit

6.22.4. Turing-Decidable vs. Turing-Acceptable Languages

Settings

Proficient Saving... Error Saving
Server Error
Resubmit

   «  6.21. Unsolveable Problems   ::   Contents   ::   6.23. Turing Machines: Advanced Topics  »

Close Window