Easiest to see with user input.
The user types, say, 1 character per second on average.
The CPU takes 100 microseconds
to put the character into a memory buffer.
So for 999,900 out of every 1,000,000 microseconds (99.99 % of the time), the CPU is doing nothing.
This holds even for non-user I/O.
e.g. A read from disk is very slow
compared to the speed at which the CPU can work.
Even a read from RAM is slow
compared to the speed at which the CPU can work.
The CPU could be doing other jobs while the read is going on.
CPU is kept free (responsive to user, able to run other code)
while device I/O is going on.
Modern OS driven by interrupts.
Interrupts are to do with the massive asymmetry between CPU speeds and device speeds.
They are how the OS program runs on hardware with slow devices.
Interrupts (IRQs)
on Windows 10.
My Computer - Properties - Device Manager - View - Resources by Type - Interrupt Request (IRQ)
Note IRQ 0 - System timer.
Examples of Interrupts
Completion of user input (every key press)
causes interrupt.
Sometimes each key press is handled just
by hardware device controller, which builds up a buffer,
and even handles line editing itself -
and only when you (say) hit return at end of command-line
is interrupt sent to OS.
Imagine a typical command-line:
The shell process may have waited hours for completion
of that input.
Completion of any type of Input or Output.
Deliberate system call in a program causes interrupt
(essentially, the start of all I/O).
System calls:
Process - exit, load, execute, wait, signal, alloc
Info - query date, set date, query-set system info
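For example, a minimal C sketch (POSIX, Linux-style) of some process-related system calls in action - fork to create a process, execlp to load and execute a program (ls -l is just an example command), and wait to wait for it to finish:

#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>
#include <sys/wait.h>

int main(void)
{
    pid_t pid = fork();                   /* system call: create a new process */
    if (pid == 0) {
        execlp("ls", "ls", "-l", (char *) NULL);   /* system call: load and execute a program */
        perror("execlp");                 /* only reached if the exec fails */
        exit(1);
    }
    wait(NULL);                           /* system call: wait for the child to finish */
    printf("Child finished.\n");
    return 0;
}

Each of these calls traps into the OS; the program cannot create or run processes on its own.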
A program might just be in a long loop.
We have no way of knowing whether it is infinite or just long.
But still, even if it is a long loop, control must switch occasionally
- time-slicing.
It is a timer interrupt that switches control.
An infinite loop simply runs forever, time-sliced alongside other programs.
Unsolved problems in mathematics.
Note that if you could detect an infinite loop in general, then you could solve all problems
of the form:
Does there exist a solution to f(n)
for n > t?
by asking the OS whether the following:
repeat
n := n+1
test solution
until solution
is an infinite loop, or just a long loop?
Then our OS could solve many of the world's great mathematical problems.
Many mathematical problems can be phrased as
infinite-loop problems.
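As a sketch of the loop above in C (test_solution is a hypothetical stand-in for "test whether f(n) has a solution"; here it always answers no, so the program genuinely loops forever):

#include <stdio.h>

/* Hypothetical stand-in for "test whether f(n) has a solution".
   Here it always says no, so search() below never terminates. */
static int test_solution(long n)
{
    (void) n;
    return 0;
}

void search(long t)
{
    long n = t;
    for (;;) {
        n = n + 1;
        if (test_solution(n)) {           /* a solution exists for some n > t */
            printf("Solution found at n = %ld\n", n);
            return;
        }
        /* If no solution exists, this loop never terminates.
           An OS that could detect that in general would have
           answered the mathematical question. */
    }
}

int main(void)
{
    search(0);
    return 0;
}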
Note that many "infinite loops" actually terminate with a crash,
because they are using up some resource each time round the loop.
e.g. This program:
void f(int x)
{
    f(x);        /* each call adds a new stack frame */
}

int main(void)
{
    f(1);
    return 0;
}
will eventually crash with a stack overflow.
This program however:
while (1) { }
will run forever, time-sliced.
Interrupts - Keeping the OS in charge
Interrupt idea is a way of periodically keeping the OS in charge
so nothing runs forever without reference to the OS.
e.g. In time-slicing, periodic timer interrupt to give OS a chance to
do something else with the CPU.
Remember, the CPU is just looking for a stream of instructions
to process, whether they come from the "OS" or from "user programs".
When the OS "runs" a program, it points the CPU
towards a stream of instructions in the user program
and basically hands over control.
How do we know the program cannot now control the CPU for ever?
Note: Interrupt does not necessarily mean that OS
immediately attends to
the process that sent it
(in the sense of giving it CPU time).
It just means the OS is now aware that that process wants attention.
The OS will note that its "state" has changed.
Asynchronous programming
An interrupt-driven OS is an example of
Asynchronous programming.
It can be hard to code like this, where functions may be called at any time
rather than in some predictable order.
JavaScript programming is asynchronous for the same reason:
JavaScript I/O (like fetching a resource from a server)
takes a long time compared to CPU time.
So we start the request, and carry on, and we get an "interrupt" when it finishes some time in the future.
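A rough analogy in C, using a POSIX signal (a kind of software interrupt): the handler function is not called from any predictable place in main(); it runs whenever the signal arrives, and in the meantime the program carries on with other work.

#include <signal.h>
#include <stdio.h>
#include <string.h>
#include <unistd.h>

volatile sig_atomic_t got_alarm = 0;

/* Called asynchronously, at an unpredictable point in the program,
   when the timer signal arrives - much like an interrupt handler. */
void on_alarm(int sig)
{
    (void) sig;
    got_alarm = 1;
}

int main(void)
{
    struct sigaction sa;
    memset(&sa, 0, sizeof sa);
    sa.sa_handler = on_alarm;
    sigaction(SIGALRM, &sa, NULL);        /* install the handler */

    alarm(2);                             /* ask for a SIGALRM in 2 seconds */

    while (!got_alarm) {                  /* meanwhile, carry on with other work */
        /* ... other work ... */
    }
    printf("Alarm arrived.\n");
    return 0;
}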
There must be a concept of an "OS-type program"
(which can do anything, including scheduling ordinary programs)
and an "ordinary program"
(which is restricted in what it can do on the machine).
The restrictions on the "ordinary program" are normally not a problem
- they are basically just:
"Other programs have to be able to run at the same time as you".
This obviously makes sense in a multi-user system,
but in fact it makes sense on a PC as well.
When you're writing a program and you make an error,
you don't want your crashed program to be able to crash the whole OS.
You still want to be able to turn to the OS
(which should still be responsive)
and say "terminate that program".
Code is executed either in
kernel/system/monitor mode
or in
user mode.
Boot in system mode, load the OS.
When a program is run, switch to user mode.
When an interrupt occurs, switch to system mode
and jump to OS code.
When the program resumes, switch back to user mode
and return to the next instruction in the user code.
Privileged instructions can only be executed in system mode.
e.g. Perhaps any I/O at all
- the user program can't do I/O itself; it has to ask the OS to do I/O for it
(the OS will co-ordinate it with the I/O of other processes).
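A minimal C sketch of what "asking the OS" looks like: the program never touches the output device itself; write() is a system call that traps into the kernel, which performs the I/O and then returns control to the program in user mode.

#include <unistd.h>

int main(void)
{
    const char msg[] = "Hello from user mode\n";
    /* write() is a system call: it traps into the kernel (system mode);
       the kernel performs the output on the program's behalf and then
       returns control to the program, back in user mode. */
    write(STDOUT_FILENO, msg, sizeof msg - 1);
    return 0;
}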
Superuser
A related but different issue is switching between "ordinary" users and users
with administration rights.
On Linux, if you have admin (root/superuser) access to your own Linux, see these commands:
su
- "substitute user" - change to another user (e.g. root)
sudo
- "substitute user do" - run command as other user (typically root)
See
sudo apt install
to install software
on your own Linux.
Consider the different types of code, OS or applications:
Parts of OS that
are not scheduled themselves.
Are always in memory.
Operate in kernel/system mode.
This part of the OS is called the kernel.
Includes:
memory management
process scheduling
device I/O
Parts of OS that can be scheduled.
Come in and out of memory.
Operate in user mode.
Includes:
command-line
GUI
OS utilities
Parts of OS that could be in either of the above:
file system
network interface
Applications. All scheduled.
Come in and out of memory.
Operate in user mode.
Running Linux
inside Windows
with
VMware.
I have used VMware in DCU labs
to teach
system administration of an imaginary multi-machine network
all running on one machine.
Java is "compiled" to
a
"virtual" instruction set
called
Java bytecodes.
These run on a JVM,
where they are actually mapped to native hardware instructions.
Promises of Java and the JVM:
Portability. Write an application once, run everywhere.
Run code from the Internet on the client side.
Even though it comes from a site with different hardware, it will run on your hardware.
Java applets.
Client-side code on the Web.
Written in Java, compiled bytecodes transmitted.
Played an important role in the history of client-side code.
But now obsolete.
Mostly replaced by what language?
OSes are written mainly in a HLL,
with some Assembly in the kernel.
Assembly is fast but hardware-specific.
A HLL is much more portable, and much easier to write/debug/change/maintain.
UNIX
/ Linux
Mainly written in C.
Some parts in Assembly.
Windows
Mainly written in C, C++.
Some parts in Assembly.
Perhaps only 5 % of the code is performance-critical.
Things executed constantly - the interrupt handler, the short-term CPU scheduler.
The other 95 % can be in a HLL for portability.
The 5 % can be machine-specific - it needs a rewrite for each new piece of hardware.
Best to find out what is important to performance
rather than pre-judge.
(In which loops does the system really spend its time?
You might be surprised.)
In practice, C/C++ is often faster than Assembly. Why?
Because the C/C++ compiler is probably a better optimiser of code than your hand-coded Assembly.
So often stick to a HLL like C/C++ even for performance-critical code.
The compiler often has an
optimise
switch (e.g. gcc -O2)
that takes extra time analysing the HLL code to
generate faster Assembly.
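A sketch of trying this with gcc (the exact output depends on your compiler and its version):

/* sum.c - compare the Assembly the compiler generates:
       gcc -S sum.c         straightforward, slower code
       gcc -S -O2 sum.c     the compiler may unroll or vectorise the loop,
                            or even replace it with a closed-form calculation */
long sum(long n)
{
    long total = 0;
    for (long i = 0; i < n; i++)
        total += i;
    return total;
}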
Part of the Linux OS source. Written in C.
This is part of
init/main.c
in Linus Torvalds'
Linux source.