MIT 6.828 Lab Notes (1)

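When I took the operating systems course in my second year, the instructor introduced us to the MIT 6.828 labs. Because of how the course was scheduled, we did not complete many of them; as far as I remember we only read the xv6-related material to help us understand operating systems, so by now I have almost forgotten which labs we actually did. With a light course load in my third year, I want to work through all 7 labs properly on my own.

## Part 0: 6.828 Build Environment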

Virtual machine: Ubuntu 18.04 (64-bit)
Emulator (QEMU): git clone https://github.com/mit-pdos/6.828-qemu.git qemu
Lab code: git clone https://pdos.csail.mit.edu/6.828/2018/jos.git lab

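A 32-bit virtual machine is the simplest environment, because JOS itself is a 32-bit operating system; a 64-bit host such as the one listed above also works as long as the toolchain can build 32-bit code. The emulator is the version patched by MIT (see the link above). As I understand it, the paging behavior used in the labs was modified on purpose, and with the patched version you do not need to translate addresses by hand in later exercises. As for the lab code, familiarity with Git and Makefiles is assumed. After finishing each exercise you can test it with make grade. ./configure may fail because of missing libraries; install whatever the error message reports as missing (searching for the message usually helps). See the Tools Guide for the full setup procedure.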

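About the configure command given in the Tools Guide: if the [--prefix=PFX] argument is not specified, QEMU is installed to the default location (/usr/local/share/qemu in my case). Modifying that directory requires administrator privileges, so install with sudo make install.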

During make install you may run into the following link error:

/usr/bin/ld: qga/commands-posix.o: in function `dev_major_minor':
/qemu/qga/commands-posix.c:633: undefined reference to `major'
/usr/bin/ld: /qemu/qga/commands-posix.c:634: undefined reference to `minor'

The fix is to add #include <sys/sysmacros.h> to the includes at the top of /qemu/qga/commands-posix.c.

Part 1: PC Bootstrap

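If you are not already familiar with x86 assembly language, you will quickly become familiar with it during this course! The PC Assembly Language Book is an excellent place to start. Hopefully, the book contains a mixture of new and old material for you.

Warning: Unfortunately the examples in the book are written for the NASM assembler, whereas we will be using the GNU assembler. NASM uses the so-called Intel syntax while GNU uses the AT&T syntax. While semantically equivalent, an assembly file will differ quite a lot, at least superficially, depending on which syntax is used. Luckily the conversion between the two is pretty simple, and is covered in Brennan's Guide to Inline Assembly.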

Exercise 1. Familiarize yourself with the assembly language materials available on the 6.828 reference page. You don't have to read them now, but you'll almost certainly want to refer to some of this material when reading and writing x86 assembly. We do recommend reading the section "The Syntax" in Brennan's Guide to Inline Assembly. It gives a good (and quite brief) description of the AT&T assembly syntax we'll be using with the GNU assembler in JOS.

Simulating the x86

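Instead of developing the operating system on a real, physical personal computer (PC), we use a program that faithfully emulates a complete PC: the code you write for the emulator will boot on a real PC too. Using an emulator simplifies debugging; you can, for example, set breakpoints inside the emulated x86, which is difficult to do with the silicon version of an x86. In 6.828 we will use the QEMU Emulator, a modern and relatively fast emulator. While QEMU's built-in monitor provides only limited debugging support, QEMU can act as a remote debugging target for the GNU debugger (GDB), which we'll use in this lab to step through the early boot process.

Now we can build JOS and try running it under QEMU. Go into the lab directory cloned earlier and run make; you should see output like this: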

tommygong@TommyGong:~/lab$ make
+ as kern/entry.S
+ cc kern/entrypgdir.c
+ cc kern/init.c
+ cc kern/console.c
+ cc kern/monitor.c
+ cc kern/printf.c
+ cc kern/kdebug.c
+ cc lib/printfmt.c
+ cc lib/readline.c
+ cc lib/string.c
+ ld obj/kern/kernel
ld: warning: section `.bss' type changed to PROGBITS
+ as boot/boot.S
+ cc -Os boot/main.c
+ ld boot/boot
boot block is 397 bytes (max 510)
+ mk obj/kern/kernel.img

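This means the disk image was built successfully. Now run QEMU, supplying obj/kern/kernel.img, created above, as the contents of the emulated PC's "virtual hard disk". This disk image contains both our boot loader (obj/boot/boot) and our kernel (obj/kern/kernel).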

tommygong@TommyGong:~/lab$ make qemu
sed "s/localhost:1234/localhost:26000/" < .gdbinit.tmpl > .gdbinit
qemu-system-i386 -drive file=obj/kern/kernel.img,index=0,media=disk,format=raw -serial mon:stdio -gdb tcp::26000 -D qemu.log 
6828 decimal is XXX octal!
entering test_backtrace 5
entering test_backtrace 4
entering test_backtrace 3
entering test_backtrace 2
entering test_backtrace 1
entering test_backtrace 0
leaving test_backtrace 0
leaving test_backtrace 1
leaving test_backtrace 2
leaving test_backtrace 3
leaving test_backtrace 4
leaving test_backtrace 5
Welcome to the JOS kernel monitor!
Type 'help' for a list of commands.
K>

To exit QEMU, type Ctrl+a x.

The PC's Physical Address Space

A PC's physical address space is hard-wired to have the following general layout:

+------------------+  <- 0xFFFFFFFF (4GB)
|      32-bit      |
|  memory mapped   |
|     devices      |
|                  |
/\/\/\/\/\/\/\/\/\/\
/\/\/\/\/\/\/\/\/\/\
|                  |
|      Unused      |
|                  |
+------------------+  <- depends on amount of RAM
|                  |
|                  |
| Extended Memory  |
|                  |
|                  |
+------------------+  <- 0x00100000 (1MB)
|     BIOS ROM     |
+------------------+  <- 0x000F0000 (960KB)
|  16-bit devices, |
|  expansion ROMs  |
+------------------+  <- 0x000C0000 (768KB)
|   VGA Display    |
+------------------+  <- 0x000A0000 (640KB)
|                  |
|    Low Memory    |
|                  |
+------------------+  <- 0x00000000

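The first PCs, which were based on the 16-bit Intel 8088 processor, were only capable of addressing 1MB of physical memory. The physical address space of an early PC would therefore start at 0x00000000 but end at 0x000FFFFF instead of 0xFFFFFFFF. The 640KB area marked "Low Memory" was the only random-access memory (RAM) that an early PC could use; in fact the very earliest PCs could only be configured with 16KB, 32KB, or 64KB of RAM!

The 384KB area from 0x000A0000 through 0x000FFFFF was reserved by the hardware for special uses such as video display buffers and firmware held in non-volatile memory. The most important part of this reserved area is the Basic Input/Output System (BIOS), which occupies the 64KB region from 0x000F0000 through 0x000FFFFF. In early PCs the BIOS was held in true read-only memory (ROM), but current PCs store the BIOS in updateable flash memory. The BIOS is responsible for performing basic system initialization such as activating the video card and checking the amount of memory installed. After performing this initialization, the BIOS loads the operating system from some appropriate location such as a floppy disk, hard disk, CD-ROM, or the network, and passes control of the machine to the operating system.

When Intel finally "broke the one megabyte barrier" with the 80286 and 80386 processors, which supported 16MB and 4GB physical address spaces respectively, the PC architects nevertheless preserved the original layout for the low 1MB of physical address space in order to ensure backward compatibility with existing software. Modern PCs therefore have a "hole" in physical memory from 0x000A0000 to 0x00100000, dividing RAM into "low" or "conventional memory" (the first 640KB) and "extended memory" (everything else). In addition, some space at the very top of the PC's 32-bit physical address space, above all physical RAM, is now commonly reserved by the BIOS for use by 32-bit PCI devices.

Recent x86 processors can support more than 4GB of physical RAM, so RAM can extend further above 0xFFFFFFFF. In this case the BIOS must arrange to leave a second hole in the system's RAM at the top of the 32-bit addressable region, to leave room for these 32-bit devices to be mapped. Because of design limitations JOS will use only the first 256MB of a PC's physical memory anyway, so for now we will pretend that all PCs have "only" a 32-bit physical address space. But dealing with complicated physical address spaces and other aspects of hardware organization that evolved over many years is one of the important practical challenges of OS development.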
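To make the layout concrete, here is a minimal sketch that classifies a physical address by the region boundaries in the diagram above. The function name pc_region and the returned labels are my own illustration and are not part of JOS.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Region boundaries copied from the layout diagram. */
#define LOW_MEM_END 0x000A0000u /* 640KB: end of low memory              */
#define VGA_END     0x000C0000u /* 768KB: end of the VGA display area    */
#define DEV_ROM_END 0x000F0000u /* 960KB: end of devices, expansion ROMs */
#define BIOS_END    0x00100000u /* 1MB:   end of the BIOS ROM            */

/* pc_region is a hypothetical helper: it names the region that a
 * physical address falls into, checking boundaries from low to high. */
const char *pc_region(uint32_t pa)
{
    if (pa < LOW_MEM_END) return "low memory";
    if (pa < VGA_END)     return "VGA display";
    if (pa < DEV_ROM_END) return "16-bit devices / expansion ROMs";
    if (pa < BIOS_END)    return "BIOS ROM";
    return "extended memory or above";
}
```

For example, pc_region(0xffff0) names the BIOS ROM region, which is where the first instruction executed after reset lives, and pc_region(0x100000) falls in extended memory, where the kernel is loaded later in this lab.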

The ROM BIOS

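Open two terminal windows and cd both shells into your lab directory. In one, enter make qemu-gdb. This starts up QEMU, but QEMU stops just before the processor executes the first instruction and waits for a debugging connection from GDB. In the second terminal, from the same directory you ran make, run make gdb. You should then see something like this: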

tommygong@TommyGong:~/lab$  make gdb
gdb -n -x .gdbinit
GNU gdb (Ubuntu 8.1.1-0ubuntu1) 8.1.1
Copyright (C) 2018 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.  Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64-linux-gnu".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<http://www.gnu.org/software/gdb/bugs/>.
Find the GDB manual and other documentation resources online at:
<http://www.gnu.org/software/gdb/documentation/>.
For help, type "help".
Type "apropos word" to search for commands related to "word".
+ target remote localhost:26000
warning: No executable has been specified and target does not support
determining executable automatically.  Try using the "file" command.
warning: A handler for the OS ABI "GNU/Linux" is not built into this configuration
of GDB.  Attempting to continue with the default i8086 settings.

The target architecture is assumed to be i8086
[f000:fff0]    0xffff0:	ljmp   $0xf000,$0xe05b
0x0000fff0 in ?? ()
+ symbol-file obj/kern/kernel

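[f000:fff0] 0xffff0: ljmp $0xf000,$0xe05b is GDB's disassembly of the first instruction QEMU executes.

The IBM PC starts executing at physical address 0x000ffff0, which is at the very top of the 64KB area reserved for the ROM BIOS.
The PC starts executing with CS (Code Segment) = 0xf000 and IP (Instruction Pointer) = 0xfff0.
The first instruction to be executed is a jmp instruction, which jumps to the segmented address CS = 0xf000 and IP = 0xe05b.

The instruction 0xffff0: ljmp $0xf000,$0xe05b is a far jump (ljmp, "long jump"), which transfers control to a specific segment and offset. 0xffff0 is the address of the instruction itself in memory. The two operands are:

Segment $0xf000: the value loaded into the CS register. In real mode the base address of the segment is this value multiplied by 16.
Offset $0xe05b: the offset relative to the segment base.

In real mode, which is the mode the x86 runs in before protected mode is enabled, memory is addressed through a segment and offset pair:

physical address = segment × 16 + offset

After executing this instruction, the CPU therefore continues at the physical address formed by segment 0xf000 and offset 0xe05b:

physical address = 0xf000 × 16 + 0xe05b
physical address = 0xf0000 + 0xe05b = 0xfe05b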
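The same arithmetic can be written as a small C function. This is only a sketch of the real-mode address calculation described above; real_mode_phys is a hypothetical name.

```c
#include <assert.h>
#include <stdint.h>

/* real_mode_phys is a hypothetical helper: it combines a 16-bit segment
 * and a 16-bit offset the way real-mode addressing does. Shifting left
 * by 4 bits is the same as multiplying by 16. */
uint32_t real_mode_phys(uint16_t segment, uint16_t offset)
{
    return ((uint32_t)segment << 4) + offset;
}
```

With the values from the GDB session, real_mode_phys(0xf000, 0xfff0) gives 0xffff0 (the reset address) and real_mode_phys(0xf000, 0xe05b) gives 0xfe05b (the jump target).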

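An instruction like this is typical of system startup (the BIOS boot phase). When the CPU powers on or resets, it begins executing at address 0xFFFF0. Only 16 bytes separate that address from the end of the BIOS region at 0x100000, so it normally holds a jump that takes the CPU to the BIOS's actual startup code.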

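Real Mode

Real mode is the default operating mode of an x86 processor after power-on or reset. It dates back to the 8086, and modern processors still support it for backward compatibility.

Memory addressing: the processor can access only 1MB of memory, because real-mode addresses are 20 bits wide (a 16-bit segment shifted left by 4 bits plus a 16-bit offset).
Segment registers: memory is addressed through segmentation, and an address is computed from a segment register and an offset: physical address = segment × 16 + offset.
No memory protection: in real mode a program can access any memory address directly, so programs can overwrite each other's memory and crash the system.
Multitasking: there is no built-in hardware support for multitasking, so the processor cannot effectively manage several programs running side by side.
Use: real mode was mainly used by early operating systems (such as DOS) and by some simple embedded systems.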

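Protected Mode

Protected mode is the main operating mode of 32-bit x86 processors. It was first introduced with the 80286 and was greatly improved in the 80386 and later processors.

Memory addressing: from the 80386 on, addresses are 32 bits wide, so up to 4GB of memory can be addressed. More sophisticated memory management mechanisms, such as paging and virtual memory, are also supported.
Segment management: a segment register no longer simply supplies a segment base. Instead it selects an entry in the Global Descriptor Table (GDT) or a Local Descriptor Table (LDT), and each segment has its own permissions, size, and other attributes.
Memory protection: each program runs in its own address space, and the processor can detect illegal memory accesses. Through segmentation and paging, the operating system can keep programs from interfering with one another, which improves stability and security.
Multitasking: the hardware supports multitasking, and the processor can switch tasks through the Task State Segment (TSS). Memory protection keeps each task inside its own address space.
Virtual memory: protected mode supports virtual memory. Paging maps virtual addresses to physical addresses, which lets a program use an address space larger than the physical memory actually installed.
Use: modern operating systems (Windows, Linux, macOS) run in protected mode or, on 64-bit processors, in its successor, long mode.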

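Real Mode vs. Protected Mode

Feature	Real mode	Protected mode
Memory addressing	Up to 1MB	Up to 4GB (paging supported)
Segment registers	Simple segment + offset	Tied to the GDT/LDT, with permissions
Memory protection	None	Yes, prevents processes from interfering
Multitasking	Not supported	Supported in hardware
Virtual memory	Not supported	Supported (through paging)
Typical use	Early operating systems, embedded systems	Modern operating systems and applications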

Exercise 2. Use GDB's si (Step Instruction) command to trace into the ROM BIOS for a few more instructions, and try to guess what it might be doing. You might want to look at Phil Storrs I/O Ports Description, as well as other materials on the 6.828 reference materials page. No need to figure out all the details - just the general idea of what the BIOS is doing first.

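When the BIOS runs, it sets up an interrupt descriptor table and initializes various devices such as the VGA display. This is where the "Starting SeaBIOS" message you see in the QEMU window comes from. After initializing the PCI bus and all the important devices the BIOS knows about, it searches for a bootable device such as a floppy, hard drive, or CD-ROM. Eventually, when it finds a bootable disk, the BIOS reads the boot loader from the disk and transfers control to it.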

Part 2: The Boot Loader

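Floppy and hard disks for PCs are divided into 512-byte regions called sectors. A sector is the disk's minimum transfer granularity: each read or write operation must be one or more sectors in size and aligned on a sector boundary. If the disk is bootable, the first sector is called the boot sector, since this is where the boot loader code resides. When the BIOS finds a bootable floppy or hard disk, it loads the 512-byte boot sector into memory at physical addresses 0x7c00 through 0x7dff, and then uses a jmp instruction to set the CS:IP to 0000:7c00, passing control to the boot loader. Like the BIOS load address, these addresses are fairly arbitrary - but they are fixed and standardized for PCs.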
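Because every transfer must cover whole, aligned sectors, a request for an arbitrary byte range has to be widened to sector boundaries. The sketch below shows that arithmetic with the 512-byte sector size from the text; first_sector and sectors_spanned are hypothetical helper names, not functions from boot/main.c.

```c
#include <assert.h>
#include <stdint.h>

#define SECTSIZE 512u /* sector size stated in the text */

/* Index of the sector that contains the given byte offset on disk. */
uint32_t first_sector(uint32_t byte_offset)
{
    return byte_offset / SECTSIZE;
}

/* Number of whole sectors that must be transferred to cover the byte
 * range [byte_offset, byte_offset + count). */
uint32_t sectors_spanned(uint32_t byte_offset, uint32_t count)
{
    assert(count > 0);
    uint32_t first = byte_offset / SECTSIZE;
    uint32_t last = (byte_offset + count - 1) / SECTSIZE;
    return last - first + 1;
}
```

A 4-byte read that starts at byte 510 straddles a sector boundary, so sectors_spanned(510, 4) is 2 even though only 4 bytes are wanted.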

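The ability to boot from a CD-ROM came much later during the evolution of the PC, and as a result the PC architects took the opportunity to rethink the boot process slightly. The way a modern BIOS boots from a CD-ROM is therefore a bit more complicated (and more powerful). CD-ROMs use a sector size of 2048 bytes instead of 512, and the BIOS can load a much larger boot image from the disk into memory (not just one sector) before transferring control to it. For more information, see the "El Torito" Bootable CD-ROM Format Specification.

For 6.828, however, we will use the conventional hard drive boot mechanism, which means that our boot loader must fit into a measly 512 bytes. The boot loader consists of one assembly language source file, boot/boot.S, and one C source file, boot/main.c. Look through these source files carefully and make sure you understand what's going on. The boot loader must perform two main functions:

First, the boot loader switches the processor from real mode to 32-bit protected mode, because it is only in this mode that software can access all the memory above 1MB in the processor's physical address space. Protected mode is described briefly in sections 1.2.7 and 1.2.8 of PC Assembly Language, and in great detail in the Intel architecture manuals. At this point you only have to understand that translation of segmented addresses (segment:offset pairs) into physical addresses happens differently in protected mode, and that after the transition offsets are 32 bits instead of 16.
Second, the boot loader reads the kernel from the hard disk by directly accessing the IDE disk device registers via the x86's special I/O instructions. If you would like to understand better what the particular I/O instructions here mean, check out the "IDE hard drive controller" section on the 6.828 reference page. You will not need to learn much about programming specific devices in this class: writing device drivers is in practice a very important part of OS development, but from a conceptual or architectural viewpoint it is also one of the least interesting.

After you understand the boot loader source code, look at the file obj/boot/boot.asm. This file is a disassembly of the boot loader that our GNUmakefile creates after compiling the boot loader. The disassembly makes it easy to see exactly where in physical memory all of the boot loader's code resides, and makes it easier to track what is happening while stepping through the boot loader in GDB. Likewise, obj/kern/kernel.asm contains a disassembly of the JOS kernel, which can often be useful for debugging.

You can set address breakpoints in GDB with the b command. For example, b *0x7c00 sets a breakpoint at address 0x7C00. Once at a breakpoint, you can continue execution using the c and si commands: c causes QEMU to continue execution until the next breakpoint (or until you press Ctrl-C in GDB), and si N steps through N instructions at a time.

To examine instructions in memory (besides the immediate next one to be executed, which GDB prints automatically), use the x/i command. The syntax is x/Ni ADDR, where N is the number of consecutive instructions to disassemble and ADDR is the memory address at which to start disassembling.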

Exercise 3. Take a look at the lab tools guide, especially the section on GDB commands. Even if you're familiar with GDB, this includes some esoteric GDB commands that are useful for OS work. Set a breakpoint at address 0x7c00, which is where the boot sector will be loaded. Continue execution until that breakpoint. Trace through the code in boot/boot.S, using the source code and the disassembly file obj/boot/boot.asm to keep track of where you are. Also use the x/i command in GDB to disassemble sequences of instructions in the boot loader, and compare the original boot loader source code with both the disassembly in obj/boot/boot.asm and GDB. Trace into bootmain() in boot/main.c, and then into readsect(). Identify the exact assembly instructions that correspond to each of the statements in readsect(). Trace through the rest of readsect() and back out into bootmain(), and identify the begin and end of the for loop that reads the remaining sectors of the kernel from the disk. Find out what code will run when the loop is finished, set a breakpoint there, and continue to that breakpoint. Then step through the remainder of the boot loader.

Questions:

At what point does the processor start executing 32-bit code? What exactly causes the switch from 16- to 32-bit mode?

# Switch from real to protected mode, using a bootstrap GDT
# and segment translation that makes virtual addresses 
# identical to their physical addresses, so that the 
# effective memory map does not change during the switch.
lgdt    gdtdesc
  7c1e:	0f 01 16             	lgdtl  (%esi)
  7c21:	64 7c 0f             	fs jl  7c33 <protcseg+0x1>
movl    %cr0, %eax
  7c24:	20 c0                	and    %al,%al
orl     $CR0_PE_ON, %eax
  7c26:	66 83 c8 01          	or     $0x1,%ax
movl    %eax, %cr0
  7c2a:	0f 22 c0             	mov    %eax,%cr0

# Jump to next instruction, but in 32-bit code segment.
# Switches processor into 32-bit mode.
ljmp    $PROT_MODE_CSEG, $protcseg
  7c2d:	ea                   	.byte 0xea
  7c2e:	32 7c 08 00          	xor    0x0(%eax,%ecx,1),%bh

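This is where the boot loader switches from real mode to protected mode, which makes the larger address space accessible. lgdt loads the bootstrap GDT, and movl %eax, %cr0 sets the PE bit in CR0, which enables protected mode. The processor does not start executing 32-bit code until the following ljmp $PROT_MODE_CSEG, $protcseg: that far jump reloads CS with the 32-bit code segment from the GDT, so the first instruction executed as 32-bit code is the one at protcseg (0x7c32).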
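The two ingredients of the switch can be modeled in a few lines of C. This is a sketch, not code from boot.S: CR0_PE mirrors the $0x1 that the listing ORs into %eax, and the base of 0 passed to linear_address is my reading of the comment that the bootstrap GDT makes virtual addresses identical to physical ones. Both function names are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

#define CR0_PE 0x00000001u /* protection-enable bit, the $0x1 in the listing */

/* Value to write back to CR0 to turn protected mode on. */
uint32_t enable_protected_mode(uint32_t cr0)
{
    return cr0 | CR0_PE;
}

/* Protected-mode segment translation: the segment's base address comes
 * from its GDT descriptor and is simply added to the 32-bit offset. */
uint32_t linear_address(uint32_t segment_base, uint32_t offset)
{
    return segment_base + offset;
}
```

With a segment base of 0, linear_address(0, 0x7c32) is 0x7c32, which is why the boot loader can keep running at the same addresses immediately after the far jump.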

What is the last instruction of the boot loader executed, and what is the first instruction of the kernel it just loaded?
Where is the first instruction of the kernel?
How does the boot loader decide how many sectors it must read in order to fetch the entire kernel from disk? Where does it find this information?

Loading the Kernel

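We will now look in further detail at the C language portion of the boot loader, in boot/main.c. But before doing so, this is a good time to stop and review some of the basics of C programming.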

Exercise 4. Read about programming with pointers in C. The best reference for the C language is The C Programming Language by Brian Kernighan and Dennis Ritchie (known as 'K&R'). We recommend that students purchase this book (here is an Amazon Link) or find one of MIT's 7 copies. Read 5.1 (Pointers and Addresses) through 5.5 (Character Pointers and Functions) in K&R. Then download the code for pointers.c, run it, and make sure you understand where all of the printed values come from. In particular, make sure you understand where the pointer addresses in printed lines 1 and 6 come from, how all the values in printed lines 2 through 4 get there, and why the values printed in line 5 are seemingly corrupted. There are other references on pointers in C (e.g., A tutorial by Ted Jensen that cites K&R heavily), though not as strongly recommended. Warning: Unless you are already thoroughly versed in C, do not skip or even skim this reading exercise. If you do not really understand pointers in C, you will suffer untold pain and misery in subsequent labs, and then eventually come to understand them the hard way. Trust us; you don't want to find out what "the hard way" is.

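To make sense of boot/main.c you'll need to know what an ELF binary is. When you compile and link a C program such as the JOS kernel, the compiler transforms each C source ('.c') file into an object ('.o') file containing assembly language instructions encoded in the binary format expected by the hardware. The linker then combines all of the compiled object files into a single binary image such as obj/kern/kernel, which in this case is a binary in the ELF format, which stands for "Executable and Linkable Format".

Full information about this format is available in the ELF specification on our reference page, but you will not need to delve very deeply into the details of this format in this class. Although as a whole the format is quite powerful and complex, most of the complex parts are for supporting dynamic loading of shared libraries, which we will not do in this class. The Wikipedia page has a short description.

For purposes of 6.828, you can consider an ELF executable to be a header with loading information, followed by several program sections, each of which is a contiguous chunk of code or data intended to be loaded into memory at a specified address. The boot loader does not modify the code or data; it loads it into memory and starts executing it.

An ELF binary starts with a fixed-length ELF header, followed by a variable-length program header listing each of the program sections to be loaded. The C definitions for these ELF headers are in inc/elf.h. The program sections we're interested in are:

.text: The program's executable instructions.
.rodata: Read-only data, such as ASCII string constants produced by the C compiler. (We will not bother setting up the hardware to prohibit writing, however.)
.data: The data section holds the program's initialized data, such as global variables declared with initializers like int x = 5;.

When the linker computes the memory layout of a program, it reserves space for uninitialized global variables, such as int x;, in a section called .bss that immediately follows .data in memory. C requires that "uninitialized" global variables start with a value of zero. Thus there is no need to store contents for .bss in the ELF binary; instead, the linker records just the address and size of the .bss section. The loader or the program itself must arrange to zero the .bss section.

Examine the full list of the names, sizes, and link addresses of all the sections in the kernel executable by typing: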

tommygong@TommyGong:~/lab$ objdump -h obj/kern/kernel

obj/kern/kernel:     file format elf32-i386

Sections:
Idx Name          Size      VMA       LMA       File off  Algn
  0 .text         00001925  f0100000  00100000  00001000  2**4
                  CONTENTS, ALLOC, LOAD, READONLY, CODE
  1 .rodata       00000704  f0101940  00101940  00002940  2**5
                  CONTENTS, ALLOC, LOAD, READONLY, DATA
  2 .stab         00003a15  f0102044  00102044  00003044  2**2
                  CONTENTS, ALLOC, LOAD, READONLY, DATA
  3 .stabstr      00001989  f0105a59  00105a59  00006a59  2**0
                  CONTENTS, ALLOC, LOAD, READONLY, DATA
  4 .data         0000a300  f0108000  00108000  00009000  2**12
                  CONTENTS, ALLOC, LOAD, DATA
  5 .bss          00000648  f0112300  00112300  00013300  2**5
                  CONTENTS, ALLOC, LOAD, DATA
  6 .comment      0000002a  00000000  00000000  00013948  2**0
                  CONTENTS, READONLY

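The listing shows more sections than the three described above. Most of the others hold debugging information, which is typically included in the program's executable file but not loaded into memory by the program loader.

Take particular note of the "VMA" (or link address) and the "LMA" (or load address) of the .text section. The load address of a section is the memory address at which that section should be loaded into memory.

The link address of a section is the memory address from which the section expects to execute. The linker encodes the link address in the binary in various ways, such as when the code needs the address of a global variable, with the result that a binary usually won't work if it is executing from an address that it is not linked for. (It is possible to generate position-independent code that does not contain any such absolute addresses. This is used extensively by modern shared libraries, but it has performance and complexity costs, so we won't be using it in 6.828.)

Typically, the link and load addresses are the same. For example, look at the .text section of the boot loader: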

tommygong@TommyGong:~/lab$ objdump -h obj/boot/boot.out

obj/boot/boot.out:     file format elf32-i386

Sections:
Idx Name          Size      VMA       LMA       File off  Algn
  0 .text         0000018d  00007c00  00007c00  00000074  2**2
                  CONTENTS, ALLOC, LOAD, CODE
  1 .stab         0000084c  00000000  00000000  00000204  2**2
                  CONTENTS, READONLY, DEBUGGING
  2 .stabstr      00000862  00000000  00000000  00000a50  2**0
                  CONTENTS, READONLY, DEBUGGING
  3 .comment      0000002a  00000000  00000000  000012b2  2**0
                  CONTENTS, READONLY

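The boot loader uses the ELF program headers to decide how to load the sections. The program headers specify which parts of the ELF object to load into memory and the destination address each should occupy. You can inspect the program headers by typing: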

tommygong@TommyGong:~/lab$ objdump -x obj/kern/kernel

obj/kern/kernel:     file format elf32-i386
obj/kern/kernel
architecture: i386, flags 0x00000112:
EXEC_P, HAS_SYMS, D_PAGED
start address 0x0010000c

Program Header:
    LOAD off    0x00001000 vaddr 0xf0100000 paddr 0x00100000 align 2**12
         filesz 0x000073e2 memsz 0x000073e2 flags r-x
    LOAD off    0x00009000 vaddr 0xf0108000 paddr 0x00108000 align 2**12
         filesz 0x0000a948 memsz 0x0000a948 flags rw-
   STACK off    0x00000000 vaddr 0x00000000 paddr 0x00000000 align 2**4
         filesz 0x00000000 memsz 0x00000000 flags rwx

Sections:
Idx Name          Size      VMA       LMA       File off  Algn
  0 .text         00001925  f0100000  00100000  00001000  2**4
                  CONTENTS, ALLOC, LOAD, READONLY, CODE
  1 .rodata       00000704  f0101940  00101940  00002940  2**5
                  CONTENTS, ALLOC, LOAD, READONLY, DATA
  2 .stab         00003a15  f0102044  00102044  00003044  2**2
                  CONTENTS, ALLOC, LOAD, READONLY, DATA
  3 .stabstr      00001989  f0105a59  00105a59  00006a59  2**0
                  CONTENTS, ALLOC, LOAD, READONLY, DATA
  4 .data         0000a300  f0108000  00108000  00009000  2**12
                  CONTENTS, ALLOC, LOAD, DATA
  5 .bss          00000648  f0112300  00112300  00013300  2**5
                  CONTENTS, ALLOC, LOAD, DATA
  6 .comment      0000002a  00000000  00000000  00013948  2**0
                  CONTENTS, READONLY
SYMBOL TABLE:
f0100000 l    d  .text	00000000 .text
f0101940 l    d  .rodata	00000000 .rodata
f0102044 l    d  .stab	00000000 .stab
f0105a59 l    d  .stabstr	00000000 .stabstr
f0108000 l    d  .data	00000000 .data
f0112300 l    d  .bss	00000000 .bss
00000000 l    d  .comment	00000000 .comment
00000000 l    df *ABS*	00000000 obj/kern/entry.o
f010002f l       .text	00000000 relocated
f010003e l       .text	00000000 spin
00000000 l    df *ABS*	00000000 entrypgdir.c
00000000 l    df *ABS*	00000000 init.c
00000000 l    df *ABS*	00000000 console.c
f01001a0 l     F .text	0000000e delay
f01001ae l     F .text	00000020 serial_proc_data
f01001ce l     F .text	00000041 cons_intr
f0112320 l     O .bss	00000208 cons
f010058a l     F .text	0000010c kbd_proc_data
f0112304 l     O .bss	00000001 serial_exists
f010029a l     F .text	000001f0 cons_putc
f0112310 l     O .bss	00000002 crt_pos
f011230c l     O .bss	00000004 crt_buf
f0112308 l     O .bss	00000004 addr_6845
f0112300 l     O .bss	00000004 shift.1300
f0101a00 l     O .rodata	00000100 shiftcode
f0101b00 l     O .rodata	00000100 togglecode
f0101c00 l     O .rodata	00000010 charcode
f0112000 l     O .data	00000100 normalmap
f0112100 l     O .data	00000100 shiftmap
f0112200 l     O .data	00000100 ctlmap
00000000 l    df *ABS*	00000000 monitor.c
f0101df4 l     O .rodata	00000018 commands
00000000 l    df *ABS*	00000000 printf.c
f010094d l     F .text	00000013 putch
00000000 l    df *ABS*	00000000 kdebug.c
f0100960 l     F .text	00000101 stab_binsearch
00000000 l    df *ABS*	00000000 printfmt.c
f0100c40 l     F .text	000000e8 printnum
f0100d28 l     F .text	0000003a getuint
f0100d62 l     F .text	0000001d sprintputch
f0102018 l     O .rodata	0000001c error_string
00000000 l    df *ABS*	00000000 readline.c
f0112540 l     O .bss	00000400 buf
00000000 l    df *ABS*	00000000 string.c
f010000c g       .text	00000000 entry
f0101339 g     F .text	00000020 strcpy
f010020f g     F .text	00000012 kbd_intr
f01006a0 g     F .text	0000000a mon_backtrace
f0100085 g     F .text	0000005f _panic
f0100141 g     F .text	0000005b i386_init
f01014e0 g     F .text	0000007c memmove
f01011cf g     F .text	00000028 snprintf
f0100d7f g     F .text	000003f8 vprintfmt
f010023c g     F .text	00000043 cons_getc
f0100933 g     F .text	0000001a cprintf
f010155c g     F .text	00000021 memcpy
f0101220 g     F .text	000000d3 readline
f0111000 g     O .data	00001000 entry_pgtable
f01000e4 g     F .text	0000005d test_backtrace
f0101177 g     F .text	00000058 vsnprintf
f0112300 g       .bss	00000000 edata
f010049a g     F .text	000000f0 cons_init
f0105a58 g       .stab	00000000 __STAB_END__
f0105a59 g       .stabstr	00000000 __STABSTR_BEGIN__
f01017f0 g     F .text	00000135 .hidden __umoddi3
f0100221 g     F .text	0000001b serial_intr
f01016c0 g     F .text	00000128 .hidden __udivdi3
f0100290 g     F .text	0000000a iscons
f01015da g     F .text	000000e4 strtol
f0101318 g     F .text	00000021 strnlen
f0101359 g     F .text	0000002c strcat
f0112944 g     O .bss	00000004 panicstr
f0112940 g       .bss	00000000 end
f0100040 g     F .text	00000045 _warn
f0101464 g     F .text	0000001d strfind
f0101925 g       .text	00000000 etext
0010000c g       .text	00000000 _start
f01013b1 g     F .text	00000033 strlcpy
f010140a g     F .text	00000039 strncmp
f0101385 g     F .text	0000002c strncpy
f010157d g     F .text	00000040 memcmp
f010048a g     F .text	00000010 cputchar
f0101481 g     F .text	0000005f memset
f010027f g     F .text	00000011 getchar
f01011f7 g     F .text	00000028 printfmt
f01073e1 g       .stabstr	00000000 __STABSTR_END__
f01013e4 g     F .text	00000026 strcmp
f0100a61 g     F .text	000001d6 debuginfo_eip
f0100900 g     F .text	00000033 vcprintf
f0110000 g       .data	00000000 bootstacktop
f0110000 g     O .data	00001000 entry_pgdir
f0108000 g       .data	00000000 bootstack
f0102044 g       .stab	00000000 __STAB_BEGIN__
f0101300 g     F .text	00000018 strlen
f0101443 g     F .text	00000021 strchr
f01006aa g     F .text	000000ca mon_kerninfo
f01007bd g     F .text	00000143 monitor
f01015bd g     F .text	0000001d memfind
f0100774 g     F .text	00000049 mon_help
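The Program Header entries near the top of this output are what a loader walks to learn how much to read and where to put it. The sketch below does that over an in-memory array of headers. The field names follow inc/elf.h as I remember it, so treat the struct layout as an assumption; load_end_pa and load_file_bytes are hypothetical helpers, and filtering on ELF_PROG_LOAD is a choice made for this sketch, not a claim about what bootmain does.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define ELF_PROG_LOAD 1u /* program header type of a loadable segment */

/* Assumed to mirror struct Proghdr in inc/elf.h. */
struct Proghdr {
    uint32_t p_type, p_offset, p_va, p_pa;
    uint32_t p_filesz, p_memsz, p_flags, p_align;
};

/* Highest physical address (exclusive) used by any loadable segment. */
uint32_t load_end_pa(const struct Proghdr *ph, size_t n)
{
    uint32_t end = 0;
    for (size_t i = 0; i < n; i++) {
        if (ph[i].p_type != ELF_PROG_LOAD)
            continue; /* e.g. the STACK entry carries no data */
        if (ph[i].p_pa + ph[i].p_memsz > end)
            end = ph[i].p_pa + ph[i].p_memsz;
    }
    return end;
}

/* Total number of bytes stored in the file for loadable segments. */
uint32_t load_file_bytes(const struct Proghdr *ph, size_t n)
{
    uint32_t total = 0;
    for (size_t i = 0; i < n; i++)
        if (ph[i].p_type == ELF_PROG_LOAD)
            total += ph[i].p_filesz;
    return total;
}
```

Fed the two LOAD entries from the output (paddr 0x00100000 with memsz 0x73e2, and paddr 0x00108000 with memsz 0xa948), load_end_pa returns 0x112948, which agrees with the end of .bss in the section table (0x00112300 + 0x648).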

Exercise 5. Trace through the first few instructions of the boot loader again and identify the first instruction that would "break" or otherwise do the wrong thing if you were to get the boot loader's link address wrong. Then change the link address in boot/Makefrag to something wrong, run make clean, recompile the lab with make, and trace into the boot loader again to see what happens. Don't forget to change the link address back and make clean again afterward!

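In one terminal, cd into the lab directory and run make qemu-gdb. In a second terminal, run make gdb. Because the BIOS loads the boot loader at 0x7c00, set a breakpoint with b *0x7c00 and then run c; the QEMU terminal shows Booting from hard disk. Running x/30i 0x7c00 then displays assembly that closely matches boot.S.

The BIOS loads the boot sector at address 0x7c00, so that is where the boot loader starts executing. The link address is passed to the linker with -Ttext 0x7C00 in boot/Makefrag, so the linker produces the correct memory addresses in the generated code. Besides the section information, there is one more field in the ELF header that matters to us, named e_entry. This field holds the link address of the entry point in the program: the memory address in the program's text section at which the program should begin executing. In the disassembly, the last thing the boot loader does is an indirect call through address 0x10018, where this field is stored.

At the very end, the boot loader calls the entry point: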

// call the entry point from the ELF header
// note: does not return!
((void (*)(void)) (ELFHDR->e_entry))();

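In boot.asm, this call compiles to:

((void (*)(void)) (ELFHDR->e_entry))();
    7d71:	ff 15 18 00 01 00    	call   *0x10018

The call is indirect: 0x10018 is not the entry point itself but the address of the e_entry field inside the ELF header, which the boot loader reads to 0x10000. The value stored there is 0x0010000c, which matches the start address reported by objdump -f obj/kern/kernel: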

./obj/kern/kernel:     file format elf32-i386
architecture: i386, flags 0x00000112:
EXEC_P, HAS_SYMS, D_PAGED
start address 0x0010000c
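Where the 0x18 in 0x10018 comes from can be checked with offsetof. The struct below reproduces the ELF header layout from inc/elf.h as I remember it, so treat the exact field list as an assumption; ELFHDR_ADDR is the 0x10000 load address of the header, and e_entry_field_addr is a hypothetical helper.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed to mirror struct Elf in inc/elf.h (32-bit ELF header). */
struct Elf {
    uint32_t e_magic;   /* "\x7F" "ELF" */
    uint8_t  e_elf[12];
    uint16_t e_type;
    uint16_t e_machine;
    uint32_t e_version;
    uint32_t e_entry;   /* link address of the entry point */
    uint32_t e_phoff;
    uint32_t e_shoff;
    uint32_t e_flags;
    uint16_t e_ehsize;
    uint16_t e_phentsize;
    uint16_t e_phnum;
    uint16_t e_shentsize;
    uint16_t e_shnum;
    uint16_t e_shstrndx;
};

#define ELFHDR_ADDR 0x10000u /* where the boot loader keeps the ELF header */

/* Address of the e_entry field once the header sits at ELFHDR_ADDR. */
uint32_t e_entry_field_addr(void)
{
    return ELFHDR_ADDR + (uint32_t)offsetof(struct Elf, e_entry);
}
```

The fields before e_entry add up to 4 + 12 + 2 + 2 + 4 = 24 = 0x18 bytes, so e_entry_field_addr() is 0x10018, the operand of the indirect call in boot.asm.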

Exercise 6. We can examine memory using GDB's x command. The GDB manual has full details, but for now, it is enough to know that the command x/Nx ADDR prints N words of memory at ADDR. (Note that both 'x's in the command are lowercase.) Warning: The size of a word is not a universal standard. In GNU assembly, a word is two bytes (the 'w' in xorw, which stands for word, means 2 bytes).

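The answer should be fairly clear: when the BIOS enters the boot loader, the 8 words starting at 0x100000 are all zero, because the kernel has not been loaded into memory yet. The kernel is loaded by the bootmain function, so by the time the boot loader enters the kernel those words hold the beginning of the kernel image. To check in GDB, use x/8x 0x100000 at both points.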

Part 3: The Kernel

Using virtual memory to work around position dependence

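Operating system kernels often like to be linked and run at a very high virtual address, such as 0xf0100000, in order to leave the lower part of the processor's virtual address space for user programs to use. The reason for this arrangement will become clearer in the next lab. Many machines don't have any physical memory at address 0xf0100000, so we can't count on being able to store the kernel there. Instead, we will use the processor's memory management hardware to map virtual address 0xf0100000 (the link address at which the kernel code expects to run) to physical address 0x00100000 (where the boot loader loaded the kernel into physical memory). This way, although the kernel's virtual address is high enough to leave plenty of address space for user processes, it will be loaded in physical memory at the 1MB point in the PC's RAM, just above the BIOS ROM. This approach requires that the PC have at least a few megabytes of physical memory (so that physical address 0x00100000 works), but this is likely to be true of any PC built after about 1990.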

Exercise 7. Use QEMU and GDB to trace into the JOS kernel and stop at the movl %eax, %cr0. Examine memory at 0x00100000 and at 0xf0100000. Now, single step over that instruction using the stepi GDB command. Again, examine memory at 0x00100000 and at 0xf0100000. Make sure you understand what just happened. What is the first instruction after the new mapping is established that would fail to work properly if the mapping weren't in place? Comment out the movl %eax, %cr0 in kern/entry.S, trace into it, and see if you were right.

Before executing movl %eax, %cr0:

(gdb) x/8x 0x100000
0x100000:       0x1badb002      0x00000000      0xe4524ffe      0x7205c766
0x100010:       0x34000004      0x1000b812      0x220f0011      0xc0200fd8
(gdb) x/8x 0xf0100000
0xf0100000 <_start-268435468>:  0x00000000      0x00000000      0x00000000      0x00000000
0xf0100010 <entry+4>:   0x00000000      0x00000000      0x00000000      0x00000000

After:

(gdb) x/8x 0x100000
0x100000:       0x1badb002      0x00000000      0xe4524ffe      0x7205c766
0x100010:       0x34000004      0x1000b812      0x220f0011      0xc0200fd8
(gdb)  x/8x 0xf0100000
0xf0100000 <_start-268435468>:  0x1badb002      0x00000000      0xe4524ffe      0x7205c766
0xf0100010 <entry+4>:   0x34000004      0x1000b812      0x220f0011      0xc0200fd8

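Virtual address 0xf0100000 is now mapped to physical address 0x00100000. Before cr0 was modified, cr3 was loaded with the address 0x118000, so the page directory presumably lives at physical address 0x118000. The rest of the mapping comes from the definition of entry_pgdir: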

pde_t entry_pgdir[NPDENTRIES] = {
	// Map VA's [0, 4MB) to PA's [0, 4MB)
	[0]
		= ((uintptr_t)entry_pgtable - KERNBASE) + PTE_P,
	// Map VA's [KERNBASE, KERNBASE+4MB) to PA's [0, 4MB)
	[KERNBASE>>PDXSHIFT]
		= ((uintptr_t)entry_pgtable - KERNBASE) + PTE_P + PTE_W
};

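With this table in place, a read from 0xf0100000 is translated automatically to the corresponding address in the low 4MB of physical memory (here 0x00100000).

CR3 is the page directory base register: it holds the physical address of the page directory. The page directory is always aligned on a 4KB boundary, so the low 12 bits of its address are always 0. Those bits of CR3 are not part of the address (a couple of them serve as control flags), so writing other values there does not change which page directory is used.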
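The effect of the two entry_pgdir entries can be modeled without any page tables at all. This is a sketch of the resulting mapping only, not of the hardware page walk; KERNBASE and PDXSHIFT use the values from JOS's headers as I remember them (0xF0000000 and 22), and entry_translate is a hypothetical function.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define KERNBASE 0xF0000000u /* assumed value from inc/memlayout.h */
#define PDXSHIFT 22          /* each page directory entry covers 4MB */

/* Model of the mapping installed by entry_pgdir: returns true and stores
 * the physical address if va is mapped, false if the access would fault. */
bool entry_translate(uint32_t va, uint32_t *pa)
{
    uint32_t pdx = va >> PDXSHIFT; /* page directory index */

    if (pdx == 0) {                      /* VA [0, 4MB) -> PA [0, 4MB) */
        *pa = va;
        return true;
    }
    if (pdx == (KERNBASE >> PDXSHIFT)) { /* VA [KERNBASE, +4MB) -> PA [0, 4MB) */
        *pa = va - KERNBASE;
        return true;
    }
    return false;                        /* every other entry is empty */
}
```

Both 0x00100000 and 0xf0100000 translate to physical 0x00100000, which is exactly what the two x/8x dumps above show, while an address such as 0xf0400000 lies outside the mapped 4MB and is not translated.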

With movl %eax, %cr0 commented out in kern/entry.S, paging is never enabled, so the first access to a high address fails with an out-of-bounds RAM or ROM error:

0x0010002a in ?? ()
(gdb) si
=> 0xf010002c <relocated>:      add    %al,(%eax)
relocated () at kern/entry.S:74
74              movl    $0x0,%ebp                       # nuke frame pointer
(gdb) si
Remote connection closed

Execution fails as soon as the processor tries to run the code at 0xf010002c.

Formatted Printing to the Console

Exercise 8. We have omitted a small fragment of code - the code necessary to print octal numbers using patterns of the form "%o". Find and fill in this code fragment.

This only takes copying the code for %u and changing base to 8; nothing complicated.

// (unsigned) octal
case 'o':
	num = getuint(&ap, lflag);
	base = 8;
	goto number;
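What the number label ends up doing with base = 8 is repeated division by the base. The following stand-alone sketch shows that conversion; format_unsigned is a hypothetical function written for illustration, whereas JOS does this work in printnum.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* format_unsigned is a hypothetical helper: it writes num in the given
 * base (2 to 16) into out as a NUL-terminated string and returns the
 * number of digits written. */
size_t format_unsigned(unsigned long long num, unsigned base,
                       char *out, size_t outsz)
{
    static const char digits[] = "0123456789abcdef";
    char tmp[64]; /* room for a 64-bit value in base 2 */
    size_t n = 0;

    assert(base >= 2 && base <= 16);
    do { /* digits come out least significant first */
        tmp[n++] = digits[num % base];
        num /= base;
    } while (num != 0);

    assert(n < outsz);
    for (size_t i = 0; i < n; i++) /* reverse into the output buffer */
        out[i] = tmp[n - 1 - i];
    out[n] = '\0';
    return n;
}
```

Calling format_unsigned(6828, 8, buf, sizeof buf) leaves "15254" in buf, which is the value that replaces XXX in the kernel's "6828 decimal is XXX octal!" message once the %o case is filled in.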

Explain the interface between printf.c and console.c. Specifically, what function does console.c export? How is this function used by printf.c?

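printf.c uses the cputchar function exported by console.c and wraps it in its own putch function. putch is then passed as a function-pointer argument to vprintfmt in printfmt.c, which calls it to output one character at a time to the screen.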

Explain the following from console.c:

// What is the purpose of this?
if (crt_pos >= CRT_SIZE) {
	int i;
  
	memmove(crt_buf, crt_buf + CRT_COLS, (CRT_SIZE - CRT_COLS) * sizeof(uint16_t));
	for (i = CRT_SIZE - CRT_COLS; i < CRT_SIZE; i++)
		crt_buf[i] = 0x0700 | ' ';
	crt_pos -= CRT_COLS;
}

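CRT_ROWS, CRT_COLS: the number of rows and columns of the CRT display, here 25 x 80.
crt_buf is set during initialization to point to the display's memory-mapped buffer.
memmove takes the destination first and the source second, so this call copies rows 2 through n (CRT_SIZE - CRT_COLS cells) up to rows 1 through n-1, discarding the first row. The loop then fills the last row with blanks and crt_pos moves back by one row. In short, when the screen is full, this code scrolls it up by one line.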
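The scrolling logic is easy to try on a small buffer in ordinary memory. The sketch below takes the same steps as the console.c fragment but passes the dimensions as parameters, so a 3 x 2 "screen" can stand in for the real 25 x 80 one; scroll_one_row is a hypothetical name.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define BLANK (0x0700 | ' ') /* attribute 0x07 with a space character */

/* If the cursor has run off the end of the buffer, move every row up by
 * one, blank the last row, and pull the cursor back by one row. */
void scroll_one_row(uint16_t *buf, size_t rows, size_t cols, size_t *pos)
{
    size_t size = rows * cols;

    if (*pos < size)
        return; /* still room on screen, nothing to do */

    /* memmove(dst, src, n): rows 2..n are copied onto rows 1..n-1. */
    memmove(buf, buf + cols, (size - cols) * sizeof(uint16_t));
    for (size_t i = size - cols; i < size; i++)
        buf[i] = BLANK;
    *pos -= cols;
}
```

On a 3 x 2 buffer holding 1 through 6 with the cursor at position 6, one call leaves 3, 4, 5, 6 in the first two rows, blanks in the last row, and the cursor at position 4.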

For the following questions you might wish to consult the notes for Lecture 2. These notes cover GCC's calling convention on the x86.

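Adding the following code to i386_init() in kern/init.c lets you test this directly. The Lab1_exercise8_3 label is there only to make the added code easy to find in the disassembly obj/kern/kernel.asm; it ends up at address 0xf0100080.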

// lab1 Exercise_8
{
    cprintf("Lab1_Exercise_8:\n");
    int x = 1, y = 3, z = 4;
    // 
    Lab1_exercise8_3:
    cprintf("x %d, y %x, z %d\n", x, y, z);

    unsigned int i = 0x00646c72;
    cprintf("H%x Wo%s", 57616, &i);
}
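The second cprintf in this test relies on byte order. The user-space sketch below reproduces it with snprintf and spells out the little-endian layout by hand, so it does not depend on the byte order of the machine it runs on; format_hello is a hypothetical function.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

/* Formats "H%x Wo%s" with the same arguments as the kernel test. */
void format_hello(char *out, size_t outsz)
{
    uint32_t i = 0x00646c72;
    char bytes[4];

    /* Lay out i least significant byte first, as the x86 stores it in
     * memory: 0x72 'r', 0x6c 'l', 0x64 'd', 0x00 (string terminator). */
    for (int k = 0; k < 4; k++)
        bytes[k] = (char)((i >> (8 * k)) & 0xff);

    /* 57616 is 0xe110 in hexadecimal. */
    snprintf(out, outsz, "H%x Wo%s", 57616u, bytes);
}
```

The resulting string is "He110 World": %x prints 57616 as e110, and %s walks the bytes of i in memory order until it reaches the zero byte.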

cprintf (fmt=0xf010478d "x %d, y %x, z %d\n") 
The format string is stored at the address shown above:
(gdb) x/s 0xf010478d
0xf010478d:    "x %d, y %x, z %d\n"

(gdb) si
=> 0xf0102f85 <vcprintf>:    push   %ebp
vcprintf (fmt=0xf010478d "x %d, y %x, z %d\n", ap=0xf0118fc4 "\001")
    at kern/printf.c:18


(gdb) x/16b 0xf0118fc4
0xf0118fc4:    0x01    0x00    0x00    0x00    0x03    0x00    0x00    0x00
0xf0118fcc:    0x04    0x00    0x00    0x00    0x7b    0x47    0x10    0xf0

=> 0xf0100a41 <vcprintf>:       push   %ebp
vcprintf (fmt=0xf0101a97 "6828 decimal is %o octal!\n", ap=0xf010efd4 "\254\032") at kern/printf.c:18
18      {

(gdb) x/s 0xf0101a97
0xf0101a97:     "6828 decimal is %o octal!\n"

The Stack

Exercise 9. Determine where the kernel initializes its stack, and exactly where in memory its stack is located. How does the kernel reserve space for its stack? And at which "end" of this reserved area is the stack pointer initialized to point to?

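The stack is initialized at line 77 of entry.S.
The stack occupies 0xf0108000 through 0xf0110000.
The kernel reserves the space by setting aside 32KB in its data segment (entry.S, line 92).
The stack pointer is initialized to the top of that region, 0xf0110000, and the stack grows downward from there.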

Exercise 10. To become familiar with the C calling conventions on the x86, find the address of the test_backtrace function in obj/kern/kernel.asm, set a breakpoint there, and examine what happens each time it gets called after the kernel starts. How many 32-bit words does each recursive nesting level of test_backtrace push on the stack, and what are those words?

void
test_backtrace(int x)
{
f0100040:	55                   	push   %ebp
f0100041:	89 e5                	mov    %esp,%ebp
f0100043:	56                   	push   %esi
f0100044:	53                   	push   %ebx
f0100045:	e8 91 01 00 00       	call   f01001db <__x86.get_pc_thunk.bx>
f010004a:	81 c3 be 02 01 00    	add    $0x102be,%ebx
f0100050:	8b 75 08             	mov    0x8(%ebp),%esi
	cprintf("entering test_backtrace %d\n", x);
f0100053:	83 ec 08             	sub    $0x8,%esp
f0100056:	56                   	push   %esi
f0100057:	8d 83 38 17 ff ff    	lea    -0xe8c8(%ebx),%eax
f010005d:	50                   	push   %eax
f010005e:	e8 f5 09 00 00       	call   f0100a58 <cprintf>
	if (x > 0)
f0100063:	83 c4 10             	add    $0x10,%esp
f0100066:	85 f6                	test   %esi,%esi
f0100068:	7e 29                	jle    f0100093 <test_backtrace+0x53>
		test_backtrace(x-1);
f010006a:	83 ec 0c             	sub    $0xc,%esp
f010006d:	8d 46 ff             	lea    -0x1(%esi),%eax
f0100070:	50                   	push   %eax
f0100071:	e8 ca ff ff ff       	call   f0100040 <test_backtrace>
f0100076:	83 c4 10             	add    $0x10,%esp
	else
		mon_backtrace(0, 0, 0);
	cprintf("leaving test_backtrace %d\n", x);
f0100079:	83 ec 08             	sub    $0x8,%esp
f010007c:	56                   	push   %esi
f010007d:	8d 83 54 17 ff ff    	lea    -0xe8ac(%ebx),%eax
f0100083:	50                   	push   %eax
f0100084:	e8 cf 09 00 00       	call   f0100a58 <cprintf>
}
f0100089:	83 c4 10             	add    $0x10,%esp
f010008c:	8d 65 f8             	lea    -0x8(%ebp),%esp
f010008f:	5b                   	pop    %ebx
f0100090:	5e                   	pop    %esi
f0100091:	5d                   	pop    %ebp
f0100092:	c3                   	ret    
		mon_backtrace(0, 0, 0);
f0100093:	83 ec 04             	sub    $0x4,%esp
f0100096:	6a 00                	push   $0x0
f0100098:	6a 00                	push   $0x0
f010009a:	6a 00                	push   $0x0
f010009c:	e8 f5 07 00 00       	call   f0100896 <mon_backtrace>
f01000a1:	83 c4 10             	add    $0x10,%esp
f01000a4:	eb d3                	jmp    f0100079 <test_backtrace+0x39>

The listing above is the complete definition of test_backtrace as it appears in obj/kern/kernel.asm.
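Reading the listing, one way to answer the exercise's question is to add up everything that moves %esp between entry to test_backtrace and the recursive call. The sketch below does that arithmetic; the struct and function names are hypothetical, and the byte counts are copied from the instructions in the listing. The sub $0x8 plus two pushes made for cprintf are left out because add $0x10,%esp undoes them before the recursion.

```c
#include <assert.h>
#include <stddef.h>

/* One stack adjustment made between entry to test_backtrace and the
 * recursive call, as read from the disassembly. Names are illustrative. */
struct stack_step {
    const char *what;   /* instruction from the listing */
    unsigned bytes;     /* how far it moves %esp down */
};

static const struct stack_step steps[] = {
    { "push %ebp",            4 },  /* saved frame pointer */
    { "push %esi",            4 },  /* callee-saved register */
    { "push %ebx",            4 },  /* callee-saved register */
    { "sub $0xc,%esp",       12 },  /* three unused words before the argument */
    { "push %eax",            4 },  /* argument x-1 for the next level */
    { "call test_backtrace",  4 },  /* return address pushed by call */
};

/* Total stack usage of one nesting level, in 32-bit words. */
static unsigned words_per_level(const struct stack_step *s, size_t n)
{
    unsigned bytes = 0;
    assert(s != NULL);
    for (size_t i = 0; i < n; i++)
        bytes += s[i].bytes;
    return bytes / 4;
}
```

Under this reading each nesting level takes 0x20 bytes: the return address, the saved ebp, esi and ebx, three unused words, and the argument for the next call.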

Exercise 11. Implement the backtrace function as specified above. Use the same format as in the example, since otherwise the grading script will be confused. When you think you have it working right, run make grade to see if its output conforms to what our grading script expects, and fix it if it doesn't. After you have handed in your Lab 1 code, you are welcome to change the output format of the backtrace function any way you like.

Exercise 12. Modify your stack backtrace function to display, for each eip, the function name, source file name, and line number corresponding to that eip. In debuginfo_eip, where do __STAB_* come from? This question has a long answer; to help you to discover the answer, here are some things you might want to do:

look in the file kern/kernel.ld for __STAB_*

run objdump -h obj/kern/kernel

run objdump -G obj/kern/kernel

run gcc -pipe -nostdinc -O2 -fno-builtin -I. -MD -Wall -Wno-format -DJOS_KERNEL -gstabs -c -S kern/init.c, and look at init.s.

see if the bootloader loads the symbol table in memory as part of loading the kernel binary

Complete the implementation of debuginfo_eip by inserting the call to stab_binsearch to find the line number for an address. Add a backtrace command to the kernel monitor, and extend your implementation of mon_backtrace to call debuginfo_eip and print a line for each stack frame of the form:
K> backtrace
Stack backtrace:
  ebp f010ff78  eip f01008ae  args 00000001 f010ff8c 00000000 f0110580 00000000
         kern/monitor.c:143: monitor+106
  ebp f010ffd8  eip f0100193  args 00000000 00001aac 00000660 00000000 00000000
         kern/init.c:49: i386_init+59
  ebp f010fff8  eip f010003d  args 00000000 00000000 0000ffff 10cf9a00 0000ffff
         kern/entry.S:70: <unknown>+0
K> 
Each line gives the file name and line within that file of the stack frame's eip, followed by the name of the function and the offset of the eip from the first instruction of the function (e.g., monitor+106 means the return eip is 106 bytes past the beginning of monitor). Be sure to print the file and function names on a separate line, to avoid confusing the grading script.

Tip: printf format strings provide an easy, albeit obscure, way to print non-null-terminated strings like those in STABS tables. printf("%.*s", length, string) prints at most length characters of string. Take a look at the printf man page to find out why this works.

You may find that some functions are missing from the backtrace. For example, you will probably see a call to monitor() but not to runcmd(). This is because the compiler in-lines some function calls. Other optimizations may cause you to see unexpected line numbers. If you get rid of the -O2 from GNUMakefile, the backtraces may make more sense (but your kernel will run more slowly).
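The "%.*s" tip can be tried outside the kernel. The sketch below is a user-space illustration using the standard snprintf rather than JOS's cprintf; the helper names stab_name_len and format_symbol are hypothetical. It assumes a STABS-style name in which the part after the colon is type information that should not be printed.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Length of the function name proper: everything before the first ':'
 * (or the whole string if there is no colon). */
static int stab_name_len(const char *s)
{
    const char *colon;
    assert(s != NULL);
    colon = strchr(s, ':');
    return colon ? (int)(colon - s) : (int)strlen(s);
}

/* Write "name+offset" into buf, printing only the first namelen
 * characters of name. "%.*s" takes its precision from the int
 * argument that precedes the string. */
static int format_symbol(char *buf, size_t bufsz,
                         const char *name, int namelen, unsigned offset)
{
    assert(buf != NULL && name != NULL && namelen >= 0);
    return snprintf(buf, bufsz, "%.*s+%u", namelen, name, offset);
}
```

Calling format_symbol with the string "test_backtrace:F(0,1)=(0,1)", the length returned by stab_name_len, and the offset 54 fills the buffer with test_backtrace+54.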

The function to implement is this stub in monitor.c:

int
mon_backtrace(int argc, char **argv, struct Trapframe *tf)
{
	// Your code here.
	return 0;
}

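A first version that prints only ebp, eip, and the five arguments:

int
mon_backtrace(int argc, char **argv, struct Trapframe *tf)
{
    uint32_t *ebp;

    ebp = (uint32_t *)read_ebp();
    cprintf("Stack backtrace:\n");
    while (ebp != 0) {
        uint32_t eip = ebp[1];    // return address saved just above the old ebp
        cprintf("  ebp %08x  eip %08x  args %08x %08x %08x %08x %08x\n",
                ebp, eip, ebp[2], ebp[3], ebp[4], ebp[5], ebp[6]);
        ebp = (uint32_t *)*ebp;   // follow the saved ebp to the caller's frame
    }
    return 0;
}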

The read_ebp() function that mon_backtrace calls is defined in inc/x86.h as follows:

static __inline uint32_t
read_ebp(void)
{
    uint32_t ebp;
    __asm __volatile("movl %%ebp,%0" : "=r" (ebp));
    return ebp;
}

At this point the function can already print lines such as:

  ebp f010ff08  eip f01000a1  args 00000000 00000000 00000000 f010004a f0111308
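The loop depends on how each frame links to its caller: the word at ebp holds the caller's saved ebp, and the word just above it holds the return address. The sketch below walks a simulated stack to show that chain in isolation. It is not kernel code: frame pointers are word indices into an array instead of real addresses, index 0 stands in for the zero ebp that ends the chain, and walk_frames is a hypothetical name.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Walk a simulated stack and collect return addresses.
 * stack[ebp]     = caller's frame pointer (0 ends the chain)
 * stack[ebp + 1] = return address into the caller
 * Returns the number of frames visited, at most max_frames. */
static int walk_frames(const uint32_t *stack, size_t nwords, uint32_t ebp,
                       uint32_t *eips, int max_frames)
{
    int n = 0;
    assert(stack != NULL && eips != NULL);
    while (ebp != 0 && n < max_frames) {
        assert((size_t)ebp + 1 < nwords);  /* stay inside the array */
        eips[n++] = stack[ebp + 1];        /* return address above saved ebp */
        ebp = stack[ebp];                  /* move to the caller's frame */
    }
    return n;
}
```

The max_frames bound is an addition for safety in the simulation; the kernel version in these notes relies only on the saved ebp eventually being zero.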

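However, the file name, line number, and function name for each eip are still missing.

After reading the lab handout, I found that the code already provides

int debuginfo_eip(uintptr_t eip, struct Eipdebuginfo *info);

for looking up information about an eip, so it is enough to call it and print the fields of the struct it fills in: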

int
mon_backtrace(int argc, char **argv, struct Trapframe *tf)
{
    uint32_t *ebp;
    struct Eipdebuginfo info;

    ebp = (uint32_t *)read_ebp();
    cprintf("Stack backtrace:\n");
    while (ebp != 0) {
        uint32_t eip = ebp[1];    // return address saved just above the old ebp
        debuginfo_eip(eip, &info);
        cprintf("  ebp %08x  eip %08x  args %08x %08x %08x %08x %08x\n",
                ebp, eip, ebp[2], ebp[3], ebp[4], ebp[5], ebp[6]);
        // File, line, length-limited function name, and offset into the function.
        cprintf("%s:%d: %.*s+%d\n", info.eip_file, info.eip_line,
                info.eip_fn_namelen, info.eip_fn_name, eip - info.eip_fn_addr);
        ebp = (uint32_t *)*ebp;   // follow the saved ebp to the caller's frame
    }
    return 0;
}

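While testing this, the line number always came out as 0. Reading kern/kdebug.c showed that the line-number lookup in debuginfo_eip is left for us to implement: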

// Search within [lline, rline] for the line number stab.
// If found, set info->eip_line to the right line number.
// If not found, return -1.
//
// Hint:
//	There's a particular stabs type used for line numbers.
//	Look at the STABS documentation and <inc/stab.h> to find
//	which one. N_SLINE
// Your code here.

stab_binsearch(stabs, &lline, &rline, N_SLINE, addr);
if (lline > rline)
    return -1;
info->eip_line = stabs[lline].n_desc;
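The idea behind the lookup is a binary search for the last line entry whose address does not exceed the target. The sketch below shows that idea on a plain sorted table. It is a simplified stand-in, not the real stab_binsearch, which works on struct Stab entries, filters by stab type, and updates its left and right bounds in place; struct line_entry and find_line are hypothetical names.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Simplified line table entry: the address where a source line starts. */
struct line_entry {
    uint32_t addr;
    int line;
};

/* Return the line of the last entry with addr <= target, or -1 if the
 * target lies before the first entry. tab must be sorted by addr. */
static int find_line(const struct line_entry *tab, int n, uint32_t target)
{
    int lo = 0, hi = n - 1, best = -1;
    assert(tab != NULL);
    while (lo <= hi) {
        int mid = lo + (hi - lo) / 2;
        if (tab[mid].addr <= target) {
            best = mid;       /* candidate; look for a later one */
            lo = mid + 1;
        } else {
            hi = mid - 1;
        }
    }
    return best < 0 ? -1 : tab[best].line;
}
```

The -1 result plays the same role as the lline > rline check above: no entry covers the address, so no line number can be reported.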

Before this change the output looked like this:

  ebp f010ffa8  eip f0100076  args 00000004 00000005 00000000 f010004a f0111308
kern/init.c:0: test_backtrace:F(0,1)=(0,1)+54

Afterwards it becomes:

  ebp f010ffa8  eip f0100076  args 00000004 00000005 00000000 f010004a f0111308
kern/init.c:16: test_backtrace+54

make grade now passes every test:

tommygong@TommyGong:~/MIT-6.828/lab$ make grade
make clean
make[1]: Entering directory '/home/tommygong/MIT-6.828/lab'
rm -rf obj .gdbinit jos.in qemu.log
make[1]: Leaving directory '/home/tommygong/MIT-6.828/lab'
./grade-lab1 
make[1]: Entering directory '/home/tommygong/MIT-6.828/lab'
+ as kern/entry.S
+ cc kern/entrypgdir.c
+ cc kern/init.c
+ cc kern/console.c
+ cc kern/monitor.c
+ cc kern/printf.c
+ cc kern/kdebug.c
+ cc lib/printfmt.c
+ cc lib/readline.c
+ cc lib/string.c
+ ld obj/kern/kernel
ld: warning: section `.bss' type changed to PROGBITS
+ as boot/boot.S
+ cc -Os boot/main.c
+ ld boot/boot
boot block is 396 bytes (max 510)
+ mk obj/kern/kernel.img
make[1]: Leaving directory '/home/tommygong/MIT-6.828/lab'
running JOS: (1.2s) 
  printf: OK 
  backtrace count: OK 
  backtrace arguments: OK 
  backtrace symbols: OK 
  backtrace lines: OK 
Score: 50/50

The last step is to make backtrace available as a monitor command, which only requires adding an entry to static struct Command commands[]:

static struct Command commands[] = {
	{ "help", "Display this list of commands", mon_help },
	{ "kerninfo", "Display information about the kernel", mon_kerninfo },
	{ "backtrace", "Display the call stack", mon_backtrace },
};
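To see why adding one table entry is enough, it helps to picture how a table like this is typically consumed: the first word typed is compared against each name, and the matching entry's function pointer is called. The sketch below is a user-space illustration of that pattern, not the code in kern/monitor.c; struct command, dispatch, and the two demo handlers are hypothetical, and the handlers take no Trapframe argument to keep the example self-contained.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

typedef int (*cmd_fn)(int argc, char **argv);

/* One row of a command table: name, help text, handler. */
struct command {
    const char *name;
    const char *desc;
    cmd_fn func;
};

/* Demo handlers that just return a distinct value. */
static int cmd_help(int argc, char **argv) { (void)argc; (void)argv; return 1; }
static int cmd_backtrace(int argc, char **argv) { (void)argc; (void)argv; return 2; }

static const struct command demo_commands[] = {
    { "help", "Display this list of commands", cmd_help },
    { "backtrace", "Display the call stack", cmd_backtrace },
};

/* Look up argv[0] in the table and run its handler.
 * Returns the handler's result, 0 for an empty line, -1 if unknown. */
static int dispatch(const struct command *cmds, size_t n, int argc, char **argv)
{
    assert(cmds != NULL);
    if (argc == 0)
        return 0;
    for (size_t i = 0; i < n; i++) {
        if (strcmp(argv[0], cmds[i].name) == 0)
            return cmds[i].func(argc, argv);
    }
    return -1;
}
```

With this structure, supporting a new command needs no change to the lookup loop, only a new row in the table.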

Part 5: Note

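It is finally done. The write-up took about a semester on and off, because other labs had to be finished in between, and only at the end of term did I have spare time to come back to this one. Honestly, this lab is not easy to get started with: even after taking courses on operating systems, computer organization, and computer architecture, I still found most of the first part hard to follow. Fortunately my English keeps improving and I keep learning more. There is not much code to write; the real difficulty of Lab 1 is forming a clear picture of the whole boot process.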