5.1 KiB
Decompiling
This document describes how you can start decompiling code and contribute to the project. Feel free to ask for help if you get stuck or need assistance.
- Pick a source file
- Decompiling a source file
- Decompiling a function
- Decompiling
.initfunctions - The Ghidra project
Pick a source file
See the decomp tag in the issue tracker for
a list of delinked source files that are ready to be decompiled. This list grows as more source files are delinked from the
rest of the Assembly code.
You can claim a source file by leaving a comment on its issue, so that GitHub allows us to assign you to it. This indicates that you are currently decompiling that source file.
If you want to unclaim the file, leave another comment so we can be certain that the source file is available to be claimed again. Remember to make a pull request of any notable progress you made on the source file, which can include non-matching functions.
Decompiling a source file
It can be tricky to fully decompile an assembly file into a C/C++ source file, so here's some advice to make it easier:
- C/C++ code is built before assembly code
- This means you can take one function from the top of your assembly file, decompile it, and append it to the bottom of your C/C++ file.
- Build the ROM often
- We recommend building every time you decompile a function.
- This is because functions can sometimes match in decomp.me, but not when building.
- If your ROM doesn't match, it's easier to know which function is wrong if it's the only function added since the last successful build.
Decompiling a function
Say you've found a function you want to decompile. Here are the steps we recommend for decompiling it:
- Visit decomp.me and start decomping.
- Under the platforms, select "Nintendo DS".
- Select compiler preset "Phantom Hourglass".
- Copy and paste the target assembly for your function, including the
func_startandfunc_endmacros, and the pool constants. For example:
.global func_ov09_0211bf48
thumb_func_start func_ov09_0211bf48
func_ov09_0211bf48: ; 0x0211bf48
ldr r0, _0211bf50 ; =data_ov09_0211f59c
ldrb r0, [r0]
bx lr
nop
thumb_func_end func_ov09_0211bf48
_0211bf50: .word data_ov09_0211f59c
- Run
m2ctx.py include/MyHeader.hpp -cto generate a context and put it in your clipboard.- If no suitable header file exists, make a new one and put any structs and types you need in there.
- Paste the context into decomp.me, and create the scratch.
- Decompile the function and try to get a 100% match.
- There's no ARM decompiler in decomp.me yet, but Ghidra does the job quite well. See the Ghidra section for more info.
- If you're unable to get a 100% match, share your decomp.me scratch with other contributors and they may assist you.
- In the worst case, you can also contribute non-matching functions to this project.
Decompiling .init functions
Note
This section will be updated as we learn more about global objects. Feel free to contribute or provide us with more information!
Functions in the .init section are static initializers. Their purpose is to call C++ constructors on global objects, and to
register destructors so the global objects can be destroyed when their overlay unloads.
Static initializers are generated implicitly and do not require us to write any code ourselves. So, to generate one, you must define a global variable by using a constructor.
If the static initializer calls __register_global_object, that means the global object has a destructor. This means you'll
have to declare a destructor if it doesn't exist already.
Another consequence of having a destructor is that a DestructorChain object will be added to the .bss section. This struct
is 12 (0xc) bytes long and is also implicit, so we don't need to define it ourselves.
Important
An important thing to keep in mind is that a static initializer can construct multiple global objects.
Decompiling data
Note
Under construction! It's not fully clear how data is decompiled, as the compiler is strict on how it orders global variables. Feel free to contribute to this section or provide us with more information!
Other than .text and .init which contain code, there are the following sections for data:
.rodata: Global or static constants.data: Global or static variables.bss/.sbss: Global or static uninitialized variables
You can see examples of these data sections in the compilation section in build_system.md.
The Ghidra project
We use a shared Ghidra project to analyze the game and decompile functions. To gain access to the project, install Ghidra version 10.2.3 and request access from @aetias on Discord.