Self Hosted Compiler Meetings - hdorio/zig GitHub Wiki

The meetings happen weekly, every Thursday at 19.00 UTC, on this Discord server

Edit this wiki to get your agenda item for next week added.
When there are no items in the agenda for a given week, the meeting is skipped.

2022-12-29

@matu3ba: I'd like to present my work so far and ask a bunch of design questions regarding a portable interface for non-standard streams or if that is too clunky/not flexible enough. I'd also like to discuss how to handle posix spawn, which mandates declarative step definition, but child_process offers no api to it and how this could work together with the Windows approach of process spawning (imperative like fork). This insofar aligns with stage2, as it is a prequisite for a clean fix to std.log.debug and std.log.info do not show up in zig build test output as mentioned in https://zig.news/slimsag/my-hopes-and-dreams-for-zig-test-2pkh and I got so far no input/feedback on the design or general attitude of my PR(s) from core members. Relevant PR (both breaking and creating new zig1 fails) https://github.com/ziglang/zig/pull/12754 and https://github.com/matu3ba/zig/tree/childproc_stdin for the sketch of necessary changes to use stdin without pipe mode.

2022-11-10

@gwenzek
- I've adapted a large test suite for C ABI and figured out how to test it across different arch
- discuss how to best use the test suite
- get pointers on how to fix it

2022-10-20

@mlugg
- inconsistent and hard to repro bug with package imports
  - I have a vague theory as to what's going on but need a hand from experienced folk to figure out if I'm right and figure out a fix
  - only repros with one specific project; I've failed at finding any reduction. I suspect the inconsistency of the issue is the problem here
  - happens without clearing cache in the meantime
  - expected behavior: no errors, actual behavior: Sema error
  - further insights might be gained from in-depth debug logging
@andrewrk
- status on the 0.10.0 release (proceeding according to schedule)
  - 38 issues left open, some of these may be postponed
- outlining a plan for moving towards a global intern pool for types and values
  - Values are represented as u32 indices, this makes the compilation state easily serialized
  - this will affect many lines of code, therefore, Andrew has a transition plan
  - At the moment, the index into the indexed values is a u32, this may be changed to u64 in the future
  - Another idea: perform Liveness Analysis on ZIR so we can reuse slots in the intern pool
  - The intern pool would also need to be garbage collected
  - we do want to eventually multi-thread Sema, but the intern pool may present a challenge for this, worst-case scenario: locking

2022-09-15

@joachimschmidt557
- Layout of optional types
  - is it defined according to the language spec?
    - no
  - if not, should we strive toward same layout in LLVM and self-hosted?
    - yup, as the code in type.zig assumes the payload-first layout

2022-09-08

@Vexu
- referenced here note. Reference #10648 #10648 #12141 #7668
  - Andrew: referenced here notes in stage1 were very noisy, which led to people ignoring the error notes
  - maybe introduce a command-line flag to turn on these error notes?
  - But if they aren't there by default, might as well not include them
  - Maybe this version of referenced here notes: https://github.com/ziglang/zig/pull/11533
  - Vexu's idea: provide a limited error note trace for the first error together with instructions on how to get a full trace (using a command-line flag)
    - We'll go forward with this
- Reinterpreting invalid data at comptime? Reference #12468 #11734
  - SpexGuy had some opinions on this, but he was not present at the meeting
- https://github.com/ziglang/zig/pull/12764 result locations
  - Proposal: remove type awareness from AstGen, move to Sema
  - Andrew's take: hold off on AstGen because some language design changes regarding aliasing, async, and result locations may come in the future
@kubkon
- demo of the incremental COFF linker
- issue with libstd and File abstraction I bumped into while working on the COFF linker and getting all tests passing on Windows: #12783
  - Andrew's opinion: When using pwrites, we can expect that we only use pwrites, so it is reasonable to make the behavior that pwrites invalidate seek position
  - Luuk's opinion: People expect the same behavior across all platforms
  - Maybe also add some safety feature to File so that normal writes/other other operations will panic after pwrites were performed
@andrewrk
- LLVM 15 upgrade status report
  - Updated NativeTargetInfo.zig
  - CI will test llvm-15 branch, when everything works, it's time to merge

2022-08-25

@joachimschmidt557
- ~~CodeGen: merging the instruction mappings of both branches in a conditional branch~~ (postponed)

2022-08-18

@joachimschmidt557
- Yet another rewrite of the register allocation mechanism in Codegen
  - Justification
    - More generic than binOpRegister and binOpImmediate
    - Handles register classes
    - Promotes use of Air.Inst.Index instead of MCValue
    - Handles edge case of flipped RHS and LHS
  - Also: get rid of binOp
    - huge function, split each switch prong into a new fn
    - eliminate redundancies and footguns (e.g. checking whether an operand fits into an immediate)
@kristoff-it
- Quick update on the state of Autodoc
  - source listings are implemented
  - at the moment, source listings are basically only the source files with syntax highlighting and anchors for line numbers
  - Future work: clicking import paths leading to other files, clicking other stuff leading to their respective declaration
  - Loris thinks replacing multiple-file approach in the future might be possible
  - Proposition: go even further, ship Autodoc as a WASM module and generate documentation and syntax highlighting on-the-fly just-in-time
    - would maybe make examples in the language reference editable and runnable
    - There is an inherent tradeoff between bandwidth and compute power here – having the compiler as a WASM module means less data tranferred (only raw source without syntax highlighting and other metadata) but having more power draw on the client
    - We'll have to investigate further where we want to land eventually

2022-08-04

@vincenzopalazzo
- std-lib: JSON parsing panic when the string contains trash at the end, issue related
  - Suggestion: use https://ziglang.org/documentation/master/std/#root;io.Reader.readUntilDelimiterArrayList API of std.io.Reader
@matu3ba
- questions on AST traversal and rendering strategy for zig-reduce (see https://gist.github.com/matu3ba/a6a76dd8309b37a002cd09de79512e99)
  - render function of "normal" AST is written with zig fmt in mind
  - Suggestion: using translate-c's AST API which is a bit more flexible and more suitable for this use case
- strategy to reduce generated C code (see https://github.com/ziglang/zig/issues/11651, https://github.com/ziglang/zig/issues/11849)
  - An AIR optimization is necessary where we prevent emitting many debug line numbers (which may arise from comptime code etc.)

2022-07-28

@joachimschmidt557
- self-hosted codegen: revamp branch_stack mapping of AIR indices -> MCValue
  - problem that made Joachim aware of this issue is unrelated (runtime safety checks for divisions are emitted in Sema even when the RHS of the division is a constant)
  - example AIR snippet:

  %4 = constant(u32, 32)
   ...
  %6 = cmp_neq(%4, %5!)
   ...
  %15 = div_trunc(%0!, %4!)

In %6, %4 is moved into a register. This move is made "permanent", i.e. in the instruction mapping, the MCValue corresponding to %4 is no longer an immediate, but instead a register. This alone is actually a good thing: For more complex constants such as 0x1234abcd, moving them into a register is something we want to avoid as much as possible, so we communicate to later instructions that this value is "cached" in a register.

However, a problem arises in %15. Codegen doesn't know that %4 is actually a constant value and cannot generate a more efficient right shift. The solution is to add more tracking information.

Also related: "copy-on-write" behavior for register and stack allocations: For register_c_flag MCValues, we currently need to allocate a new register (which is kinda wasteful) when we want to access the register or the c flag individually (as part of struct_field_val). Proposed solution: introduce more sophisticated tracking mechanisms to RegisterManager (and future StackManager) which support multiple AIR instructions "sharing" an MCValue and performing "copy-on-write" if necessary.

Also also related: Why do the native codegen backends have this branch stack? It's because they don't have the luxury of infinite registers like LLVM and WASM. Therefore, they need to keep track of branches and consolidate the MCValues of different branches.

@mattnite
- some experiments on C/C++ build.zig packages
  - Goal: make transition from CMake to build.zig as smooth as possible
  - Proposed "design pattern" in build.zig:
    - <library> = <library_name>.create(b, target, mode, <build options> where build options can be e.g. for curl whether to enable FTP
    - <library>.addDependency(.private, <other library>
  - Preventing arbitrary code execution (possible options)
    - use zig comptime to configure a library
    - force libraries to only use the standard library build steps and use a custom "sandboxed" build runner
    - compile the configure code to WASM/BPF and execute that in a sandbox, make it print the dependencies and all other information on standard out
  - Integration with the system package manager
    - introduce a config that makes the build script use the system package instead of the vendored package
@nektro
- debugging help for #11867
  - Andrew will review the PR
- showcase of llvm15 upgrade progress
  - ThreadSanitizer runtime is manually cherry-picked from LLVM codebase, so this can be temporarily regressed in the LLVM 15 upgrade
  - systemz vs s390x naming: make it consistent, it seems like s390x is the preferred name
    - making this consistent can be done on master branch instead of llvm15 branch
  - llvm15 branch created, @nektro will open a PR
@superauguste / aurame
- questions/clarifications about plans for stage2 language server
  - Protocol used will be different from language server protocol
  - Protocol will be subscription-based instead of query-based
@matu3ba (screensharing did not work)

2022-06-09

@kubkon
- demo of some micro-optimisations immediately achievable in self-hosted backends
- incremental writing of symtab in ELF leads to inelegant warnings in gdb, lldb, objdump, etc.
  - Likely the cause by a call of allocateDeclIndexes followed by the usage of freeUnusedDecls. Since we've been wanting to get rid of allocateDeclIndexes for a while now, we will follow through with that change and it will likely fix this issue as well.
- ordering of static libs on linker line - should we link libcompiler_rt.a before libc.a, or after? Depending on the ordering, we will get different results. Related #11832.
  - Problems were occurring due to symbol aliases, the alias was exported as strong, rather than the linkage as specified. - Decided to remove all symbol aliases within compiler-rt, as well as look into fixing this issue.
  - Discussed the improvement of link-time by changing compiler-rt from a singular object file, into object files per file and store those as part of the archive file. This has a few benefits:
    - We can parallelize the compilation of those files; (This will likely offset the extra work that needs to be done to handle those files separately rather than in a singular compilation unit).
    - Symbol resolution is lazy for archive files, meaning we only link the object files within an archive if it contains a symbol that is required.
    - This means that rather than having to parse, resolve symbols and link the entire compiler-rt object file, we can simply link with the compiler-rt functions that are actually used;
    - Deduplication/garbage collection will also be made faster by this, as there is less to clean up, further increasing the linking time.

2022-06-02

@Luukdegram
- Require some help tackling https://github.com/ziglang/zig/pull/11747
  - compiler_rt is working for WASM, except for memcpy and memset
  - memcpy and memset should be moved from c.zig to compiler_rt
  - However, this currently causes issues, specifically when compiling for MIPS+musl
  - During LLVM Emit Object, OOM occurs
@andrewrk
- Discussing some additional linker API for Sema to query what kinds of relocations are supported by the target, for the purposes of comptime math performed on the result of @ptrToInt(&global_variable_or_function).
  - Test case: https://github.com/ziglang/zig/blob/e498fb155051f548071da1a13098b8793f527275/test/behavior/align.zig#L503
  - Top-level const x = @ptrToInt(&foo) + 10 could be lowered to a relocation of foo with addend 10
  - requires addend support in linker format (WASM doesn't support this)
  - ELF also supports negative addends
  - Support for this can be added incrementally; if the linker backend does not support these operations, emit a compile error
@kristoff
- Next steps to expose autodoc from Zig master.
  - new Autodoc is close to publishable
  - command-line flag for new Autodoc could be possible, but not necessary (old Autodoc has a bad reputation anyways)
  - new Autodoc has a larger JSON payload size (28 MB vs 13 MB on stdlib)
  - However, compressed (gzip) sizes are very similar (1-2 MB)
  - Source code in Autodoc: as strings inside JSON payload or HTML files?
  - Extra network requests for source code viewing is an acceptable tradeoff, so HTML files sounds like a good idea

2022-05-26

@Vexu
- @ptrToInt on pointers to comptime variables
  - https://github.com/ziglang/zig/blob/b08d32ceb5aac5b1ba73c84449c6afee630710bb/test/behavior/align.zig#L508
  - Previous design: Storing the result of @ptrToInt on a comptime var to a runtime value will cause the comptime var to lose it's mutability and be a global constant henceforth
  - Proposal: Make that a compile error instead
@kubkon - ~~If Jakub can make it~~(I made it!!!) we'll talk about adding linker test cases to the test-cases harness.
- initial proposal (rejected): Linker tests usually need multiple files, so for each test we include one folder with all the source files and a manifest file which describes the build process and the expected output/some other test (e.g. fuzzy grep).
- Consensus: No binary blobs in source tree
- zig cc is used by Go and Rust, we want regression tests for that, but we'll do that in a different repo
- Alternative to initial proposal: Use the standalone tests API
- New build step for linker tests, these tests will however reuse the standalone test API
- Also: we will start to enable LLVM in the CI tests
@kristoff-it
- ultra quick demo of autodoc progress
  - Autodoc is searching for more contributors
  - some things are better in new autodoc, however, many features from stage1 autodoc are still missing

2022-05-19

@andrewrk
- let's come up with a strategy for more efficient overflow arithmetic lowering for safety checks, and coordinate between different backend maintainers to make sure it is reasonable for the different backends.
  - The approach by stage2 results in slightly less optimal LLVM IR
  - The generated assembly is identical however
  - Turns out this is not a problem
@kubkon
- AVX PoC for native self-hosted x64 backend - #11681
  - floating point support with AVX/SSE works on Linux and macOS now
- handling of extended register sets in regalloc - general purpose + floating-point/SIMD
- demo of new register locking mechanism - vital for anyone wanting to contribute to native backends that require register use!
  - previous mechanism (freeze/unfreeze) was error-prone (a register could be locked a second time further down in the call stack and unlocked when returning to the caller function, which assumed it was locked)
  - New lock/unlock mechanism prevents this bug by only locking once
- #10318 - proposed resolution by changing macOS ABI from GNU to none in fix-10318
  - The default ABI for macOS will change to none
  - Compile error when using gnu abi for macOS (similar to the error which is shown when using musl ABI on macos)

2022-04-28

@kubkon
- demo of the new test harness for compile error tests and run-and-compare (incremental updates) tests
  - spec by @mitchellh in #11288
  - initial draft partial implementation by @topolarity in #11284, #11417 and #11421
  - some cleaning + more complete manifest parser + migration of run-and-compare tests in #11530

2022-04-21

@joachimschmidt557
- debug info for local variables: keep AIR instructions alive for debug info purposes
  - Joachim's "premise": no <optimized out> in debug builds
  - This would require us to preserve AIR instructions which have debug info attached to them
  - Temporary AIR instructions which do not have debug info attached do not have this restriction
  - We run into a predicament:
    - On the one hand, no <optimized out> in debug builds makes sense
    - On the other hand, extracting a temporary into a constant variable should not have negative effects on the performance
  - Also might be worth considering: Overwriting variables with undefined once they go out of scope
  - @kubkon wants to investigate capabilities of DWARF, how it might mitigate this issue
  - Possible solution:
    - Make the frontend handle this
    - Add a new AIR instruction (e.g. dbg_var_end) for when a variable x goes out of scope
    - This has the effect that the AIR instruction connected to x dies exactly at that dbg_var_end instruction
    - Investigate the implications of this in a separate branch
@andrewrk
- outline the plan for shipping the self-hosted compiler
  - 0.10.0 has two non-negotiable prerequisites:
    - ship self-hosted
    - LLVM 14
  - Next steps
    1. fix translate-c segfault
    2. runtime safety tests (will unlock working on 3 with stage3)
    3. compile error tests

2022-03-17

@joachimschmidt557
- remove MCValue.embedded_in_code
  - consensus: yes, do it
  - switch jump tables will be lowered using a different mechanism than MCValue.embedded_in_code
- register allocation for floating point registers
  - Multiple RegisterManagers or one unified?
  - We also need to keep vector registers in mind
  - Try multiple RegisterManagers approach for now
@kubkon
- demo of hot-code reloading on macOS
  - AArch64 backend still needs some work for stdlib nanosleep
  - ~~Requires sudo for now~~ Managed to bake in entitlements as explained in the follow-up blog post: https://www.jakubkonka.com/2022/03/22/hcs-zig-part-two.html
    - needs to open Mach IPC port
    - LLDB compiled from source can do this without sudo using entitlements
    - we will do this too
  - Requires manual pkill of child process for now
  - Also works with ASLR enabled (we kind of act like a dynamic linker)
  - Blog post on Jakub's homepage: http://www.jakubkonka.com/2022/03/16/hcs-zig.html
@Snektron
- Showcase some spir-v progress
  - stage2 can compile a SPIR-V shader which can then be executed on the GPU by a Zig program which was also compiled with stage2
  - improvements to SPIR-V inline assembly will be taken into account when overhauling inline assembly
@andrewrk
- quick progress update and asking people with open PRs to please rebase because of the recent CI failure

2022-03-10

@andrewrk
- let's figure out how we want Sema to emit debug information for local variables that works well for the LLVM backend as well as the native backends and wasm backend.
  - Mapping of variable names to locations (stack, register)
  - Add 2 more AIR instructions: "debug pointer" and "debug value" (corresponding to the LLVM debug instructions)
  - We should also emit spills (register -> stack offset and vice versa) in DWARF
  - There was still uncertainty whether 2 instructions were necessary or 1 is enough
  - Andrew will implement this for LLVM (and possibly reduce it to 1 extra instruction)
@kristoff-it
- Showcase current state of Autodoc, briefly explain implementation layout & main techniques adopted, invite others to contribute. Video trailer.
  - Slides
  - Autodoc can render std.ascii already
  - Correct terminology: Autodoc = system that generates docs, Autodocs = the generated docs
  - High-Level Design
    - Single-Page Application, search embedded in page
    - Optimise for memory usage
  - Additional features (not present in stage1 autodoc)
    - toggle for internal mode (shows private decls)
    - Sources section (syntax-highlighted Zig code)
    - Guides section (hand-written docs)
  - Autodoc uses ZIR
    - for complex expressions which cannot be trivially evaluated, Autodoc uses ComptimeExpr
    - Possibility: Wait for Sema to complete and use more info from there; map Sema information to ComptimeExprs
    - ArrayList: When both ArrayList(u8) and ArrayList([]const u8) are used, show these example instatiations
  - Vision for Guides
    - searchable
    - maybe generated from folder with Markdown files?
    - figure out vision later

2022-03-03

@joachimschmidt557
- runtime safety checks in Sema without generating too much extra machine code
  - related issues: https://github.com/ziglang/zig/issues/10248 https://github.com/ziglang/zig/issues/84
  - way forward for arithmetic operations: Sema will generate an add_with_overflow instruction for add operations when runtime safety is on, else will emit add (for subtractions, multiplications similar)
  - as @addWithOverflow and co. return a tuple, this enables the codegen backends to specialise on this: For e.g. ARM, @addWithOverflow can return an MCValue consisting of a register and the condition flags (including overflow flag). The overflow check can then be lowered to check the condition flags.
  - We will introduce optimizations to lowering of cond_br in codegen. If one branch ends with unreachable, we can remove the need for one branch instruction as control flow will never leave that branch again. Checking whether a branch ends in unreachable is trivial: we just check the last instruction in the branch.
  - Why Sema generates the panic: This enables us to skip emitting the panic handler if it is never called.
  - Related: For try, Sema can generate conditional branches with some sort of hint that the branch for the error case can jump to the end of the function (or similar) and have the non-error path (which should be the hot path) in the machine code as "uninterrupted" as possible.
Type Coercion in AstGen vs in Sema

const a = true;
const b = if (a) 1 else 2;

At the moment, this generates a coercion to bool in AstGen which should be moved to Sema.

https://github.com/ziglang/zig/issues/11046
- one of the last things preventing std.debug.dumpStackTrace
- @andrewrk wants to tackle this

2022-02-10

@joachimschmidt557
- codegen: third rewrite of lowering mechanics for binary operations: showcase and feedback
@kubkon
- codegen+linker: lowering of slices: showcase
- codegen: truncation, signed, non-power-of-two: showcase and feedback
- codegen: PIE mechanics: showcase and feedback
- linker: refactoring of LinkBlock abstraction
@topolarity
- sema: Does "Value.read/writeToMemory" need to respect the ABI size of exotic (non-power-of-two) types? Is memory required to be sign-extended to the full buffer size? (Context: Trying to fix an observed bug in @bitCast for exotic integers, where the sign bit is ignored)
@andrewrk
- Pitch my SegmentedList idea
@agni
- :^)

2022-01-27

@g_w1
- For https://github.com/ziglang/zig/issues/7923 , what should the parser type of the ident be? We don't want to allow test u32 { or test @as(...) for example, but do we want test @This() {? What should the parser do vs astgen vs sema?

2022-01-20

@andrewrk
- Should we move the concept of byval/byref from the backend to the frontend? In other words, the frontend would ask the backend if a given type was byval or byref, and avoid using the byval instructions, such as elem_val, for types that are byref. This would make Sema emit allocations rather than codegen backends having to find out (possibly too late) that they needed more allocations for byval stuff.
@Luukdegram
- Moving and enable/disable behavior tests.

2022-01-13

@andrewrk
- Getting help from @Luukdegram on the wasm backend to troubleshoot #10582
@joachimschmidt557
- Register allocation stuff again (yay)
  - API change
  - scratch register for Emit pass
@kubkon
- proposal: storing pointers in linear memory: dealing with moved atoms/decls in the linker via relocation mechanism rather than in codegen

2021-12-23

@gwenzek
- Heterogeneous programming support #10397
@Luukdegram
- Address spaces in Wasm. Solving #4866

2021-12-15

@mattnite
- BTF: getting useful BTF info when compiling for BPF, see if there are other ideas for llvm intrinsics.

2021-11-25

@joachimschmidt557
- Register allocation: maybe just use the stack for now and focus on getting more tests passing and then design a generic and efficient register allocator (credits for that idea: Loris and Meghan)
@Luukdegram
- ABI implementations: Ensuring correctness between backends (llvm vs ours).
- (Info for the curious reader: https://github.com/WebAssembly/tool-conventions/blob/main/BasicCABI.md)

2021-11-18

@kubkon
- callee preserved registers on x86_64 - do we reserve rax for params and return values only, or is a block free to use after pushing it on the stack?
- related issue - https://github.com/ziglang/zig/pull/10140
- for register allocator - do we track register liveness?
@jmc
- signal boost for the new wiki pages Contributing to Stage 2 and C backend signup sheet

2021-11-11

Meeting will be skipped due to Handmade Seattle.

2021-11-04

@Luukdegram
- Proposal and reasoning for emitting MIR in the wasm backend.
  - Supporting document: https://docs.google.com/document/d/1JEUe5ivuMHmIY8ehkVHihWIq9bFsvsB0IuVbrLp4Mak/edit?usp=sharing
@kubkon
- progress on MIR transition of x86_64 backend
@joachimschmidt557
- MIR branch lowering algorithm for AArch64
@andrewrk
- showing off ziglang.org/perf and looking for contributions to add benchmarks to gotta-go-fast

2021-10-28

@joachimschmidt557
- Progress on MIR transition of AArch64 backend
- MIR branch lowering
@The-King-of-Toasters
- Discussion on how to work in the event loop code.
  - End goal: add support for Solaris event ports.
  - Easy win: merge the openbsd for initOsData with the rest of the KQueue OSs.
  - Possibly shadow @kprotty to get my head around the codebase.
- Adding seccomp definitions to os.linux
  - Requires filling in definitions for "classic" BPF that many OSs use, split out into own namespace?
  - Seccomp programs load members from the seccomp_data structure (see <linux/seccomp.h>) using offsets. A common idiom for seccomp programs in C is offsetof(seccomp_data, arg[n]), where arg is an array of u64s. I have a workaround for this, but it would be nice to have this added for parity with C (would be interested in doing this).
@andrewrk
- progress demo & roadmap

2021-10-21

@joachimschmidt557
- Register allocation in a separate pass: discussion
@paezao
- Mentoring for https://github.com/ziglang/zig/issues/9185
@kubkon
- Demo and review of MachO's linker state dumping mechanism: https://github.com/ziglang/zig/pull/9985
@moosichu
- Request for clarification on zig ar implementation: https://github.com/moosichu/zar, https://github.com/ziglang/zig/issues/9828 Steady progress is being made (big thanks to @iddev5), but as features get fleshed out it would be good to have an idea of a final end-goal.
  - What does it need to be considered ready for upstream integration, beyond coverage of the llvm-ar features?
  - Some behavioural differences seem to be desired from llvm-ar (in a cross-compilation setting), so it would be good to know where it struggles so we can address that.
  - Because the goal is to be a drop-in replacement for an existing program. Is that the only goal, or are there others? Should that be a compatibility mode or how the program behaves by default?
  - I also have some ideas for testing, and I want to make sure they make sense (depends on if we can use llvm-ar through zig as well).
  - Also happy to answer any questions/concerns anyone might have.

2021-10-14

@kubkon
- progress on incremental Mach-O linker
- progress on zld ELF/x86_64
@andrewrk
- brainstorming on MIR
@andyfleming
- help on appropriate way to run test subset (compile_errors.zig)
  - Issue: https://github.com/ziglang/zig/issues/9203
  - WIP: https://github.com/ziglang/zig/compare/master...andyfleming:fix-9203?expand=1
@nektro
- Q on building with llvm13
- Q on https://github.com/ziglang/zig/commit/b0f80ef0d5517b5ce8ba60dfae6dd986346f8d4c

2021-09-23

@joachimschmidt557
- Thoughts on https://github.com/ziglang/zig/issues/9798

2021-09-16

@kubkon
- demo of the MachO linker after merging traditional with incremental codepaths - https://github.com/ziglang/zig/pull/9676
- what's next for MachO?

2021-09-02

@g-w1
- question about ++ and ** at runtime
- maybe plan9 debuginfo demo
@kubkon
- update from the linker depths - https://github.com/ziglang/zig/pull/9676
- discussion of current limitations for self-hosted for PIE targets
- demo of self-hosted linking Zig with compiled C object file
@nektro
- I submitted my first stage2 PR :)
- Then I submitted 3 more
- Not super demo-able, just excited. I can overview the bugs they fixed though

2021-08-26

@andrewrk
- short "how to start contributing" talk
- talk about usingnamespace
- stage2 progress report
- show some stage2 debugging tips
- here's a contributor friendly item: https://github.com/ziglang/zig/issues/9622
- brief explanation about Value/Type tag() vs zigTypeTag()

2021-07-29

@andrewrk
- Pitch to see if anyone wants to try deleting the "BoundFn" type from the language and compiler (both stage1 and stage2). #9484
- Demo of recent progress on getting zig test to work for stage2.
- Discussion: should we kill the "ref" AIR instruction?
@joachmschmidt557
- compare flags issues in self-hosted codegen

2021-07-15

@andrewrk
- Pitch to see if anyone wants to take on zig objcopy #9261
@jmc
- https://github.com/ziglang/zig/issues/9257 was closed, but not sure it's "fixed", see discussion there
- Is https://github.com/ziglang/zig/pull/9183 mergeable?

2021-07-08

@andrewrk
- Demo some recent progress
- Pitch to see if anyone wants to take on a stage1 memory task to help us accelerate stage2 work this release cycle
- Pitch to see if anyone wants to take on a project to track compilation speed over time
- Picking back up work on AIR memory layout, specifically with a design goal in mind of improving simplicity of codegen by putting all the constants up front so they can be possibly lowered to memory without jumps
- Code review & merging of any stage2 open PRs during the meeting
@greenfork
- Questions about linker, relevant PR
@kubkon
- Demo some recent progress on linker rewrite - merging Zld with incremental MachO!

2021-06-24

@g-w1
- Demo plan9 hello world!
@andrewrk
- Brief explanation of how ast-check is now running even when using the stage1 compiler.
@greenfork
- Ideas on tilde error printing, questions PR, why parent
@redj
- https://github.com/ziglang/zig/pull/9029 vs @kubkon's ObjC support.
@jmc
- Opinions on https://github.com/ziglang/zig/issues/9182?

2021-06-17

@g-w1
- Demo plan9 linker.
- Discuss more inline assembly to support hello world/show abi.

2021-06-10

@Vexu
- I have some issues and prs that could be discussed to try to shrink the pr queue.
- Maybe a quick demo of stage2 comptime vars?
@andrewrk
- It's time to rework AIR Memory Layout. Is anybody interested in taking it on? I have a branch to get you started if you want.

2021-05-27

Agenda

@Luukdegram
- wasm proposals, versions, and how to handle them regarding compatibility. (i.e. multi-value returns)
@g-w1
- showcase unused var compile error

2021-05-20

@Snektron
- progress on SPIR-V
@kubkon
- progress and Q&A on shipping WASI libc in Zig - #8837
- what to do about zig c++ -target wasm32-wasi?
@andrewrk
- demo some progress from whole-file-astgen
- release plan for 0.8.0 - what is left, what to prioritise, etc.

2021-05-13

Agenda

@kubkon
- zld demo: stage1 bootstrapped with zig-bootstrap passes tests on macOS
- Wanna run macOS tests but don't have a Mac? No probs! Zig's got you covered (at least partially): #8760
@andrewrk
- progress on stage2-whole-file-astgen
- native libc++
- pitch the idea of test harness loading code at runtime

2021-04-29

Agenda

@ifreund
- std/build: change default install prefix from local zig-cache to . #8638

2021-04-22

Agenda

@andrewrk
- Demo whole-file-AstGen
- Outline what a hot code reloading feature would look like

2021-04-15

Agenda

@joachimschmidt557:
- Register manager: Register classes
@amro:
- Stage 1 C ABI struct parameters / return values
@andrewrk:
- llvm12 update
- demo work-in-progress whole-file-astgen and std.zig integration

2021-04-08

Agenda

@andrewrk
- Go over some contributor friendly stage2 issues, try to inspire some fresh faces to get involved :)
- Demo recent progress on structs & enums
- LLVM 12 update
@joachimschmidt557:
- Questions about reusing stack locations (stage2 codegen)

2021-04-01

Agenda

@joachimschmidt557:
- Register allocation: Use function arguments from stack
- Questions about compiler_rt
@g-w1
- Showcase ast-based doc generator
- Ask for feedback, specifically about design

2021-03-25

Agenda

@andrewrk:
- brief overview of the compiler pipeline
- zir-memory-layout branch update
- a plan for inline while, comptime code, etc in stage2
@nektro: plz explain stage1 ZigType + ZigValue

2021-03-18

Agenda

@joachimschmidt557:
- Extract register allocation to separate file
- Track all registers, not just callee preserved
- Slightly modify register spilling
@andrewrk:
- Pitching an idea for handling archive files in the cache system/ incremental compilation
- Status update on zir-memory-layout branch
- The "start2.zig" plan
@kubkon
- Pitching an idea for handling PIE targets in a uniform manner - see #8281 for changes to GOT handling on macOS
- Status update on zld - see #8282
@dimenus
- new contributor trying to grok the stage 2 compiler pipeline
- working on TODOs for inline asm

2021-03-11

Agenda

@joachimschmidt557: CPU feature detection on ARM revisited (see https://github.com/ziglang/zig/pull/8206)
@andrewrk: progress on ZIR memory layout reworking. LLVM12 status update.

2021-03-04

Agenda

@andrewrk:
- Demo the new target CPU features maintenance strategy
- LLVM 12 status update
- Looking for contributors for native CPU feature detection for ARM
- Some guidance on lazy value system / error sets / strategizing what areas of stage2 to work on so we don't have to redo work later
@joachimschmidt557:
- wrapping arithmetic vs normal arithmetic in TZIR
- showcase of different ways we could implement CPU feature detection for ARM
@kubkon:
- Yet-another-demo of zld -- zld now passes a vast majority of Zig tests both on aarch64 and x86_64!
- Enabling logs in stage1 for the linker stage -- as a quick workaround while working out fixes to reloc fixups, I've reverted to using log.warn in place of log.debug to gather some context for the potentially failing test cases; however, a more permanent solution would be to print logs with some runtime command/setting. So the question really is, do we have something similar to how we manage logs in stage2 in stage1?

2021-02-25

Agenda

@andrewrk: proposal to improve test harness to support loading test cases at runtime
@andrewrk: merge of ast memory layout
@kubkon: zld upstreaming & how to handle caching logic as it applies to linking

2021-02-18

Agenda

Loris: alternate meeting venues?
@andrewrk: progress report on ast memory layout
@andrewrk: I have to drop everything and work on llvm12 branch
@kubkon: https://github.com/ziglang/zig/pull/8035 -- for context, I need this to be able to integrate zld in Zig
@kubkon: zld progress -- the story thus far...

2021-02-04

Agenda

@andrewrk: progress on ast memory layout / asking for help with zig fmt
@joachimschmidt557: comptime bitwise operations
@Luukdegram: Small wasm demo (importing js functions and calling zig functions from js)
@nektro: reconsider length between release cycles

2021-01-28

Agenda

~~@nektro: reconsider length between release cycles~~
@andrewrk: experimenting with a different memory layout for AST, with the goal of less memory usage and faster perf. If it is successful, the same strategies can be applied to ZIR and TZIR.

2021-01-21

Agenda

@joachimschmidt557: How exitlude jump relocations are implemented in the ARM backend.
@andrewrk: (quick announcement) temporarily regressing .zir parsing/emission in order to rework it to better fit into the design
@Snektron: Discuss SPIR-V architecture definitions.
@max: Demo register allocation system
~~@kubkon: Inside the linker: what is a TextBlock and how do we manage the free-lists?~~
@kubkon: Pushing stage2 to its limits - demo!
@andrewrk: Pitching my grand plan to make astgen emit ZIR that handles result locations optimally and solves cases like this. Goal is clean, maintainable compiler code with no hacks, as well as avoiding dead stores in the common cases, even in debug builds.
~~@kubkon: if time permits, TLS + LLVM + Apple Silicon == no stack traces in stage1. More context here.~~

2021-01-14

Agenda

@Luuk: Refactoring Wasm backend in stage2
@joachimschmidt557: #7187
@joachimschmidt557: Questions about reuseOperand function in codegen.zig
@FireFox317: Demo of the LLVM backend in stage2
@kubkon: Fancy that! -> #7781

2021-01-07

Agenda

@andrewrk: It's time to start tackling comptime performance in stage2. Here's a motivating test case: https://clbin.com/VdzX5
@andrewrk: Windows CI is no longer able to build zig1.obj (using stage1) due to OOM. We need to figure out how to make the CI green.
@andrewrk: #7296
@joachimschmidt557: duplication function parameters onto the stack (for debugging purposes etc.)
@kubkon: demo of extern fn linking in Mach-O.

2020-12-22

Agenda

@andrewrk will demonstrate tracy integration -- tracy is a performance profiling tool. Here's a link to Andrew's old live coding session about it: youtube.com.
@kubkon asked how we will handle threadlocal as a keyword. How is it currently handled in stage1 with LLVM?
@kubkon asked what is currently missing in stage2 to get the standard basic binary like

const std = @import("std");

pub fn main() void {
    std.log.info("Hello, world!", .{});
}

to successfully compile?

@andrewrk will discuss the new thread pool landing in master: ziglang/zig#7462.
@kubkon made a couple of major changes to how we handle Mach-O in stage2 in the sense that, much like for Elf, we preallocate space for various segments and sections to aid in incremental linking. This allows us to skip rewriting __LINKEDIT segment every single time __text changes for instance. However, this means we break out from the convention set by other tools such as Apple's ld or LLVM's ld64. Additionally, this may imply that Apple's codesign tool will not validate the Mach-O binary due to certain hard-coded (wrong) assumptions about the structure of the Mach-O. Question here is: should we leave it as-is for Debug builds, and perhaps follow the convention in Release?

Meeting notes

A command that might be useful for Unix-like OSes:

nix-shell  -p cmake -p ninja -p pkg-config -p glfw -p freetype -p capstone -p tbb -p gtk3-x11

Thread-local is delegated to an appropriate LLVM API function. @kubkon demonstrated the origin of the question being a failing libstd test on aarch64-macos-gnu to do with declaring a global variable as thread-local. After some live debugging, we concluded that both clang and zig generate almost identical LLVM IR for it, and @kubkon later (offline) verified that when we swap LLD for Apple's system linker, the problem disappears. @andrewrk also mentioned that he posted a question on LLVM-dev mailing list inquiring what the chances are of aarch64 MachO2 backend being ready for LLVM 12 release, and got a disheartening reply stating that "it's ready when it's ready". So yet another point in favour of ditching LLD and putting more effort into our own linker(s).
@andrewrk had a stab at answering that one and the answer is that we've got quite a few major hurdles to go over before that's done including comptime, panic handlers, etc. Specifically for Mach-O however, a good first achievable milestone would be to add support for extern in stage2 and link with libSystem functions instead of hand-writing print and exit in pure assembly as it is done now.

EDIT: the downstream tracking issue: ziglang/zig#7527
TODO @andrewrk could you write this one up as I AFK when you were discussing it?
The entire point boils down to whether we should worry about the generated Mach-O binaries be compatible with Apple's codesign tool, and the short answer is no. @andrewrk suggested that we implement the incremental linker in the most optimal and yet correct way (which BTW is not what Apple assumes in their codesign or more specifically codesign_allocate tool), and see if our users are OK with it, and perhaps, with a bit of luck, have Apple fix their incorrect assumptions in `codesign_allocate.

2020-12-17

Agenda

1. Simplify translate-c implementation by introducing a new pseudo-ast data structures.

See #6710 for more context. This discussion point is championed by @andrewrk.

2. Middle-level IR.

It seems like this could be modeled as a sequence of lowering and optimization passes. So something like parse to ast -> comptime and semantic analysis -> runtime semantic IR -> backend agnostic optimizations -> backend -> backend-specific IR -> backend specific optimizations -> assembly.

The individual backends could do multiple passes of lowering followed by optimization, some of which are shared. So you could make a "register machine"-specific IR and passes, and a "stack machine"-specific IR and passes, and each backend could choose to run the semantic IR through one of those shared lowering + optimization processes before doing really specific backend optimizations.

The problem with the value type you showed is neatly solved by this. The problem is that you need to combine some low-level state about registers with high-level semantic state, because you're not able to go directly to assembly. This solves that by introducing a new IR ("register machine IR") which gets produced from the semantic IR. The code that produces this IR can be shared between any register machine backends, and then they can use the results in different ways to generate machine code. Backends which aren't register machines, like wasm or the JVM, can do something different with the optimized semantic IR to produce their code. And since the semantic IR doesn't have any machine specifics in it, there aren't any problems with incompatibility.

This discussion point is championed by @SpexGuy.

3. LLVM-backend for the self-hosted.

Will include a demo. This discussion point is championed by @FireFox317.

Meeting notes

meeting minutes:

6710 is accepted
@Specs_guy 's middle level IR idea is good and we're gonna try to do it
@FireFox317 showed an impressive LLVM backend demo, looking forward to merging that. We discussed how it would integrate into the rest of the compiler design
@Vexu brought up 7146. I'm gonna review it, ponder it, and then we're going to do a 1:1 follow up discussion to figure stuff out, specifically regarding how to do incremental compilation

re: @Specs_guy idea, it might be a nice idea to learn about https://mlir.llvm.org/ as inspiration. we're not going to depend on it because that would be yet another dependency on a c++ llvm project thing, but it's tackling the same problem, so it could have some important ideas for us to consider

2020-12-10

Agenda

Identify everything that is linking-dependent codegen and linking-independent codegen
- Linking-dependent: function calls, access to global variables(?)
- Linking-independent: everything else ?
Clean distinction between architecture-independent stuff and architecture-specific stuff
- Encourage deduplication of code
- Make it very contributer-friendly to add and improve backends
MCValue needs to be completely architecture-dependent
- Some entries can be considered universal and architecture-independent (embedded_in_code, memory)
- Entries that are not universal: Register (no registers, general purpose, floating point, etc.) and more
- MCValue{ .memory = ... } is used to populate and reference the GOT via absolute addressing (e.g., movabsq on x86_64); however, for platforms that require PIE such macOS, we need to fallback to RIP-relative/PC-relative addressing, therefore should .memory be re-used for PIEs or should we come up with an additional/alternative to it?
Consider supporting architechtures without general purpose registers (stack-based, C backend)
codegen.zig should not contain any architecture-dependent code, abstract only over stuff that is shared in all backends
What should belong into semantic analysis, what should belong into codegen?
- Architecture-independent optimizations (arithmetic expressions (strength reduction), unreachable optimizations, constant propagation, dead code elimination, loop unrolling, etc.): after sema and before codegen?
- Generating checks for overflow in debug builds: sema or codegen?
(not technically 100% codegen) What optimizations do we want to include?
- e.g. ARM: conditional instructions

Meeting Notes

For the kickoff meeting, the agenda was largely driven by @joachimschmidt57 who came up with this brilliant discussion write-up you can find above. Below you can find a summary of the discussion for each set of bullets from the agenda.

Identify everything that is linking-dependent codegen and linking-independent codegen

Linking-dependent: function calls, access to global variables(?)

Linking-independent: everything else ?

Linking and codegen are pretty tightly knit together (to better facilitate incremental linking). @joachimschmidt57 suggests we should split the linking-dependent from linking-independent bits in codegen.zig. Some examples:

function calls - are always linking-dependent
arithmetic calls - should be linking-independent

Consensus here was that (in the long run) we should audit the codegen/linking codebase and identify everything that is linking-dependent. This should help us come up with a plan on how to pull shared bits together and reorganise the codebase in general.

Clean distinction between architecture-independent stuff and architecture-specific stuff

Encourage deduplication of code

Make it very contributer-friendly to add and improve backends

MCValue needs to be completely architecture-dependent

Some entries can be considered universal and architecture-independent (embedded_in_code, memory)

Entries that are not universal: Register (no registers, general purpose, floating point, etc.) and more

MCValue{ .memory = ... } is used to populate and reference the GOT via absolute addressing (e.g., movabsq on x86_64); however, for platforms that require PIE such macOS, we need to fallback to RIP-relative/PC-relative addressing, therefore should .memory be re-used for PIEs or should we come up with an additional/alternative to it?

Consider supporting architechtures without general purpose registers (stack-based, C backend)

codegen.zig should not contain any architecture-dependent code, abstract only over stuff that is shared in all backend

@joachimschmidt57 suggested we should move architecture-specific codegen bits (where appropriate) into their own codegen modules, for instance, ARM64 codegen-specific functionality would land in codegen/aarch64.zig. The general idea here would be for each architecture to have their own version of MCValue. One motivation for this is the fact that ARM64, in addition to general-purpose register, also has multiple special-purposed floating-point registers. Furthermore, we should also support register-free architecture and the current version of MCValue makes it somewhat clunky since register-free architectures will simply panic/error out on certain MCValue fields when switching over them, etc. @andrewrk supports this premise, however, as also pointed out by @ifreund and @luukdegram, we should be careful not to optimise prematurely since we haven't advanced the stage2 enough yet to have explored what MCValue would require for each architecture. The timing for this is perfect since @luukdegram only recently refactored Wasm32 to use the common codepath as other architectures: ziglang/zig#7321.

What should belong into semantic analysis, what should belong into codegen?

Architecture-independent optimizations (arithmetic expressions (strength reduction), unreachable optimizations, constant propagation, dead code elimination, loop unrolling, etc.): after sema and before codegen?

Generating checks for overflow in debug builds: sema or codegen?

(not technically 100% codegen) What optimizations do we want to include?

e.g. ARM: conditional instructions

We should identify which stuff goes in semantic analysis (sema) and which in codegen. For example, generating checks for overflows. @andrewrk said it should go in the sema.

Furthermore, @andrewrk said the plan for this is as follows:

first big step, make a simple, maintainable compiler that passes all the tests
optimise
design direction is for less memory usage in opcodes; ZIR can be architecture-dependent by design if it solves a particular problem for a particular architecture/platform
comptime knows the target architecture and immitates it -> sema is very aware of the target

@joachimschmidt57 asked if the optimisations should go after sema but before codegen in the pipeline. @andrewrk said that we don't have any yet they indeed should go between sema and codegen; optimisations take typed IR instructions as the input, and give back optimised typed IR instructions as the output. What optimisations do we want to include? For debug builds, none. No LLVM but Release mode (this would be our self-hosted backend) -> our hand-crafted optimisations. This might be a totally different codegen path. It is important to note that the focus for 1.0 release is to have self-hosted in Debug, and LLVM-backed in Release.

If we figure out some good optimisations, it would be beneficial to insert them between codegen and the LLVM backend; e.g., async functions etc.