zig - General-purpose programming language and toolchain for maintaining robust, optimal, and reusable software. https://ziglang.org

Age	Commit message (Collapse)	Author
2023-09-21	compiler: fix compilation for 32-bit targets	Andrew Kelley

2023-09-21	compiler: move struct types into InternPool proper	Andrew Kelley
	Structs were previously using `SegmentedList` to be given indexes, but were not actually backed by the InternPool arrays. After this, the only remaining uses of `SegmentedList` in the compiler are `Module.Decl` and `Module.Namespace`. Once those last two are migrated to become backed by InternPool arrays as well, we can introduce state serialization via writing these arrays to disk all at once. Unfortunately there are a lot of source code locations that touch the struct type API, so this commit is still work-in-progress. Once I get it compiling and passing the test suite, I can provide some interesting data points such as how it affected the InternPool memory size and performance comparison against master branch. I also couldn't resist migrating over a bunch of alignment API over to use the log2 Alignment type rather than a mismash of u32 and u64 byte units with 0 meaning something implicitly different and special at every location. Turns out you can do all the math you need directly on the log2 representation of alignments.
2023-09-13	elf: do not store Symbol's index in Symbol	Jakub Konka

2023-09-13	Merge pull request #17113 from ziglang/elf-linker	Jakub Konka
	elf: upstream zld/ELF functionality, part 1
2023-09-12	InternPool: prevent anon struct UAF bugs with type safety	Andrew Kelley
	Instead of using actual slices for InternPool.Key.AnonStructType, this commit changes to use Slice types instead, which store a long-lived index rather than a pointer. This is a follow-up to 7ef1eb1c27754cb0349fdc10db1f02ff2dddd99b.
2023-09-12	elf: add simplistic reloc scanning mechanism	Jakub Konka

2023-09-08	elf: store GOT index in symbol extra array; use GotSection for GOT	Jakub Konka

2023-09-06	elf: make everything upside down - track by Symbol.Index rather than Atom.Index	Jakub Konka

2023-09-04	elf: simplify accessors to symbols, atoms, etc	Jakub Konka

2023-08-22	compiler: move unions into InternPool	Andrew Kelley
	There are a couple concepts here worth understanding: Key.UnionType - This type is available before resolving the union's fields. The enum tag type, number of fields, and field names, field types, and field alignments are not available with this. InternPool.UnionType - This one can be obtained from the above type with `InternPool.loadUnionType` which asserts that the union's enum tag type has been resolved. This one has all the information available. Additionally: * ZIR: Turn an unused bit into `any_aligned_fields` flag to help semantic analysis know whether a union has explicit alignment on any fields (usually not). * Sema: delete `resolveTypeRequiresComptime` which had the same type signature and near-duplicate logic to `typeRequiresComptime`. - Make opaque types not report comptime-only (this was inconsistent between the two implementations of this function). * Implement accepted proposal #12556 which is a breaking change.
2023-07-31	Fix integer overflow in field padding calculation	antlilja
	The old code was iterating and generating symbols for fields in their declared order instead of the memory optimized order while getting offsets in the memory optimized order.
2023-07-21	codegen: writer().writeByteNTimes -> appendNTimes	r00ster91
	Both ways do the same thing but I think the compiler might have an easier time optimizing `appendNTimes` because it does less things/the path is shorter. I have not done any benchmarking at runtime but have compared the instruction count of both ways a little here: https://zig.godbolt.org/z/vr193W9oj `b` (`appendNTimes`) is ~103 instructions while `a` (`writer().writeByteNTimes`) is ~117 instructions. And looking at the implementation of `writeByteNTimes`, it only seems to buffer up 256 bytes before doing another `writeAll` which for `std.ArrayList` probably means another allocation, whereas when directly using `appendNTimes`, the entire exact additional capacity required is known from the start. Either way, this would be more consistent anyway.
2023-07-18	rework generic function calls	Andrew Kelley
	Abridged summary: * Move `Module.Fn` into `InternPool`. * Delete a lot of confusing and problematic `Sema` logic related to generic function calls. This commit removes `Module.Fn` and replaces it with two new `InternPool.Tag` values: * `func_decl` - corresponding to a function declared in the source code. This one contains line/column numbers, zir_body_inst, etc. * `func_instance` - one for each monomorphization of a generic function. Contains a reference to the `func_decl` from whence the instantiation came, along with the `comptime` parameter values (or types in the case of `anytype`) Since `InternPool` provides deduplication on these values, these fields are now deleted from `Module`: * `monomorphed_func_keys` * `monomorphed_funcs` * `align_stack_fns` Instead of these, Sema logic for generic function instantiation now unconditionally evaluates the function prototype expression for every generic callsite. This is technically required in order for type coercions to work. The previous code had some dubious, probably wrong hacks to make things work, such as `hashUncoerced`. I'm not 100% sure how we were able to eliminate that function and still pass all the behavior tests, but I'm pretty sure things were still broken without doing type coercion for every generic function call argument. After the function prototype is evaluated, it produces a deduplicated `func_instance` `InternPool.Index` which can then be used for the generic function call. Some other nice things made by this simplification are the removal of `comptime_args_fn_inst` and `preallocated_new_func` from `Sema`, and the messy logic associated with them. I have not yet been able to measure the perf of this against master branch. On one hand, it reduces memory usage and pointer chasing of the most heavily used `InternPool` Tag - function bodies - but on the other hand, it does evaluate function prototype expressions more than before. We will soon find out.
2023-06-25	x86_64: add missing padding to global unions	Jacob Young

2023-06-25	x86_64: fix global pointers to packed struct fields	Jacob Young

2023-06-24	all: migrate code to new cast builtin syntax	mlugg
	Most of this migration was performed automatically with `zig fmt`. There were a few exceptions which I had to manually fix: * `@alignCast` and `@addrSpaceCast` cannot be automatically rewritten * `@truncate`'s fixup is incorrect for vectors * Test cases are not formatted, and their error locations change
2023-06-19	all: zig fmt and rename "@XToY" to "@YFromX"	Eric Joldasov
	Signed-off-by: Eric Joldasov <bratishkaerik@getgoogleoff.me>
2023-06-17	mem: rename alignGeneric to mem.align	Motiejus Jakštys
	Anecdote 1: The generic version is way more popular than the non-generic one in Zig codebase: git grep -w alignForward \| wc -l 56 git grep -w alignForwardGeneric \| wc -l 149 git grep -w alignBackward \| wc -l 6 git grep -w alignBackwardGeneric \| wc -l 15 Anecdote 2: In my project (turbonss) that does much arithmetic and alignment I exclusively use the Generic functions. Anecdote 3: we used only the Generic versions in the Macho Man's linker workshop.
2023-06-16	Merge pull request #16064 from Luukdegram/wasm-linker	Andrew Kelley
	wasm/linker: symbol resolution improvements
2023-06-16	codegen: fix union padding	Luuk de Gram
	This regressed during the internpool merges. This commit reinstates the padding logic for unions.
2023-06-16	plan9: revamp the relocation system to allow decl refs	Jacob G-W

2023-06-16	Plan9: Add support for lazy symbols	Jacob G-W
	This includes a renaming from DeclBlock to Atom.
2023-06-10	InternPool: fix yet more key lifetime issues	Jacob Young

2023-06-10	compiler: eliminate Decl.value_arena and Sema.perm_arena	Andrew Kelley
	The main motivation for this commit is eliminating Decl.value_arena. Everything else is dominoes. Decl.name used to be stored in the GPA, now it is stored in InternPool. It ended up being simpler to migrate other strings to be interned as well, such as struct field names, union field names, and a few others. This ended up requiring a big diff, sorry about that. But the changes are pretty nice, we finally start to take advantage of InternPool's existence. global_error_set and error_name_list are simplified. Now it is a single ArrayHashMap(NullTerminatedString, void) and the index is the error tag value. Module.tmp_hack_arena is re-introduced (it was removed in eeff407941560ce8eb5b737b2436dfa93cfd3a0c) in order to deal with comptime_args, optimized_order, and struct and union fields. After structs and unions get moved into InternPool properly, tmp_hack_arena can be deleted again.
2023-06-10	codegen: fix doubled global sentinels	Jacob Young

2023-06-10	InternPool: remove memoized_decl	Andrew Kelley
	This is neither a type nor a value. Simplifies `addStrLit` as well as the many places that switch on `InternPool.Key`. This is a partial revert of bec29b9e498e08202679aa29a45dab2a06a69a1e.
2023-06-10	InternPool: add representation for value of empty enums and unions	mlugg
	This is a bit odd, because this value doesn't actually exist: see #15909. This gets all the empty enum/union behavior tests passing. Also adds an assertion to `Sema.analyzeBodyInner` which would have helped figure out the issue here much more quickly.
2023-06-10	InternPool: improve hashing performance	Andrew Kelley
	Key.PtrType is now an extern struct so that hashing it can be done by reinterpreting bytes directly. It also uses the same representation for type_pointer Tag encoding and the Key. Accessing pointer attributes now requires packed struct access, however, many operations are now a copy of a u32 rather than several independent fields. This function moves the top two most used Key variants - pointer types and pointer values - to use a single-shot hash function that branches for small keys instead of calling memcpy. As a result, perf against merge-base went from 1.17x ± 0.04 slower to 1.12x ± 0.04 slower. After the pointer value hashing was changed, total CPU instructions spent in memcpy went from 4.40% to 4.08%, and after additionally improving pointer type hashing, it further decreased to 3.72%.
2023-06-10	x86_64: fix InternPool regressions	Jacob Young

2023-06-10	behavior: additional llvm fixes	Jacob Young

2023-06-10	Module: move memoized data to the intern pool	Jacob Young
	This avoids memory management bugs with the previous implementation.
2023-06-10	behavior: get more test cases passing with llvm	Jacob Young

2023-06-10	C backend: InternPool fixes	Andrew Kelley

2023-06-10	codegen: fix lowering of constant structs	Andrew Kelley

2023-06-10	InternPool: fix more crashes	Jacob Young

2023-06-10	Module: intern the values of decls when they are marked alive	Jacob Young
	I'm not sure if this is the right place for this to happen, and it should become obsolete when comptime mutation is rewritten and the remaining legacy value tags are remove, so keeping this as a separate revertable commit.
2023-06-10	InternPool: remove more legacy values	Jacob Young
	Reinstate some tags that will be needed for comptime init.
2023-06-10	InternPool: port most of value tags	Jacob Young

2023-06-10	compiler: move error union types and error set types to InternPool	Andrew Kelley
	One change worth noting in this commit is that `module.global_error_set` is no longer kept strictly up-to-date. The previous code reserved integer error values when dealing with error set types, but this is no longer needed because the integer values are not needed for semantic analysis unless `@errorToInt` or `@intToError` are used and therefore may be assigned lazily.
2023-06-10	compiler: eliminate legacy Type.Tag.pointer	Andrew Kelley
	Now pointer types are stored only in InternPool.
2023-06-10	stage2: move function types to InternPool	Andrew Kelley

2023-06-10	stage2: move enum tag values into the InternPool	Andrew Kelley
	I'm seeing a new assertion trip: the call to `enumTagFieldIndex` in the implementation of `@Type` is attempting to query the field index of an union's enum tag, but the type of the enum tag value provided is not the same as the union's tag type. Most likely this is a problem with type coercion, since values are now typed. Another problem is that I added some hacks in std.builtin because I didn't see any convenient way to access them from Sema. That should definitely be cleaned up before merging this branch.
2023-06-10	stage2: move enum types into the InternPool	Andrew Kelley
	Unlike unions and structs, enums are actually encoded into the InternPool directly, rather than using the SegmentedList trick. This results in them being quite compact, and greatly improved the ergonomics of using enum types throughout the compiler. It did however require introducing a new concept to the InternPool which is an "incomplete" item - something that is added to gain a permanent Index, but which is then mutated in place. This was necessary because enum tag values and tag types may reference the namespaces created by the enum itself, which required constructing the namespace, decl, and calling analyzeDecl on the decl, which required the decl value, which required the enum type, which required an InternPool index to be assigned and for it to be meaningful. The API for updating enums in place turned out to be quite slick and efficient - the methods directly populate pre-allocated arrays and return the information necessary to output the same compilation errors as before.
2023-06-10	stage2: move union types and values to InternPool	Andrew Kelley

2023-06-10	stage2: move struct types and aggregate values to InternPool	Andrew Kelley

2023-06-10	stage2: implement intTagType logic	Andrew Kelley
	This commit changes a lot of `const Module` to `Module` to make it work, since accessing the integer tag type of an enum might need to mutate the InternPool by adding a new integer type into it. An alternate strategy would be to pre-heat the InternPool with the integer tag type when creating an enum type, which would make it so that intTagType could accept a const Module instead of a mutable one, asserting that the InternPool already had the integer tag type.
2023-06-10	Replace uses of Value.zero, Value.one, Value.negative_one	mlugg
	This is a bit nasty, mainly because Type.onePossibleValue is now errorable, which is a quite viral change.
2023-06-10	stage2: move integer values to InternPool	Andrew Kelley

2023-06-10	InternPool: add a slice encoding	Andrew Kelley
	This uses the data field to reference its pointer field type, which allows for efficient and infallible access of a slice type's pointer type.
2023-06-10	stage2: move undef, unreach, null values to InternPool	Andrew Kelley