Mirrors/rust - rust - Gitea @ Femelysm.ru

mirror of https://github.com/rust-lang/rust.git synced 2026-05-30 21:16:27 +03:00

Author	SHA1	Message	Date
Jonathan Brouwer	8162cf582b	Rollup merge of #156242 - Jamesbarford:feat/remove-alwaysinline+target-feature, r=RalfJung,saethlin Remove unsound `target_feature_inline_always` feature ## Summary - Remove `target_feature_inline_always` - Update stdarch generators to only use `#[inline]` & regenerate stdarch. ## Why? Succinctly; the feature relies on LLVMs `AlwaysInlinerPass()` running before LLVMs heuristic based inliner pass. Which is not a basis for sound code. This has been discussed in [the tracking issue](https://github.com/rust-lang/rust/issues/145574). If the ordering of the passes were to change, of which they have in the past, it is very possible we could inline functions across callsites with mismatching target features leading to unsound code. Checks proposed in; https://github.com/rust-lang/rust/pull/155426 would only take into account caller -> callee which is not enough to guard against possibly of generating unsound code if the pass ordering were to change. There doesn't seem to be a way, presently, this this mechanism to provide soundness guarantees nor does it seem like `AlwaysInlinerPass()` is a desired feature of LLVM, which this feature relies on. r? @RalfJung	2026-05-21 12:21:45 +02:00
Jonathan Brouwer	ff42fc00d4	Rollup merge of #156208 - BorrowSanitizer:codegen-emit-retag-1, r=saethlin Emit retags in codegen to support BorrowSanitizer (part 1) Tracking issue: rust-lang/rust#154760 [Zulip Thread](https://rust-lang.zulipchat.com/#narrow/channel/131828-t-compiler/topic/Staging.20for.20emitting.20retags.20in.20codegen/with/593004012) This is the first of several PRs that will add experimental support for emitting retags as function calls in codegen. Each PR will be a minimal, improved slice of the changes in rust-lang/rust#155965. This PR contains all of the changes that will affect codegen backends. It adds methods to `IntrinsicCallBuilderMethods` for emitting each kind of retag call. cc: @RalfJung	2026-05-17 15:52:36 +02:00
Ian McCormack	e15861bdd2	Add trait methods for experimental retags to cg.	2026-05-15 12:34:03 -04:00
James Barford-Evans	405214dd92	PR feedback - revert `fn inline_attr(...)` to how it was prior to inlinealways + target feature changes, factoring in subsequent codebase changes	2026-05-15 10:30:16 +01:00
Guy Bedford	45adab87a5	Pass user-provided llvm_args after target spec llvm_args. This ensures that user arguments can override target arguments without being silently ignored.	2026-05-13 12:16:25 -07:00
Jonathan Brouwer	0705b602f0	Rollup merge of #155815 - djc:swift-cc, r=jieyouxu Add Swift function call ABI Adds an unstable `extern "Swift"` ABI behind the `abi_swift` feature gate, mapping to LLVM's `swiftcc` calling convention. This is only allowed (a) for `is_darwin_like` targets, since the [ABI is only stable for those platforms](https://www.swift.org/blog/abi-stability-and-more/) and (b) with the LLVM backend, since the other backends don't support it. Current approaches to interoperability with Swift lower to Objective-C (or require a Swift stub exposing a C ABI), but that is an optional mapping on the Swift side that some newer Apple frameworks omit. It would be great to be able to more directly/natively be able to call into Swift code directly via its stable API (on Apple platforms at least). Reimplements rust-lang/rust#64582 on top of current main. The main objection to the previous PR seemed to be that it needed an RFC, but there was pushback (which seems sensible to me) that an RFC could be deferred until stabilization. I think this needs a tracking issue? Would be happy to write one up if/when there is a consensus that this will be merged.	2026-05-13 15:16:17 +02:00
Jonathan Brouwer	8edd846028	Rollup merge of #156472 - stephenduong1004:print-stats-json, r=nikomatsakis Add support for Zprint-codegen-stats-json Add a flag `-Zprint-codegen-stats-json=<file>` to collect and write LLVM statistics in JSON format. It makes use of the function `llvm::PrintStatisticsJSON` in LLVM. The flag currently only obtains LLVM statistics for now, but could be used to collect more front-end statistics in the future.	2026-05-13 11:46:37 +02:00
khyperia	57b3e84ed4	Unnormalized migration: struct_tail takes fn taking Unnormalized	2026-05-12 20:45:13 +02:00
khyperia	869c941be9	Unnormalized migration: introduce assert_fully_normalized	2026-05-12 20:45:13 +02:00
Dirkjan Ochtman	f49e45101c	Add Swift function call ABI Adds an unstable `extern "Swift"` ABI behind the `abi_swift` feature gate, mapping to LLVM's `swiftcc` calling convention. Cranelift and GCC backends fall back to the platform default since they have no equivalent.	2026-05-12 17:09:19 +02:00
Stephen	8477a690b1	add support for Zprint-codegen-stats-json	2026-05-11 16:28:51 -04:00
Tim Neumann	8bd0b7a7a2	LLVM 23: Specify `returnaddress` intrinsic return type	2026-05-11 18:33:11 +02:00
Jonathan Brouwer	5246f6e4bc	Rollup merge of #156323 - bjorn3:clif_has_mnemonic, r=wesleywiser Handle --print=backend-has-mnemonic in cg_clif And introduce a `has_mnemonic` method on `CodegenBackend` just like `--print=backend-has-zstd`.	2026-05-10 19:05:44 +02:00
Jonathan Brouwer	30ec142b97	Rollup merge of #154972 - chorman0773:return_address, r=Mark-Simulacrum Implement `core::arch::return_address` and tests Tracking issue: rust-lang/rust#154966 Implements libs-team#768	2026-05-10 19:05:42 +02:00
bjorn3	c2ab95fb61	Handle --print=backend-has-mnemonic in cg_clif And introduce a has_mnemonic method on CodegenBackend just like --print=backend-has-zstd	2026-05-08 16:08:56 +02:00
Nicholas Nethercote	33cd10a881	Rename `rustc_data_structures::stable_hasher` as `stable_hash`. Part of MCP 983.	2026-05-08 15:10:33 +10:00
bors	ffccab6abe	Auto merge of #156278 - JonathanBrouwer:rollup-O8O1IcI, r=JonathanBrouwer Rollup of 10 pull requests Successful merges: - rust-lang/rust#146273 (lint ImproperCTypes: refactor linting architecture (part 2)) - rust-lang/rust#154025 (Add `keepalive`, `set_keepalive` to `TcpStream` implementations) - rust-lang/rust#156024 (CFI: Fix LTO for `#![no_builtins]` crates with CFI) - rust-lang/rust#156243 (Move CrateInfo computation after codegen_crate) - rust-lang/rust#154846 (Add better default spans for the `Ty` impl of `QueryKey`) - rust-lang/rust#155220 (cg_clif: Don't show verbose run-make cmd output for passing tests) - rust-lang/rust#156204 (Implemented `PathBuf::into_string`) - rust-lang/rust#156245 (Move invocation_temp into OutputFilenames) - rust-lang/rust#156250 (add a few new solver normalization tests) - rust-lang/rust#156265 (Remove unused `ToStableHashKey` impls.)	2026-05-07 14:59:29 +00:00
Jonathan Brouwer	a48901b65b	Rollup merge of #156245 - bjorn3:move_invocation_temp, r=oli-obk Move invocation_temp into OutputFilenames While it was previously defined in Session, it is only ever used with OutputFilenames methods.	2026-05-07 14:13:54 +02:00
Jonathan Brouwer	c5941a03a2	Rollup merge of #156243 - bjorn3:lto_refactors18, r=mu001999 Move CrateInfo computation after codegen_crate CrateInfo is only necessary during linking and non-local LTO. Part of https://github.com/rust-lang/compiler-team/issues/908	2026-05-07 14:13:51 +02:00
James Barford-Evans	c6d2da90b5	Remove #[inline(always)] + #[target_feature(enable = "<feature>")] support, reinstating error message and providing a link to the tracking issue for the full rationale.	2026-05-07 08:28:09 +01:00
Jacob Pratt	e6fac7e506	Rollup merge of #156202 - maurer:splat, r=nikic llvm: Use correct type for splat mask After llvm/llvm-project#195486 , LLVM has explicit handling for null pointers in simd operations instead of using special handling based on zeroes. This causes LLVM with asserts enabled to detect an improperly typed mask passed to splat (if the output vector does not have i32 elements), and can cause SIGSEGV in more complex cases with asserts disabled. @rustbot label: +llvm-main r? @durin42	2026-05-07 02:12:12 -04:00
bjorn3	9297586cb9	Move invocation_temp into OutputFilenames While it was previously defined in Session, it is only ever used with OutputFilenames methods.	2026-05-06 15:01:54 +00:00
bjorn3	da6e4d4890	Move CrateInfo computation after codegen_crate CrateInfo is only necessary during linking and non-local LTO.	2026-05-06 14:51:38 +00:00
Matthew Maurer	3c83d3adc9	llvm: Use correct type for splat mask After llvm/llvm-project#195486 , LLVM has explicit handling for null pointers in simd operations instead of using special handling based on zeroes. This causes LLVM with asserts enabled to detect an improperly typed mask passed to splat (if the output vector does not have i32 elements), and can cause SIGSEGV in more complex cases with asserts disabled.	2026-05-05 19:14:39 +00:00
bors	740679e1f5	Auto merge of #154861 - mehdiakiki:fix/rlib-digest, r=petrochenkov Add rlib digest to identify Rust object files This adds a metadata entry to `rlib` archives that lists which members are Rust object files instead of relying on the filename heuristic in `looks_like_rust_object.file`. I also added a fallback to the old behavior for `rlibs` built by older compilers. Part of https://github.com/rust-lang/rust/issues/138243.	2026-05-05 16:26:33 +00:00
mehdiakiki	9606b0bd77	Add rlib digest to identify Rust object files	2026-05-04 21:55:47 -04:00
Guillaume Gomez	be1e98b06e	Rollup merge of #156043 - folkertdev:c-variadic-avr-assembly-test, r=joshtriplett c-variadic: gate `va_arg` on `c_variadic_experimental_arch` tracking issue: https://github.com/rust-lang/rust/issues/44930 Just gating `...` is insufficient because we make the types available everywhere, and you could still define and export functions that used va_arg for targets where we don't want to stably support it. r? joshtriplett	2026-05-05 02:50:09 +02:00
Folkert de Vries	8e0ebb9044	c-variadic: gate `va_arg` on `c_variadic_experimental_arch` Just gating `...` is insufficient because we make the types available everywhere, and you could still define and export functions that used va_arg for targets where we don't want to stably support it.	2026-05-04 10:03:05 +02:00
Nicholas Nethercote	e7d28d3e9b	Rename the `HashStable` trait/derives as `StableHash`. Specifically: - `HashStable` -> `StableHash` (trait) - `HashStable` -> `StableHash` (derive) - `HashStable_NoContext` -> `StableHash_NoContext` (derive) Note: there are some names in `compiler/rustc_macros/src/hash_stable.rs` that are still to be renamed, e.g. `HashStableMode`. Part of MCP 983.	2026-05-02 10:16:40 +02:00
Nicholas Nethercote	851049cf58	Rename the `hash_stable` method as `stable_hash`. Part of MCP 983.	2026-05-02 10:15:59 +02:00
Jonathan Brouwer	6d8e1250ab	Rollup merge of #155995 - Mrmaxmeier:debuginfo-embed-external-source, r=oli-obk -Zembed-source: also embed external source Hi, I've been using `-Zembed-source` for a while and noticed that some sources are not embedded when compiling in release mode. See the minimal reproducer below for missing embedded source code. The current implementation only emits source from the `src` field in `SourceFile`, but for multi-codegen-unit scenarios the source might only be available via the `external_src` field. I've updated the LLVM and Cranelift backends to fall back to `external_src` if `src` is not available. Minimal reproducer: ```console $ cat Cargo.toml [package] name = "repro" version = "0.0.1" edition = "2021" [profile.release] strip = "none" debug = true $ cat src/lib.rs // marker comment pub fn foo() -> u64 { 42 } $ cat src/main.rs fn main() { println!("{}", repro::foo()); } $ RUSTFLAGS="-g -Zembed-source=yes -Zdwarf-version=5" cargo build -r Compiling repro v0.0.1 (/tmp/repro) Finished `release` profile [optimized + debuginfo] target(s) in 0.08s $ # Note: Source for lib.rs is missing here: $ llvm-dwarfdump --debug-line target/release/repro \| grep "marker comment" $ RUSTFLAGS="-g -Zembed-source=yes -Zdwarf-version=5" cargo +stage1 build -r Finished `release` profile [optimized + debuginfo] target(s) in 0.04s $ llvm-dwarfdump --debug-line target/release/repro \| grep "marker comment" source: "// marker comment\npub fn foo() -> u64 { 42 }\n" ``` Thanks!	2026-05-01 13:10:36 +02:00
Jacob Pratt	560cc23c95	Rollup merge of #156003 - bjorn3:lto_refactors17, r=lqd Pass Session to optimize_and_codegen_fat_lto This is necessary to fix incremental LTO in cg_gcc as well as to do some LTO refactorings I want to do. The actual fix for cg_gcc will be done on the cg_gcc repo to test it in CI.	2026-04-30 22:28:29 -04:00
Jacob Pratt	fae8c6edd1	Rollup merge of #155974 - folkertdev:c-variadic-experimental-arch, r=joshtriplett add `c_variadic_experimental_arch` feature tracking issue: https://github.com/rust-lang/rust/issues/155973 Based on https://hackmd.io/pIbUgMQuQcGaibJcinOcEw#Stabilize-c-variadic-function-definitions-rust155697, we'll gate niche targets where we don't control the implementation of `va_arg`, the ABI is unclear, or in general where we're not confident stabilizing the implementation.	2026-04-30 22:28:28 -04:00
Jacob Pratt	0649c3f5d5	Rollup merge of #155249 - hoodmane:wasm-panic-in-cleanup-reland, r=bjorn3 Fix: On wasm targets, call `panic_in_cleanup` if panic occurs in cleanup Relies on https://github.com/rust-lang/llvm-project/pull/194. Reland of https://github.com/rust-lang/rust/pull/151771. Previously this was not correctly implemented. Each funclet may need its own terminate block, so this changes the `terminate_block` into a `terminate_blocks` `IndexVec` which can have a terminate_block for each funclet. We key on the first basic block of the funclet -- in particular, this is the start block for the old case of the top level terminate function. Rather than using a catchswitch/catchpad pair, I used a cleanuppad. The reason for the pair is to avoid catching foreign exceptions on MSVC. On wasm, it seems that the catchswitch/catchpad pair is optimized back into a single cleanuppad and a catch_all instruction is emitted which will catch foreign exceptions. Because the new logic is only used on wasm, it seemed better to take the simpler approach seeing as they do the same thing. - [ ] Add test for https://github.com/rust-lang/rust/issues/153948	2026-04-30 22:28:24 -04:00
bjorn3	c5e38b741d	Pass Session to optimize_and_codegen_fat_lto This is necessary to fix incremental LTO in cg_gcc as well as to do some LTO refactorings I want to do. The actual fix for cg_gcc will be done on the cg_gcc repo to test it in CI.	2026-04-30 14:18:33 +00:00
Mrmaxmeier	84daca4fdb	debuginfo: embed external source This allows embedding source code even when it's only available via previously emitted metadata files. Without this, embedded source availability is inconsistent, especially in release builds.	2026-04-30 12:35:13 +02:00
Folkert de Vries	41796ecf1c	add `c_variadic_experimental_arch` feature	2026-04-30 00:47:30 +02:00
Ralf Jung	ad7ddbc08e	simd_reduce_min/max: remove float support	2026-04-29 13:48:45 +02:00
Shoyu Vanilla (Flint)	5d26634476	Rollup merge of #152443 - kjetilkjeka:nvptx_drop_support_old_hw_and_isa, r=ZuseZ4 NVPTX: Drop support for old architectures and old ISAs This is the implementation of [this MCP](https://github.com/rust-lang/compiler-team/issues/965#issuecomment-3837320262) I believe it was said that no FCP was needed, but if that is incorrect then the FCP is anyway scheduled to finish in 2 days so it can in any case be merged then.	2026-04-29 10:40:46 +09:00
Jonathan Brouwer	c6912bf401	Rollup merge of #155692 - fneddy:fix_naked-dead-code-elimination, r=folkertdev disable naked-dead-code-elimination test if no RET mnemonic is available this test emit x86_64 specific ret asm instruction and should not be compiled on any other arch.	2026-04-28 20:24:33 +02:00
Hood Chatham	b072d24e26	Fix: On wasm targets, call `panic_in_cleanup` if panic occurs in cleanup Previously this was not correctly implemented. Each funclet may need its own terminate block, so this changes the `terminate_block` into a `terminate_blocks` `IndexVec` which can have a terminate_block for each funclet. We key on the first basic block of the funclet -- in particular, this is the start block for the old case of the top level terminate function. Rather than using a catchswitch/catchpad pair, I used a cleanuppad. The reason for the pair is to avoid catching foreign exceptions on MSVC. On wasm, it seems that the catchswitch/catchpad pair is optimized back into a single cleanuppad and a catch_all instruction is emitted which will catch foreign exceptions. Because the new logic is only used on wasm, it seemed better to take the simpler approach seeing as they do the same thing.	2026-04-28 09:56:22 -07:00
Eddy (Eduard) Stefes	2a8e588c90	Add `--print=backend-has-mnemonic` and `needs-asm-mnemonic` directive Add infrastructure to query LLVM backend for specific assembly mnemonics and use it in compiletest to conditionally run tests based on instruction availability. This fixes test failures with naked-dead-code-elimination which requires the `RET` mnemonic. Co-authored-by: Folkert de Vries <flokkievids@gmail.com>	2026-04-28 10:21:15 +02:00
Kjetil Kjeka	a2d23cec5f	NVPTX: Drop support for old hw and old ISAs	2026-04-28 08:48:22 +02:00
Connor Horman	4b5d5449d2	Implement `core::arch::return_address` and tests * Implement core::arch::return_address and tests Fix typo Apply suggestions from code review Wording/docs changes. Co-authored-by: Ralf Jung <post@ralfj.de> Change signature according to Ralf's comment Fix call to `core::intrinsics::return_address()` according to the new signature Add cranelift implementation for intrinsic Change wording on `return_address!()` to be clear that returning a null pointer is best-effort. Fix formatting of doc comment Fix mistake in cranelift codegen * Do not generate llvm.returnaddress on wasm * Supress return_address test on miri	2026-04-26 11:56:36 +00:00
Jonathan Brouwer	dde4886801	Rollup merge of #146181 - Flakebi:dynamic-shared-memory, r=ZuseZ4,Sa4dus,workingjubilee,RalfJung,nikic,kjetilkjeka,kulst Add intrinsic for launch-sized workgroup memory on GPUs Workgroup memory is a memory region that is shared between all threads in a workgroup on GPUs. Workgroup memory can be allocated statically or after compilation, when launching a gpu-kernel. The intrinsic added here returns the pointer to the memory that is allocated at launch-time. # Interface With this change, workgroup memory can be accessed in Rust by calling the new `gpu_launch_sized_workgroup_mem<T>() -> *mut T` intrinsic. It returns the pointer to workgroup memory guaranteeing that it is aligned to at least the alignment of `T`. The pointer is dereferencable for the size specified when launching the current gpu-kernel (which may be the size of `T` but can also be larger or smaller or zero). All calls to this intrinsic return a pointer to the same address. See the intrinsic documentation for more details. ## Alternative Interfaces It was also considered to expose dynamic workgroup memory as extern static variables in Rust, like they are represented in LLVM IR. However, due to the pointer not being guaranteed to be dereferencable (that depends on the allocated size at runtime), such a global must be zero-sized, which makes global variables a bad fit. # Implementation Details Workgroup memory in amdgpu and nvptx lives in address space 3. Workgroup memory from a launch is implemented by creating an external global variable in address space 3. The global is declared with size 0, as the actual size is only known at runtime. It is defined behavior in LLVM to access an external global outside the defined size. There is no similar way to get the allocated size of launch-sized workgroup memory on amdgpu an nvptx, so users have to pass this out-of-band or rely on target specific ways for now. Tracking issue: rust-lang/rust#135516	2026-04-25 23:07:48 +02:00
Flakebi	13ec3de673	Add intrinsic for launch-sized workgroup memory on GPUs Workgroup memory is a memory region that is shared between all threads in a workgroup on GPUs. Workgroup memory can be allocated statically or after compilation, when launching a gpu-kernel. The intrinsic added here returns the pointer to the memory that is allocated at launch-time. # Interface With this change, workgroup memory can be accessed in Rust by calling the new `gpu_launch_sized_workgroup_mem<T>() -> *mut T` intrinsic. It returns the pointer to workgroup memory guaranteeing that it is aligned to at least the alignment of `T`. The pointer is dereferencable for the size specified when launching the current gpu-kernel (which may be the size of `T` but can also be larger or smaller or zero). All calls to this intrinsic return a pointer to the same address. See the intrinsic documentation for more details. ## Alternative Interfaces It was also considered to expose dynamic workgroup memory as extern static variables in Rust, like they are represented in LLVM IR. However, due to the pointer not being guaranteed to be dereferencable (that depends on the allocated size at runtime), such a global must be zero-sized, which makes global variables a bad fit. # Implementation Details Workgroup memory in amdgpu and nvptx lives in address space 3. Workgroup memory from a launch is implemented by creating an external global variable in address space 3. The global is declared with size 0, as the actual size is only known at runtime. It is defined behavior in LLVM to access an external global outside the defined size. There is no similar way to get the allocated size of launch-sized workgroup memory on amdgpu an nvptx, so users have to pass this out-of-band or rely on target specific ways for now.	2026-04-24 10:03:45 +02:00
Folkert de Vries	797059769e	c-variadic: fix for sparc64 validated versus https://godbolt.org/z/qrM37rY6n	2026-04-23 14:46:44 +02:00
Folkert de Vries	e9ab558406	va_arg: use `emit_ptr_va_arg` for mips	2026-04-23 01:20:59 +02:00
Jonathan Brouwer	e9972d9dec	Rollup merge of #155622 - folkertdev:va-arg-llvm-fixes, r=tgross35 c-variadic: `va_arg` fixes tracking issue: https://github.com/rust-lang/rust/issues/44930 extracts some generic LLVM `va_arg` fixes from https://github.com/rust-lang/rust/pull/152576 and https://github.com/rust-lang/rust/pull/155429. r? tgross35	2026-04-22 19:18:30 +02:00
Folkert de Vries	5149a31515	change `llvm.ptrmask` argument to `isize` we were saying that the type is i32, but would often provide an i64. That never failed so far, but starts failing (like, crashing LLVM) when working with 128-bit values that are 16-byte aligned. So, we may as well use the more robust approach now.	2026-04-22 10:32:27 +02:00

1 2 3 4 5 ...

3439 Commits