archive/ruby - Eplg Git: Free And Private Git Hosting

mirror of https://github.com/ruby/ruby.git synced 2025-08-15 05:29:10 +02:00

Author	SHA1	Message	Date
Jean Boussier	10aa4134d4	imemo_fields: store owner object in RBasic.klass It is much more convenient than storing the klass, especially when dealing with `object_id` as it allows to update the id2ref table without having to dereference the owner, which may be garbage at that point.	2025-08-13 19:53:18 +02:00
Jean Boussier	85c52079aa	set.c: Store `set_table->bins` at the end of `set_table->entries` This saves one pointer in `struct set_table`, which would allow `Set` objects to still fit in 80B TypedData slots even if RTypedData goes from 32B to 40B large. The existing set benchmark seem to show this doesn't have a very significant impact. Smaller sets are a bit faster, larger sets a bit slower. It seem consistent over multiple runs, but it's unclear how much of that is just error margin. ``` compare-ruby: ruby 3.5.0dev (2025-08-12T02:14:57Z master `428937a536`) +YJIT +PRISM [arm64-darwin24] built-ruby: ruby 3.5.0dev (2025-08-12T07:22:26Z set-entries-bounds da30024fdc) +YJIT +PRISM [arm64-darwin24] warming up........ \| \|compare-ruby\|built-ruby\| \|:------------------------\|-----------:\|---------:\| \|new_0 \| 15.459M\| 15.823M\| \| \| -\| 1.02x\| \|new_10 \| 3.484M\| 3.574M\| \| \| -\| 1.03x\| \|new_100 \| 546.992k\| 564.679k\| \| \| -\| 1.03x\| \|new_1000 \| 49.391k\| 48.169k\| \| \| 1.03x\| -\| \|aref_0 \| 18.643M\| 19.350M\| \| \| -\| 1.04x\| \|aref_10 \| 5.941M\| 6.006M\| \| \| -\| 1.01x\| \|aref_100 \| 822.197k\| 814.219k\| \| \| 1.01x\| -\| \|aref_1000 \| 83.230k\| 79.411k\| \| \| 1.05x\| -\| ```	2025-08-12 21:56:57 +02:00
Jean Boussier	1986d775cd	symbol.c: use `rb_gc_mark_and_move` over `rb_gc_location` The `p->field = rb_gc_location(p->field)` isn't ideal because it means all references are rewritten on compaction, regardless of whether the referenced object has moved. This isn't good for caches nor for Copy-on-Write. `rb_gc_mark_and_move` avoid needless writes, and most of the time allow to have a single function for both marking and updating references.	2025-08-07 21:00:00 +02:00
Jean Boussier	f3206cc79b	Struct: keep direct reference to IMEMO/fields when space allows It's not rare for structs to have additional ivars, hence are one of the most common, if not the most common type in the `gen_fields_tbl`. This can cause Ractor contention, but even in single ractor mode means having to do a hash lookup to access the ivars, and increase GC work. Instead, unless the struct is perfectly right sized, we can store a reference to the associated IMEMO/fields object right after the last struct member. ``` compare-ruby: ruby 3.5.0dev (2025-08-06T12:50:36Z struct-ivar-fields-2 9a30d141a1) +PRISM [arm64-darwin24] built-ruby: ruby 3.5.0dev (2025-08-06T12:57:59Z struct-ivar-fields-2 2ff3ec237f) +PRISM [arm64-darwin24] warming up..... \| \|compare-ruby\|built-ruby\| \|:---------------------\|-----------:\|---------:\| \|member_reader \| 590.317k\| 579.246k\| \| \| 1.02x\| -\| \|member_writer \| 543.963k\| 527.104k\| \| \| 1.03x\| -\| \|member_reader_method \| 213.540k\| 213.004k\| \| \| 1.00x\| -\| \|member_writer_method \| 192.657k\| 191.491k\| \| \| 1.01x\| -\| \|ivar_reader \| 403.993k\| 569.915k\| \| \| -\| 1.41x\| ``` Co-Authored-By: Étienne Barrié <etienne.barrie@gmail.com>	2025-08-06 17:07:49 +02:00
Jean Boussier	92688f7d57	variable.c: refactor accesses to the generic_fields_tbl All accesses to `generic_fields_tbl_` are now encapsulated inside: - `rb_obj_fields` - `rb_obj_set_fields` - `rb_obj_replace_fields`	2025-08-06 12:33:44 +02:00
Peter Zhu	95320f1ddf	Fix RUBY_FREE_AT_EXIT for static symbols Since static symbols allocate memory, we should deallocate them at shutdown to prevent memory leaks from being reported with RUBY_FREE_AT_EXIT.	2025-08-05 12:04:27 -04:00
Jean Boussier	f2a7e48dea	Make `RClass.cc_table` a managed object For now this doesn't change anything, but now that the table is managed by GC, it opens the door to use RCU when in multi-ractor mode, hence allow unsynchornized reads.	2025-08-01 10:42:04 +02:00
Takashi Kokubun	2cd10de330	ZJIT: Prepare for sharing JIT hooks with ZJIT (#14044 )	2025-07-30 10:11:10 -07:00
Takashi Kokubun	b22eb0e468	ZJIT: Add --zjit-stats (#14034 )	2025-07-29 10:00:15 -07:00
tomoya ishida	a66e4f2154	Improve performance of bignum[beg, len] (#14007 ) Implement rb_big_aref2. Taking a small slice from large bignum was slow in rb_int_aref2.	2025-07-29 16:34:13 +00:00
Jean Boussier	7ee127d2d1	Get rid of imemo_ast It has been marked as obsolete for a while and I see no reason to keep it.	2025-07-29 13:05:12 +02:00
John Hawthorn	7f25b8f5fb	Disable TSAN for rb_gc_mark_machine_context Previously this was listed as a suppression, but we actually want this permanently unsanitized. This should be faster and more reliable since TASN won't have to match against symbolicated backtraces.	2025-07-24 16:35:42 -07:00
Peter Zhu	f186f2cb70	Remove unused imemo_parser_strterm	2025-07-24 09:49:13 -04:00
Samuel Williams	64f508ade8	Support `cause:` in `Thread#raise` and `Fiber#raise`. (#13967 ) * Add support for `cause:` argument to `Fiber#raise` and `Thread#raise`. The implementation behaviour is consistent with `Kernel#raise` and `Exception#initialize` methods, allowing the `cause:` argument to be passed to `Fiber#raise` and `Thread#raise`. This change ensures that the `cause:` argument is handled correctly, providing a more consistent and expected behavior when raising exceptions in fibers and threads. [Feature #21360] * Shared specs for Fiber/Thread/Kernel raise. --------- Co-authored-by: Samuel Williams <samuel.williams@shopify.com>	2025-07-24 14:45:43 +12:00
John Hawthorn	9256442615	Cleanup M_TBL workarounds and comments Previously we had an assertion that the method table was only set on young objects, and a comment stating that was how it needed to be used. I think that confused the complexity of the write barriers that may be needed here. * Setting an empty M_TBL never needs a write barrier * T_CLASS and T_MODULE should always fire a write barrier to newly added methods * T_ICLASS only needs a write barrier to methods when RCLASSEXT_ICLASS_IS_ORIGIN(x) && !RCLASSEXT_ICLASS_ORIGIN_SHARED_MTBL(x) We shouldn't assume that the object being young is sufficient, because we also need write barriers for incremental marking and it's unreliable.	2025-07-23 14:33:55 -07:00
Peter Zhu	66349692f0	Introduce free function to rb_concurrent_set_funcs If we create a key but don't insert it (due to other Ractor winning the race), then it would leak memory if we don't free it. This introduces a new function to free that memory for this case.	2025-07-21 10:58:30 -04:00
Peter Zhu	2bcb155b49	Convert global symbol table to concurrent set	2025-07-21 10:58:30 -04:00
Peter Zhu	f05ee26a1f	Add rb_concurrent_set_find	2025-07-21 10:58:30 -04:00
Peter Zhu	9ef482bd13	Add rb_concurrent_set_size	2025-07-21 10:58:30 -04:00
Peter Zhu	f5312d8e7f	Make rb_concurrent_set_funcs const We should never modify rb_concurrent_set_funcs during runtime, so we can make it const.	2025-07-15 09:55:36 -04:00
Peter Zhu	b2a7b76992	Remove dead rb_cc_table_free	2025-07-14 11:11:47 -04:00
Peter Zhu	127cc425b7	Remove dead rb_cc_table_mark	2025-07-14 11:11:47 -04:00
Jeremy Evans	08d4f7893e	Rename some set_* functions to set_table_* These functions conflict with the planned C-API functions. Since they deal with the underlying set_table pointers and not Set instances, this seems like a more accurate name as well.	2025-07-11 15:24:23 +09:00
Jean Boussier	1fb4929ace	Make `rb_enc_autoload_p` atomic Using `encoding->max_enc_len` as a way to check if the encoding has been loaded isn't atomic, because it's not atomically set last. Intead we can use a dedicated atomic value inside the encoding table.	2025-07-10 17:18:20 +02:00
Jean Boussier	0bb44f291e	Rename `ractor_safe_set` into `concurrent_set` There's nothing ractor related in them, and the classic terminology for these sort of data structures is `concurrent-*`, e.g. concurrent hash.	2025-07-07 15:12:39 +02:00
Jean Boussier	517c106709	imemo_fields_set: save copying when reassigning a variable If we still fit in the existing imemo/fields object we can update it atomically, saving a reallocation.	2025-07-03 09:20:22 +02:00
Peter Zhu	ead3739c34	Inline ASAN poison functions when ASAN is not enabled The ASAN poison functions was always defined in gc.c, even if ASAN was not enabled. This made function calls to happen all the time even if ASAN is not enabled. This commit defines these functions as empty macros when ASAN is not enabled.	2025-06-30 10:25:58 -04:00
Erik Berlin	eab4a0bc8d	Fix race condition in signal handler query (#13712 ) * Fix race condition in signal handler query * Initialize signal lock dynamically and reset after fork * Fix signal handler mutex initialization conditions	2025-06-28 13:55:59 +09:00
Peter Zhu	d9b2d89976	Extract Ractor safe table used for frozen strings This commit extracts the Ractor safe table used for frozen strings into ractor_safe_table.c, which will allow it to be used elsewhere, including for the global symbol table.	2025-06-27 09:23:14 -04:00
Jean Boussier	242343ff80	variable.c: Refactor `generic_field_set` / `generic_ivar_set` These two functions are very similar, they can share most of their logic.	2025-06-26 16:25:57 +02:00
Yusuke Endoh	a18fa86351	Change how to correct the first lineno in the backtrace on ArgumentError Follow up to fix `3b7373fd00`. In that commit, the line number in the first frame was overwritten after the whole backtrace was created. There was a problem that the line number was overwritten even if the location was backpatched. Instead, this commit uses first_lineno if the frame is VM_FRAME_MAGIC_DUMMY when generating the backtrace. Before the patch: ``` $ ./miniruby -e '[1, 2].inject(:tap)' -e:in '<main>': wrong number of arguments (given 1, expected 0) (ArgumentError) from -e:1:in 'Enumerable#inject' from -e:1:in '<main>' ``` After the patch: ``` $ ./miniruby -e '[1, 2].inject(:tap)' -e:1:in '<main>': wrong number of arguments (given 1, expected 0) (ArgumentError) from -e:1:in 'Enumerable#inject' from -e:1:in '<main>' ```	2025-06-24 11:39:58 +09:00
Nobuyoshi Nakada	af6b98f7a2	Make the critical level an enum	2025-06-23 20:03:48 +09:00
Jean Boussier	32ee3fab0a	Shink RClass when it is known they can't be namespaced Even when namespaces are enabled, only a few core classes created during init will eventually be namespaced. For these it's OK to allocate a 320B slot to hold the extra namespace stuff. But for any class created post init, we know we'll never need the namespace and we can fit in a 160B slot.	2025-06-23 10:04:58 +01:00
Jean Boussier	ea4a53c595	Avoid creating namespace table for classes that can't be namespaced.	2025-06-23 10:04:58 +01:00
Jean Boussier	96a0c2065a	Mark RClass instance that may be namespaced with RCLASS_NAMESPACEABLE	2025-06-23 10:04:58 +01:00
Jean Boussier	393e9a5f3e	Optimize `rb_namespace_available` Rather than to lazily check the env using a trinary value, we can more straightforwardly check for the env during the VM boot. This allow `rb_namespace_available` to just be a pointer dereference.	2025-06-23 10:04:58 +01:00
Jean Boussier	cd9f447be2	Refactor generic fields to use `T_IMEMO/fields` objects. Followup: https://github.com/ruby/ruby/pull/13589 This simplify a lot of things, as we no longer need to manually manage the memory, we can use the Read-Copy-Update pattern and avoid numerous race conditions. Co-Authored-By: Étienne Barrié <etienne.barrie@gmail.com>	2025-06-17 15:28:05 +02:00
Jean Boussier	164486a954	Refactor `rb_imemo_fields_new` to not assume T_CLASS	2025-06-17 15:28:05 +02:00
Jean Boussier	fb68721f63	Rename `imemo_class_fields` -> `imemo_fields`	2025-06-17 15:28:05 +02:00
Jean Boussier	9e839d3c0e	Optimize `benchmark/vm_ivar_of_class` ``` compare-ruby: ruby 3.5.0dev (2025-06-17T08:45:40Z master `e9d35671d2`) +PRISM [arm64-darwin24] last_commit=[ruby/json] Fix a typo built-ruby: ruby 3.5.0dev (2025-06-17T09:27:05Z opt-getivar-for-cl.. ed1d7cd778) +PRISM [arm64-darwin24] \| \|compare-ruby\|built-ruby\| \|:---------------------\|-----------:\|---------:\| \|vm_ivar_of_class_set \| 12.306M\| 13.957M\| \| \| -\| 1.13x\| \|vm_ivar_of_class \| 16.167M\| 24.029M\| \| \| -\| 1.49x\| ```	2025-06-17 13:00:31 +02:00
Samuel Williams	68625a23d6	Fix blocking operation cancellation. (#13614 ) Expose `rb_thread_resolve_unblock_function` internally.	2025-06-14 12:32:51 +09:00
Jean Boussier	a99d941cac	Add SHAPE_ID_HAS_IVAR_MASK for quick ivar check This allow checking if an object has ivars with just a shape_id mask.	2025-06-13 19:46:29 +02:00
John Hawthorn	a34fcf401b	Add a new_thread flag to rb_interrupt_exec Previously rb_ractor_interrupt_exec would use an intermediate function to create a new thread with the actual target function, replacing the data being passed in with a piece of malloc memory holding the "next" function and the original data. Because of this, passing rb_interrupt_exec_flag_value_data to rb_ractor_interrupt_exec didn't have the intended effect of allowing data to be passed in and marked. This commit adds a rb_interrupt_exec_flag_new_thread flag, which both simplifies the implementation and allows the original data to be marked.	2025-06-12 13:13:55 -07:00
Jean Boussier	a74c385208	Make setting and accessing class ivars lock-free Now that class fields have been deletated to a T_IMEMO/class_fields when we're in multi-ractor mode, we can read and write class instance variable in an atomic way using Read-Copy-Update (RCU). Note when in multi-ractor mode, we always use RCU. In theory we don't need to, instead if we ensured the field is written before the shape is updated it would be safe. Benchmark: ```ruby Warning[:experimental] = false class Foo @foo = 1 @bar = 2 @baz = 3 @egg = 4 @spam = 5 class << self attr_reader :foo, :bar, :baz, :egg, :spam end end ractors = 8.times.map do Ractor.new do 1_000_000.times do Foo.bar + Foo.baz * Foo.egg - Foo.spam end end end if Ractor.method_defined?(:value) ractors.each(&:value) else ractors.each(&:take) end ``` This branch vs Ruby 3.4: ```bash $ hyperfine -w 1 'ruby --disable-all ../test.rb' './miniruby ../test.rb' Benchmark 1: ruby --disable-all ../test.rb Time (mean ± σ): 3.162 s ± 0.071 s [User: 2.783 s, System: 10.809 s] Range (min … max): 3.093 s … 3.337 s 10 runs Benchmark 2: ./miniruby ../test.rb Time (mean ± σ): 208.7 ms ± 4.6 ms [User: 889.7 ms, System: 6.9 ms] Range (min … max): 202.8 ms … 222.0 ms 14 runs Summary ./miniruby ../test.rb ran 15.15 ± 0.47 times faster than ruby --disable-all ../test.rb ```	2025-06-12 14:55:13 +02:00
Jean Boussier	8b5ac5abf2	Fix class instance variable inside namespaces Now that classes fields are delegated to an object with its own shape_id, we no longer need to mark all classes as TOO_COMPLEX.	2025-06-12 13:43:29 +02:00
Jean Boussier	3abdd4241f	Turn `rb_classext_t.fields` into a T_IMEMO/class_fields This behave almost exactly as a T_OBJECT, the layout is entirely compatible. This aims to solve two problems. First, it solves the problem of namspaced classes having a single `shape_id`. Now each namespaced classext has an object that can hold the namespace specific shape. Second, it open the door to later make class instance variable writes atomics, hence be able to read class variables without locking the VM. In the future, in multi-ractor mode, we can do the write on a copy of the `fields_obj` and then atomically swap it. Considerations: - Right now the `RClass` shape_id is always synchronized, but with namespace we should likely mark classes that have multiple namespace with a specific shape flag.	2025-06-12 07:58:16 +02:00
Jean Boussier	95201299fd	Refactor the last references to `rb_shape_t` The type isn't opaque because Ruby isn't often compiled with LTO, so for optimization purpose it's better to allow as much inlining as possible. However ideally only `shape.c` and `shape.h` should deal with the actual struct, and everything else should just deal with opaque `shape_id_t`.	2025-06-11 16:38:38 +02:00
alpaca-tc	c8ddc0a843	Optimize callcache invalidation for refinements Fixes [Bug #21201] This change addresses a performance regression where defining methods inside `refine` blocks caused severe slowdowns. The issue was due to `rb_clear_all_refinement_method_cache()` triggering a full object space scan via `rb_objspace_each_objects` to find and invalidate affected callcaches, which is very inefficient. To fix this, I introduce `vm->cc_refinement_table` to track callcaches related to refinements. This allows us to invalidate only the necessary callcaches without scanning the entire heap, resulting in significant performance improvement.	2025-06-09 12:33:35 +09:00
Jean Boussier	8c4e368dcf	shape.c: ensure heap_index is consistent for complex shapes	2025-06-07 18:30:44 +02:00
Samuel Williams	81a23c5793	`rb_io_blocking_operation_exit` should not execute with pending interrupts.	2025-06-06 13:13:16 +09:00

1 2 3 4 5 ...

659 commits