Commit graph

78 commits

Author SHA1 Message Date
Peter Zhu
f14e52c8c4 Fix setting GC stress at boot when objspace not available 2024-03-27 09:39:23 -04:00
eileencodes
e16086b7f2 Refactor init_copy gc attributes
This PR moves `rb_copy_wb_protected_attribute` and
`rb_gc_copy_finalizer` into a single function called
`rb_gc_copy_attributes` to be called by `init_copy`. This reduces the
surface area of the GC API.
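
A minimal sketch of the merged entry point, assuming the two former
functions keep their existing names and signatures (the actual body in
gc.c may differ):

```c
/* Sketch: the two per-copy GC hooks folded into a single call site
 * for init_copy. */
void
rb_gc_copy_attributes(VALUE dest, VALUE obj)
{
    rb_copy_wb_protected_attribute(dest, obj); /* copy write-barrier state */
    rb_gc_copy_finalizer(dest, obj);           /* copy any finalizer */
}
```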

Co-authored-by: Peter Zhu <peter@peterzhu.ca>
2024-03-26 14:29:36 -04:00
Peter Zhu
9cf754b648 Fix --debug=gc_stress flag
ruby_env_debug_option gets called after Init_gc_stress, so the
--debug=gc_stress flag never works.
2024-03-25 13:07:39 -04:00
Peter Zhu
e07441f05f Make rb_aligned_malloc private
It is not used anywhere else.
2024-03-20 10:27:41 -04:00
Peter Zhu
3f5f04afa7 Remove duplicated function prototype rb_gc_disable_no_rest 2024-03-18 16:39:04 -04:00
Peter Zhu
4469729558 Remove rb_raw_obj_info_basic
It's not used outside of gc.c.
2024-03-18 10:19:11 -04:00
Peter Zhu
ff51dc5654 [Feature #20265] Remove rb_newobj_of and RB_NEWOBJ_OF 2024-03-14 12:53:04 -04:00
Peter Zhu
6ad347a105 Don't directly read the SIZE_POOL_COUNT in shapes
This removes the assumption about SIZE_POOL_COUNT for shapes.
2024-03-13 09:55:52 -04:00
Peter Zhu
2fc551e34e Simplify NEWOBJ_OF macro 2024-03-13 09:05:52 -04:00
Peter Zhu
2c349cf4b6 Use NEWOBJ_OF_ec in NEWOBJ_OF_0 2024-03-11 19:30:09 -04:00
Jean Boussier
2d80b6093f Retire RUBY_MARK_UNLESS_NULL
Marking `Qnil` or `Qfalse` works fine, so having
an extra macro to avoid it isn't needed.
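
For context, the retired macro wrapped the mark call in a truthiness
check, roughly like this (a sketch; the exact historical definition may
differ slightly):

```c
/* Sketch of the retired macro: it skipped marking for nil/false,
 * a guard rb_gc_mark() already performs for immediate values. */
#define RUBY_MARK_UNLESS_NULL(ptr) do { \
    if (RTEST(ptr)) rb_gc_mark(ptr);    \
} while (0)
```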
2024-03-08 14:13:14 +01:00
Peter Zhu
5481dbef07 Make rb_define_finalizer_no_check private 2024-02-28 13:45:19 -05:00
Peter Zhu
dcc976add9 Remove unused rb_gc_id2ref_obj_tbl 2024-02-28 12:21:38 -05:00
Peter Zhu
78ae6dbb11 Remove rb_objspace_marked_object_p
rb_objspace_marked_object_p is no longer used in the objspace module, so
we can remove it.
2024-02-26 17:05:34 -05:00
Peter Zhu
7538703d1b Make rb_objspace_data_type_memsize private
rb_objspace_data_type_memsize is not used in the objspace module, so we
can make it private.
2024-02-26 17:05:34 -05:00
Peter Zhu
c9b6cd4223 Remove unused rb_objspace_each_objects_without_setup 2024-02-26 14:34:24 -05:00
Peter Zhu
e65315a725 Extract imemo functions from gc.c into imemo.c 2024-02-22 11:35:09 -05:00
Peter Zhu
1d3b306753 Move rb_class_allocate_instance from gc.c to object.c 2024-02-14 13:43:02 -05:00
KJ Tsanaktsidis
25f5b83689 Fix crash when printing RGENGC_DEBUG=5 output from GC
I was trying to debug an (unrelated) issue in the GC, and wanted to turn
on the trace-level GC output by compiling it with -DRGENGC_DEBUG=5.
Unfortunately, this actually causes a crash in newobj_init() because the
code there tries to log the obj_info() of the newly created object.
However, the object is not actually sufficiently set up for some of the
things that obj_info() tries to do:

* The instance variable table for a class is not yet initialized, and
  when using variable-length RVALUES, said ivar table is embedded in
  as-yet uninitialized memory after the struct RValue. Attempting to read
  this, as obj_info() does, causes a crash.
* T_DATA variables need to dereference their ->type field to print out
  the underlying C type name, which is not set up until newobj_fill() is
  called.

To fix this, create a new method `obj_info_basic`, which dumps out only
the parts of the object that are valid before the object is fully
initialized.
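
A sketch of the approach, assuming only the object header is valid at
this point (the function name comes from the commit; everything else
here is illustrative):

```c
/* Sketch: report only fields that are valid before newobj_fill() runs.
 * The flags word is written first, so BUILTIN_TYPE() is safe, while the
 * ivar table and a T_DATA's ->type pointer are not yet initialized. */
static const char *
obj_info_basic(VALUE obj)
{
    static char buff[0x100];
    snprintf(buff, sizeof(buff), "%p (type=%d)",
             (void *)obj, (int)BUILTIN_TYPE(obj));
    return buff;
}
```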

[Fixes #18795]
2024-01-11 10:44:57 +11:00
Peter Zhu
269c705f93 Fix compaction for generic ivars
When a generic instance variable has a shape, it is marked movable. If
it transitions to too complex, it needs to update references, otherwise
it may end up with incorrect references.
2023-11-24 13:29:04 -05:00
Jean Boussier
e5364ea496 rb_shape_transition_shape_capa: use optimal sizes transitions
Previously the growth was 3(embed), 6, 12, 24, ...

With this change it's now 3(embed), 8, 16, 32, 64, ... by default.

However, since a power of two isn't the best size for all allocators,
if `malloc_usable_size` is available, we use it to discover the best
offset.

On Linux/glibc 2.35 for instance, the growth will be 3(embed), 7, 15, 31
to avoid wasting 8B per object.

Test program:

```c
/* Includes and VALUE_SIZE were implicit in the original snippet:
 * VALUE_SIZE is assumed to be sizeof(VALUE), i.e. 8 bytes on 64-bit,
 * and malloc_usable_size() is a glibc extension from <malloc.h>. */
#include <stdio.h>
#include <stdlib.h>
#include <malloc.h>

#define VALUE_SIZE 8

size_t test(size_t slots) {
    size_t allocated = slots * VALUE_SIZE;
    void *test_ptr = malloc(allocated);
    size_t wasted = malloc_usable_size(test_ptr) - allocated;
    free(test_ptr);
    fprintf(stderr, "slots = %lu, wasted_bytes = %lu\n", slots, wasted);
    return wasted;
}

int main(int argc, char *argv[]) {
    size_t best_padding = 0;
    size_t padding = 0;
    /* Probe how many slots to shave off a power-of-two allocation
     * before the allocator stops wasting bytes. */
    for (padding = 0; padding <= 2; padding++) {
        size_t wasted = test(8 - padding);
        if (wasted == 0) {
            best_padding = padding;
            break;
        }
    }

    size_t index = 0;
    fprintf(stderr, "=============== naive ================\n");

    size_t list_size = 4;
    for (index = 0; index < 10; index++) {
        test(list_size);
        list_size *= 2;
    }

    fprintf(stderr, "=============== auto-padded (-%lu) ================\n", best_padding);

    list_size = 4;
    for (index = 0; index < 10; index++) {
        test(list_size - best_padding);
        list_size *= 2;
    }

    fprintf(stderr, "\n\n");
    return 0;
}
```

```
===== glibc ======
slots = 8, wasted_bytes = 8
slots = 7, wasted_bytes = 0
=============== naive ================
slots = 4, wasted_bytes = 8
slots = 8, wasted_bytes = 8
slots = 16, wasted_bytes = 8
slots = 32, wasted_bytes = 8
slots = 64, wasted_bytes = 8
slots = 128, wasted_bytes = 8
slots = 256, wasted_bytes = 8
slots = 512, wasted_bytes = 8
slots = 1024, wasted_bytes = 8
slots = 2048, wasted_bytes = 8
=============== auto-padded (-1) ================
slots = 3, wasted_bytes = 0
slots = 7, wasted_bytes = 0
slots = 15, wasted_bytes = 0
slots = 31, wasted_bytes = 0
slots = 63, wasted_bytes = 0
slots = 127, wasted_bytes = 0
slots = 255, wasted_bytes = 0
slots = 511, wasted_bytes = 0
slots = 1023, wasted_bytes = 0
slots = 2047, wasted_bytes = 0
```

```
==========  jemalloc =======
slots = 8, wasted_bytes = 0
=============== naive ================
slots = 4, wasted_bytes = 0
slots = 8, wasted_bytes = 0
slots = 16, wasted_bytes = 0
slots = 32, wasted_bytes = 0
slots = 64, wasted_bytes = 0
slots = 128, wasted_bytes = 0
slots = 256, wasted_bytes = 0
slots = 512, wasted_bytes = 0
slots = 1024, wasted_bytes = 0
slots = 2048, wasted_bytes = 0
=============== auto-padded (-0) ================
slots = 4, wasted_bytes = 0
slots = 8, wasted_bytes = 0
slots = 16, wasted_bytes = 0
slots = 32, wasted_bytes = 0
slots = 64, wasted_bytes = 0
slots = 128, wasted_bytes = 0
slots = 256, wasted_bytes = 0
slots = 512, wasted_bytes = 0
slots = 1024, wasted_bytes = 0
slots = 2048, wasted_bytes = 0
```
2023-10-23 09:33:15 +02:00
Peter Zhu
12102d101a Fix crash in WeakMap during compaction
WeakMap can crash during compaction because st_insert could allocate
memory.
2023-09-06 14:20:23 -04:00
Peter Zhu
9a8398a18f Introduce rb_gc_remove_weak
During incremental marking, Ruby code can execute and deallocate
memory buffers that were registered with rb_gc_mark_weak, which can
cause use-after-free bugs.
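
A sketch of the intended usage, assuming `rb_gc_remove_weak(parent, slot)`
un-registers a weak slot before its backing memory is released (struct
and function names here are illustrative):

```c
struct my_table {
    long len;
    VALUE *entries; /* each slot registered via rb_gc_mark_weak() */
};

/* Sketch: free the table's buffer while the owning object stays alive.
 * Each weak slot is un-registered first, so an in-progress incremental
 * mark can no longer write into memory that is about to be freed. */
static void
my_table_clear(VALUE owner, struct my_table *tbl)
{
    for (long i = 0; i < tbl->len; i++) {
        rb_gc_remove_weak(owner, &tbl->entries[i]);
    }
    xfree(tbl->entries);
    tbl->entries = NULL;
    tbl->len = 0;
}
```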
2023-09-05 14:32:15 -04:00
Matt Valentine-House
322548180d Prevent rb_gc_mark_values from pinning objects
This is an internal only function not exposed to the C extension API.
Its only use so far is from rb_vm_mark, where it's used to mark the
values in the vm->trap_list.cmd array.

There shouldn't be any reason why these cannot move.

This commit allows them to move by updating their references during the
reference updating step of compaction.

To do this we've introduced another internal function
rb_gc_update_values as a partner to rb_gc_mark_values.

This allows us to refactor rb_gc_mark_values to not pin.
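
A sketch of the pairing, assuming both functions walk the same array of
VALUEs (the callback names and surrounding code are illustrative; the
real call site is rb_vm_mark):

```c
/* Sketch: mark an array of VALUEs without pinning them... */
static void
vm_mark(rb_vm_t *vm)
{
    rb_gc_mark_values(RUBY_NSIG, vm->trap_list.cmd);
}

/* ...and fix the same slots up while compaction updates references. */
static void
vm_update_references(rb_vm_t *vm)
{
    rb_gc_update_values(RUBY_NSIG, vm->trap_list.cmd);
}
```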
2023-08-31 19:31:18 +01:00
Peter Zhu
bfb395c620 Implement weak references in the GC
[Feature #19783]

This commit adds support for weak references in the GC through the
function `rb_gc_mark_weak`. Unlike strong references, weak references
do not mark the object, but rather let the GC know that an object
refers to another one. If the child object is freed, the pointer in
the parent object is overwritten with `Qundef`.
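
A sketch of how a mark function might register a weak reference,
assuming a hypothetical T_DATA struct (`rb_gc_mark_weak` takes a pointer
to the slot so the GC can overwrite it):

```c
struct my_cache {
    VALUE entry; /* weak: the GC may replace this with Qundef */
};

static void
my_cache_mark(void *ptr)
{
    struct my_cache *cache = ptr;
    /* Unlike rb_gc_mark(), this does not keep `entry` alive; if the
     * referenced object is freed, the GC writes Qundef into the slot. */
    rb_gc_mark_weak(&cache->entry);
}
```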

Co-Authored-By: Jean Boussier <byroot@ruby-lang.org>
2023-08-25 09:01:21 -04:00
Jean Boussier
fa30b99c34 Implement Process.warmup
[Feature #18885]

For now, the optimizations performed are:

  - Run a major GC
  - Compact the heap
  - Promote all surviving objects to oldgen

Other optimizations may follow.
2023-07-17 11:20:15 +02:00
yui-knk
b481b673d7 [Feature #19719] Universal Parser
Introduce Universal Parser mode for the parser.
This commit includes these changes:

* Introduce `UNIVERSAL_PARSER` macro. All CRuby-related functions
  are passed via `struct rb_parser_config_struct` when this macro is enabled.
* Add CI task with 'cppflags=-DUNIVERSAL_PARSER' for ubuntu.
2023-06-12 18:23:48 +09:00
Peter Zhu
a0d1069e03 Make classes embedded on 32 bit
Classes are now exactly 80 bytes when embedded, which perfectly fits the
3rd size pool on 32 bit systems.
2023-04-16 11:06:31 -04:00
Peter Zhu
b4571097df Enable 5 size pools on 32 bit systems
This commit will allow 32 bit systems to take advantage of VWA.
2023-04-11 11:25:12 -04:00
Matt Valentine-House
026321c5b9 [Feature #19474] Refactor NEWOBJ macros
NEWOBJ_OF is now our canonical newobj macro. It takes an optional ec (execution context).
2023-04-06 11:07:16 +01:00
Matt Valentine-House
879cda98a4 Remove dependency of vm_core.h on shape.h
so that shape.h can now happily include gc.h
2023-04-06 11:07:16 +01:00
Peter Zhu
1da2e7fca3
[Feature #19579] Remove !USE_RVARGC code (#7655)

The Variable Width Allocation feature was turned on by default in Ruby
3.2. Since then, we haven't received bug reports or backports to the
non-Variable Width Allocation code paths, so we assume that nobody is
using it. We also don't plan on maintaining the non-Variable Width
Allocation code, so we are going to remove it.
2023-04-04 17:30:06 -04:00
Aaron Patterson
8525603c72
Revert "Fix transient heap mode"
This reverts commit 87253d047c.

Revert "Implement `Process.warmup`"

This reverts commit ba6ccd8714.
2023-04-04 12:59:14 -07:00
Jean Boussier
ba6ccd8714 Implement Process.warmup
[Feature #18885]

For now, the optimizations performed are:

  - Run a major GC
  - Compact the heap
  - Promote all surviving objects to oldgen

Other optimizations may follow.
2023-04-04 19:49:08 +02:00
Peter Zhu
a2ff9423a1 Allow user defined SIZE_POOL_COUNT
We shouldn't overwrite the value of SIZE_POOL_COUNT if the user has
specified one.
2023-03-27 09:40:24 -04:00
Matt Valentine-House
5791aa6263 Move RB_GC_SAVE_MACHINE_CONTEXT to vm_core.h 2023-03-15 21:26:26 +00:00
Peter Zhu
f98a7fd28d Move WeakMap and WeakKeyMap code to weakmap.c
These classes don't belong in gc.c as they're not actually part of the
GC. This commit refactors the code by moving all the code into a
weakmap.c file.
2023-03-10 09:32:10 -05:00
Matt Valentine-House
9546c70e09 Remove unused forward decl of rb_thread_struct 2023-03-09 21:14:01 +00:00
Takashi Kokubun
233ddfac54 Stop exporting symbols for MJIT 2023-03-06 21:59:23 -08:00
Matt Valentine-House
72aba64fff Merge gc.h and internal/gc.h
[Feature #19425]
2023-02-09 10:32:29 -05:00
Peter Zhu
1c0127ed5e Make Time objects WB protected
Co-Authored-By: Jean Boussier <byroot@ruby-lang.org>
2023-02-07 16:03:08 -05:00
Peter Zhu
0ee4fec988 Fix typo in gc.h [ci skip] 2023-02-07 13:39:34 -05:00
Nobuyoshi Nakada
15328302dc In UNALIGNED_MEMBER_PTR cast through void pointer
Suppress warnings shown even with `-Waddress-of-packed-member`
disabled in gcc 11.
2023-01-21 00:30:34 +09:00
Peter Zhu
41bf2354e3 Add rb_gc_mark_and_move and implement on iseq
This commit adds rb_gc_mark_and_move which takes a pointer to an object
and marks it during marking phase and updates references during compaction.
This allows for marking and reference updating to be combined into a
single function, which reduces code duplication and prevents bugs if
marking and reference updating go out of sync.

This commit also implements rb_gc_mark_and_move on iseq as an example.
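
A sketch of the pattern, assuming a hypothetical struct with two VALUE
fields (iseq applies the same idea across its many fields):

```c
struct my_node {
    VALUE name;
    VALUE children;
};

/* Sketch: one function is called from both the mark callback and the
 * update-references callback, so the two phases cannot drift apart.
 * While marking, rb_gc_mark_and_move() marks *slot; while compacting,
 * it rewrites *slot with the object's new address. */
static void
my_node_mark_and_move(struct my_node *node)
{
    rb_gc_mark_and_move(&node->name);
    rb_gc_mark_and_move(&node->children);
}
```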
2023-01-19 11:23:35 -05:00
Peter Zhu
4fa7d38324 Don't redefine RB_OBJ_WRITE
RB_OBJ_WRITE already exists in rgengc.h, so we shouldn't redefine it in
gc.h.
2023-01-18 08:49:32 -05:00
Peter Zhu
c505448cdb Move definition of SIZE_POOL_COUNT back to gc.h
SIZE_POOL_COUNT is a GC macro, so it should belong in gc.h and not shape.h.
SIZE_POOL_COUNT doesn't depend on shape.h so we can have shape.h depend
on gc.h.

Co-Authored-By: Matt Valentine-House <matt@eightbitraptor.com>
2022-12-15 16:33:46 -05:00
Jemma Issroff
c1ab6ddc9a Transition complex objects to "too complex" shape
When an object becomes "too complex" (in other words it has too many
variations in the shape tree), we transition it to use a "too complex"
shape and use a hash for storing instance variables.

Without this patch, there were rare cases where shape tree growth could
"explode" and cause performance degradation on what would otherwise have
been cached fast paths.

This patch puts a limit on shape tree growth, and gracefully degrades in
the rare case where there could be a factorial growth in the shape tree.

For example:

```ruby
class NG; end

HUGE_NUMBER.times do
  NG.new.instance_variable_set(:"@unique_ivar_#{_1}", 1)
end
```

We consider objects to be "too complex" when the object's class has more
than SHAPE_MAX_VARIATIONS (currently 8) leaf nodes in the shape tree and
the object introduces a new variation (a new leaf node) associated with
that class.

For example, new variations on instances of the following class would be
considered "too complex" because those instances create more than 8
leaves in the shape tree:

```ruby
class Foo; end
9.times { Foo.new.instance_variable_set(":@uniq_#{_1}", 1) }
```

However, the following class is *not* too complex because it only has
one leaf in the shape tree:

```ruby
class Foo
  def initialize
    @a = @b = @c = @d = @e = @f = @g = @h = @i = nil
  end
end
9.times { Foo.new }
```

This case is rare, so we don't expect this change to impact performance
of most applications, but it needs to be handled.

Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org>
2022-12-15 10:06:04 -08:00
Jemma Issroff
5246f4027e Transition shape when object's capacity changes
This commit adds a `capacity` field to shapes, and adds shape
transitions whenever an object's capacity changes. Objects which are
allocated out of a bigger size pool will also make a transition from the
root shape to the shape with the correct capacity for their size pool
when they are allocated.

This commit will allow us to remove numiv from objects completely, and
will also mean we can guarantee that if two objects share shapes, their
IVs are in the same positions (an embedded and extended object cannot
share shapes). This will enable us to implement ivar sets in YJIT using
object shapes.
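
A simplified sketch of a shape node with the new field (illustrative;
see shape.h for the actual layout):

```c
/* Sketch: a node in the shape tree. This commit adds `capacity`. */
struct rb_shape {
    ID edge_name;               /* ivar that created this transition */
    attr_index_t next_iv_index; /* position for the next ivar */
    uint32_t capacity;          /* NEW: how many ivars fit before resizing */
    uint8_t type;               /* e.g. ivar transition, frozen, root */
    shape_id_t parent_id;       /* edge back to the parent shape */
};
```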

Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org>
2022-11-10 10:11:34 -05:00
Peter Zhu
a41fbc2c95 Increase SIZE_POOL_COUNT to 5
Having more size pools will allow us to allocate larger objects
through Variable Width Allocation.

I have attached some benchmark results below.

Discourse:
  On Discourse, we don't see much change in response times. We do see
  a small reduction in RSS.

  Branch RSS: 377.8 MB
  Master RSS: 396.3 MB

railsbench:
  On railsbench, we don't see a big change in RPS or p99 performance.
  We see a small increase in RSS.

  Branch RPS: 815.38
  Master RPS: 811.73

  Branch p99: 1.69 ms
  Master p99: 1.68 ms

  Branch RSS: 90.6 MB
  Master RSS: 89.4 MB

liquid:
  We don't see a significant change in liquid performance.

  Branch parse & render: 29.041 I/s
  Master parse & render: 29.211 I/s
2022-05-09 08:45:24 -04:00
Peter Zhu
dde164e968 Decouple incremental marking step from page sizes
Currently, the number of incremental marking steps is calculated based
on the number of pooled pages available. This means that if we make Ruby
heap pages larger, fewer incremental marking steps would run (and each
incremental marking step would take longer).

This commit changes incremental marking to run after every
INCREMENTAL_MARK_STEP_ALLOCATIONS number of allocations. This means that
the behaviour of incremental marking remains the same regardless of the
Ruby heap page size.
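
A sketch of the new trigger, assuming a simple per-allocation counter
(the constant name comes from the commit; the surrounding code and field
names are illustrative):

```c
#define INCREMENTAL_MARK_STEP_ALLOCATIONS 500 /* value illustrative */

/* Sketch: every object allocation ticks a counter; when it reaches the
 * threshold, one incremental marking step runs. The cadence no longer
 * depends on how many pooled pages (and thus how large the pages) are. */
static void
newobj_incremental_tick(rb_objspace_t *objspace)
{
    if (is_incremental_marking(objspace)
        && ++objspace->rincgc.allocation_count >= INCREMENTAL_MARK_STEP_ALLOCATIONS) {
        objspace->rincgc.allocation_count = 0;
        gc_marking_step(objspace); /* illustrative: run one mark step */
    }
}
```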

I've benchmarked against discourse benchmarks and did not get a
significant change in response times beyond the margin of error. This is
expected as this new incremental marking algorithm behaves very
similarly to the previous one.
2022-03-30 09:33:17 -04:00