Fix pandas DataFrame dtype preservation in VMDataset initialization
validmind-library
2.8.22
bug
enhancement
You can now reduce memory usage when initializing VMDataset objects with vm.init_dataset(). We’ve introduced a copy_data option that lets you avoid copying the input DataFrame, which is useful when handling large datasets in environments with limited memory. By default, copy_data is set to True. Here’s how to use it:
    vm_ds = vm.init_dataset(
        dataset=df,
        input_id="demo",
        target_column="target",
        copy_data=False,
    )
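The trade-off behind a copy flag like this can be sketched in plain Python. The class below, SketchDataset, is a hypothetical stand-in and not the ValidMind implementation; it assumes copy_data behaves like a conventional copy flag, where skipping the copy means the dataset object and the caller share the same underlying data.

```python
import copy


class SketchDataset:
    """Hypothetical stand-in for a dataset wrapper; NOT the ValidMind class."""

    def __init__(self, data, copy_data=True):
        # copy_data=True: the wrapper holds an independent copy (extra memory).
        # copy_data=False: the wrapper keeps a reference to the caller's object.
        self.data = copy.deepcopy(data) if copy_data else data


rows = {"age": [31, 45, 27], "target": [0, 1, 0]}

copied = SketchDataset(rows, copy_data=True)
shared = SketchDataset(rows, copy_data=False)

# Mutate the caller's object after both wrappers were created.
rows["age"][0] = 99

copied_first_age = copied.data["age"][0]  # independent copy keeps the old value
shared_first_age = shared.data["age"][0]  # shared reference sees the new value
shares_memory = shared.data is rows       # no second copy was allocated
```

Under that assumption, skipping the copy saves memory but means later in-place changes to the input can also be visible through the dataset object, so it is safest when the input is not modified afterwards.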