The OSError occurred because TFDS opens many sharded data files even for small samples to handle internal data access and file shuffling. To fix this, you should increase your operating system's open file limit (ulimit -n) and set shuffle_files=False
in tfds.load()
to reduce concurrent file access for quick sampling. Please refer to the gist where i attempted to solve this issue.