b2txt25

Author	SHA1	Message	Date
Zchen	e399cf262a	ff	2025-10-20 13:37:11 +08:00
Zchen	7358ff3d79	Enable soft device placement for CTC operations and update related comments	2025-10-20 11:22:13 +08:00
Zchen	f8fb4d7133	Remove setup script, TPU memory monitor, and training model script - Deleted `setup_tensorflow_tpu.sh` which was responsible for setting up the TensorFlow environment on TPU v5e-8. - Removed `tpu_memory_monitor.py`, a tool for monitoring TPU memory usage during training. - Eliminated `train_model.py`, the script for training the Brain-to-Text RNN model.	2025-10-20 11:05:03 +08:00
Zchen	7c272b7c5b	Remove test scripts for data loading and TensorFlow implementation	2025-10-20 01:37:22 +08:00
Zchen	0a0e07a193	Remove custom CTC loss implementation for TPU from the TripleGRUDecoder class	2025-10-20 01:16:50 +08:00
Zchen	06ddbc6ac2	Refactor input function to implement batch-first approach with dynamic padding and apply data augmentation post-batching for TPU compatibility	2025-10-20 00:58:29 +08:00
Zchen	fabf70cfa9	Enhance dataset shape analysis by implementing parallel processing and improving sampling logic	2025-10-20 00:35:17 +08:00
Zchen	e1669b5a4c	Increase batch size from 256 to 512 for training in rnn_args.yaml	2025-10-20 00:21:33 +08:00
Zchen	6e02894a8a	f	2025-10-20 00:13:39 +08:00
Zchen	4db3625dc5	f	2025-10-19 23:55:56 +08:00
Zchen	fed5fd8251	f	2025-10-19 22:25:21 +08:00
Zchen	4b373ab317	ff	2025-10-19 20:16:23 +08:00
Zchen	40d0fc50de	f	2025-10-19 13:18:20 +08:00
Zchen	4328114ed6	Add dataset shape analysis function and integrate into input function for TPU optimization	2025-10-19 11:04:36 +08:00
Zchen	cfd9653da9	Enhance dataset caching logic for training and validation sets with improved messaging	2025-10-19 10:31:31 +08:00
Zchen	558be0ad98	Refactor individual dataset creation for improved I/O efficiency and add logging for error handling	2025-10-19 10:31:18 +08:00
Zchen	d83f990beb	f	2025-10-17 12:20:17 +08:00
Zchen	eb058fe9d3	ff	2025-10-17 11:57:10 +08:00
Zchen	57360bec8a	Remove CPU optimization call and add logging for TPU strategy and data pipeline performance	2025-10-17 11:45:20 +08:00
Zchen	eb4e3fc69f	fff	2025-10-17 11:38:57 +08:00
Zchen	6c7abfcca8	f	2025-10-17 10:53:58 +08:00
Zchen	7ede7b5f12	f	2025-10-17 02:09:14 +08:00
Zchen	ca8c615505	f	2025-10-17 02:01:48 +08:00
Zchen	49700456b8	f	2025-10-17 01:58:28 +08:00
Zchen	8ee09b6b5e	f	2025-10-17 01:54:32 +08:00
Zchen	a5a3179ca6	f	2025-10-17 01:49:03 +08:00
Zchen	59fb73ee9f	f	2025-10-17 01:36:08 +08:00
Zchen	0a72143513	legacy adam	2025-10-17 01:26:02 +08:00
Zchen	7df78244e6	adamw to adam	2025-10-17 01:07:01 +08:00
Zchen	a96e272f7b	fix twice gradient cut	2025-10-17 00:51:53 +08:00
Zchen	7a43ebfb71	refactor: streamline model building and ensure dtype consistency in L2 loss calculation	2025-10-16 23:06:09 +08:00
Zchen	9453b70fad	remove quick test script for TensorFlow implementation fixes	2025-10-16 23:05:53 +08:00
Zchen	7efa33d730	f	2025-10-16 22:42:33 +08:00
Zchen	982d2dc256	f	2025-10-16 22:20:08 +08:00
Zchen	bd61136f93	f	2025-10-16 22:02:11 +08:00
Zchen	6f94ad5fae	f	2025-10-16 21:51:43 +08:00
Zchen	eefff1ce5e	fix	2025-10-16 21:40:43 +08:00
Zchen	426b72ef25	fix	2025-10-16 21:26:00 +08:00
Zchen	dde6378481	fixed	2025-10-16 21:13:42 +08:00
Zchen	a0b59c6987	fix	2025-10-16 21:06:01 +08:00
Zchen	ed6e21bfe4	fix 'NoneType' object has no attribute 'extended'	2025-10-16 20:57:40 +08:00
Zchen	1e7077bba7	AdamW fix	2025-10-16 20:44:55 +08:00
Zchen	c2661550ef	Memory leak fix	2025-10-16 20:26:32 +08:00
Zchen	1b9e0d9bdf	Adjust batch_size	2025-10-16 17:37:59 +08:00
Zchen	be578f2e1d	Fix data loader inefficiency	2025-10-16 17:14:06 +08:00
Zchen	a545cc5648	TPU maintenance	2025-10-16 13:39:05 +08:00
Zchen	5a1e446219	HBM	2025-10-16 11:42:56 +08:00
Zchen	0ff6634192	Simple fix	2025-10-16 10:53:42 +08:00
Zchen	df4a914bbd	Small parameters	2025-10-16 09:22:25 +08:00
Zchen	25561a7615	Very large batch_size	2025-10-16 01:57:19 +08:00

160 Commits