LAUNCH INFO 2022-07-25 04:15:52,492 ----------- Configuration ----------------------
LAUNCH INFO 2022-07-25 04:15:52,492 devices: None
LAUNCH INFO 2022-07-25 04:15:52,493 elastic_level: -1
LAUNCH INFO 2022-07-25 04:15:52,493 elastic_timeout: 30
LAUNCH INFO 2022-07-25 04:15:52,493 gloo_port: 6767
LAUNCH INFO 2022-07-25 04:15:52,493 host: None
LAUNCH INFO 2022-07-25 04:15:52,493 job_id: default
LAUNCH INFO 2022-07-25 04:15:52,493 legacy: False
LAUNCH INFO 2022-07-25 04:15:52,493 log_dir: log
LAUNCH INFO 2022-07-25 04:15:52,493 log_level: INFO
LAUNCH INFO 2022-07-25 04:15:52,493 master: None
LAUNCH INFO 2022-07-25 04:15:52,493 max_restart: 3
LAUNCH INFO 2022-07-25 04:15:52,493 nnodes: 1
LAUNCH INFO 2022-07-25 04:15:52,493 nproc_per_node: None
LAUNCH INFO 2022-07-25 04:15:52,493 rank: -1
LAUNCH INFO 2022-07-25 04:15:52,493 run_mode: collective
LAUNCH INFO 2022-07-25 04:15:52,493 server_num: None
LAUNCH INFO 2022-07-25 04:15:52,493 servers:
LAUNCH INFO 2022-07-25 04:15:52,493 trainer_num: None
LAUNCH INFO 2022-07-25 04:15:52,493 trainers:
LAUNCH INFO 2022-07-25 04:15:52,493 training_script: tools/train.py
LAUNCH INFO 2022-07-25 04:15:52,493 training_script_args: ['--config', 'configs/centerpoint/kitti_centerpoint_pillars_016voxel_epoch_nospa.yml', '--num_workers', '4', '--save_interval', '5', '--save_dir', 'output_nospa']
LAUNCH INFO 2022-07-25 04:15:52,493 with_gloo: 0
LAUNCH INFO 2022-07-25 04:15:52,493 --------------------------------------------------
LAUNCH INFO 2022-07-25 04:15:52,525 Job: default, mode collective, replicas 1[1:1], elastic False
LAUNCH INFO 2022-07-25 04:15:52,526 Run Pod: kwxphu, replicas 8, status ready
LAUNCH INFO 2022-07-25 04:15:52,631 Watching Pod: kwxphu, replicas 8, status running
grep: warning: GREP_OPTIONS is deprecated; please use an alias or script
2022-07-25 04:15:57,785 - INFO - Load 2207 Pedestrian database infos
2022-07-25 04:15:57,786 - INFO - Load 14357 Car database infos
2022-07-25 04:15:57,790 - INFO - Load 734 Cyclist database infos
2022-07-25 04:15:57,790 - INFO - After filtering min_num_points_in_box:
2022-07-25 04:15:57,790 - INFO - Load 2161 Pedestrian database infos
2022-07-25 04:15:57,790 - INFO - Load 13442 Car database infos
2022-07-25 04:15:57,790 - INFO - Load 699 Cyclist database infos
2022-07-25 04:15:57,825 - INFO - After filtering ignored difficulty:
2022-07-25 04:15:57,825 - INFO - Load 2066 Pedestrian database infos
2022-07-25 04:15:57,825 - INFO - Load 10520 Car database infos
2022-07-25 04:15:57,825 - INFO - Load 580 Cyclist database infos
2022-07-25 04:15:58,333 - INFO - ------------Environment Information-------------
platform: Linux-4.14.0_1-0-0-36-x86_64-with-debian-stretch-sid
gcc (GCC) 8.2.0
Python - 3.7.0 (default, May 7 2022, 07:55:29) [GCC 8.2.0]
Science Toolkits:
  cv2 - 4.6.0
  numpy - 1.21.6
  numba - 0.55.2
  pandas - 1.1.5
  pillow - 9.1.0
  skimage - 0.19.3
PaddlePaddle:
  paddle(gpu) - 2.3.1
  paddle3d - 0.5.0
  paddleseg - 2.5.0
  FLAGS_cudnn_deterministic - Not set.
  FLAGS_cudnn_exhaustive_search - Not set.
CUDA:
  cudnn - 7605
  nvcc - Cuda compilation tools, release 10.2, V10.2.89
GPUs:
------------------------------------------------
2022-07-25 04:15:58,358 - INFO - ---------------Config Information---------------
batch_size: 4
epochs: 160
lr_scheduler:
  base_learning_rate: 0.001
  lr_ratio_peak: 10
  lr_ratio_trough: 0.0001
  step_ratio_peak: 0.4
  type: OneCycleWarmupDecayLr
model:
  backbone:
    downsample_strides:
    - 1
    - 2
    - 2
    in_channels: 64
    layer_nums:
    - 3
    - 5
    - 5
    out_channels:
    - 64
    - 128
    - 256
    type: SecondBackbone
  bbox_head:
    code_weights:
    - 1.0
    - 1.0
    - 1.0
    - 1.0
    - 1.0
    - 1.0
    - 1.0
    - 1.0
    common_heads:
      dim:
      - 3
      - 2
      height:
      - 1
      - 2
      reg:
      - 2
      - 2
      rot:
      - 2
      - 2
    in_channels: 384
    tasks:
    - class_names:
      - Car
      num_class: 1
    - class_names:
      - Cyclist
      - Pedestrian
      num_class: 2
    type: CenterHead
    weight: 2.5
  middle_encoder:
    in_channels: 64
    point_cloud_range:
    - 0
    - -39.68
    - -3
    - 69.12
    - 39.68
    - 1
    type: PointPillarsScatter
    voxel_size:
    - 0.16
    - 0.16
    - 4
  neck:
    in_channels:
    - 64
    - 128
    - 256
    out_channels:
    - 128
    - 128
    - 128
    type: SecondFPN
    upsample_strides:
    - 0.5
    - 1
    - 2
    use_conv_for_no_stride: true
  use_spatial_attn_before_concat: false
  test_cfg:
    down_ratio: 2
    nms:
      nms_iou_threshold: 0.1
      nms_post_max_size: 83
      nms_pre_max_size: 1000
    point_cloud_range:
    - 0
    - -39.68
    - -3
    - 69.12
    - 39.68
    - 1
    post_center_limit_range:
    - -10.0
    - -50.0
    - -10.0
    - 80.0
    - 50.0
    - 10.0
    score_threshold: 0.1
    voxel_size:
    - 0.16
    - 0.16
    - 4
  type: CenterPoint
  voxel_encoder:
    feat_channels:
    - 64
    - 64
    in_channels: 4
    legacy: false
    max_num_points_in_voxel: 100
    point_cloud_range:
    - 0
    - -39.68
    - -3
    - 69.12
    - 39.68
    - 1
    type: PillarFeatureNet
    voxel_size:
    - 0.16
    - 0.16
    - 4
    with_distance: false
  voxelizer:
    max_num_points_in_voxel: 100
    max_num_voxels:
    - 12000
    - 40000
    point_cloud_range:
    - 0
    - -39.68
    - -3
    - 69.12
    - 39.68
    - 1
    type: HardVoxelizer
    voxel_size:
    - 0.16
    - 0.16
    - 4
optimizer:
  beta1:
    momentum_peak: 0.95
    momentum_trough: 0.85
    step_ratio_peak: 0.4
    type: OneCycleDecayWarmupMomentum
  beta2: 0.99
  grad_clip:
    clip_norm: 35
    type: ClipGradByGlobalNorm
  type: OneCycleAdam
  weight_decay: 0.01
train_dataset:
  class_balanced_sampling: false
  class_names:
  - Car
  - Cyclist
  - Pedestrian
  dataset_root: /dataset/KITTI
  mode: train
  transforms:
  - dim: 4
    type: LoadPointCloud
    use_dim: 4
  - type: RemoveCameraInvisiblePointsKITTI
  - class_names:
    - Car
    - Cyclist
    - Pedestrian
    database_anno_path: /dataset/KITTI/gt_paddle/kitti_train_gt_database/anno_info_train.pkl
    database_root: /dataset/KITTI/gt_paddle/
    ignored_difficulty:
    - -1
    max_num_samples_per_class:
      Car: 15
      Cyclist: 10
    min_num_points_in_box_per_class:
      Car: 5
      Cyclist: 5
      Pedestrian: 5
    type: SamplingDatabase
  - max_num_attempts: 100
    rotation_range:
    - -0.15707963267
    - 0.15707963267
    translation_std:
    - 0.25
    - 0.25
    - 0.25
    type: RandomObjectPerturb
  - type: RandomVerticalFlip
  - max_rot: 0.78539816
    min_rot: -0.78539816
    type: GlobalRotate
  - max_scale: 1.05
    min_scale: 0.95
    type: GlobalScale
  - translation_std:
    - 0.2
    - 0.2
    - 0.2
    type: GlobalTranslate
  - type: ShufflePoint
  - point_cloud_range:
    - 0
    - -39.68
    - -3
    - 69.12
    - 39.68
    - 1
    type: FilterBBoxOutsideRange
  - down_ratio: 2
    gaussian_overlap: 0.1
    max_objs: 500
    min_radius: 2
    point_cloud_range:
    - 0
    - -39.68
    - -3
    - 69.12
    - 39.68
    - 1
    tasks:
    - class_names:
      - Car
      num_class: 1
    - class_names:
      - Cyclist
      - Pedestrian
      num_class: 2
    type: Gt2CenterPointTarget
    voxel_size:
    - 0.16
    - 0.16
    - 4
  type: KittiPCDataset
val_dataset:
  class_names:
  - Car
  - Cyclist
  - Pedestrian
  dataset_root: /dataset/KITTI
  mode: val
  transforms:
  - dim: 4
    type: LoadPointCloud
    use_dim: 4
  - type: RemoveCameraInvisiblePointsKITTI
  type: KittiPCDataset
------------------------------------------------
W0725 04:15:58.361044 3690 gpu_resources.cc:61] Please NOTE: device: 0, GPU Compute Capability: 6.1, Driver API Version: 10.2, Runtime API Version: 10.2
W0725 04:15:58.361075 3690 gpu_resources.cc:91] device: 0, cuDNN Version: 7.6.
2022-07-25 04:16:00,322 - INFO - Finish CenterHead Initialization
server not ready, wait 3 sec to retry...
not ready endpoints:['10.181.196.13:26402', '10.181.196.13:32011']
I0725 04:16:03.385490 3690 nccl_context.cc:83] init nccl context nranks: 8 local rank: 0 gpu id: 0 ring id: 0
2022-07-25 04:16:56,217 - INFO - [TRAIN] epoch=1/160, iter=10/18560, loss=40.434525, lr=0.001000 | ETA 06:21:20
2022-07-25 04:17:08,643 - INFO - [TRAIN] epoch=1/160, iter=20/18560, loss=16.374845, lr=0.001000 | ETA 06:25:06
2022-07-25 04:17:20,992 - INFO - [TRAIN] epoch=1/160, iter=30/18560, loss=15.002150, lr=0.001000 | ETA 06:21:54
2022-07-25 04:17:33,295 - INFO - [TRAIN] epoch=1/160, iter=40/18560, loss=14.770219, lr=0.001001 | ETA 06:35:25
2022-07-25 04:17:45,555 - INFO - [TRAIN] epoch=1/160, iter=50/18560, loss=13.188998, lr=0.001001 | ETA 06:21:10
2022-07-25 04:17:57,784 - INFO - [TRAIN] epoch=1/160, iter=60/18560, loss=13.187563, lr=0.001001 | ETA 06:13:48
2022-07-25 04:18:09,953 - INFO - [TRAIN] epoch=1/160, iter=70/18560, loss=13.175962, lr=0.001002 | ETA 06:11:08
2022-07-25 04:18:22,161 - INFO - [TRAIN] epoch=1/160, iter=80/18560, loss=12.566396, lr=0.001003 | ETA 06:12:42
2022-07-25 04:18:34,294 - INFO - [TRAIN] epoch=1/160, iter=90/18560, loss=12.551312, lr=0.001003 | ETA 06:15:54
2022-07-25 04:18:46,383 - INFO - [TRAIN] epoch=1/160, iter=100/18560, loss=12.009351, lr=0.001004 | ETA 06:12:49
2022-07-25 04:18:58,474 - INFO - [TRAIN] epoch=1/160, iter=110/18560, loss=12.203223, lr=0.001005 | ETA 06:07:45
2022-07-25 04:19:42,916 - INFO - [TRAIN] epoch=2/160, iter=120/18560, loss=11.686024, lr=0.001006 | ETA 16:30:55
2022-07-25 04:19:55,155 - INFO - [TRAIN] epoch=2/160, iter=130/18560, loss=11.843185, lr=0.001007 | ETA 06:17:23
2022-07-25 04:20:07,243 - INFO - [TRAIN] epoch=2/160, iter=140/18560, loss=11.455157, lr=0.001008 | ETA 06:14:24
2022-07-25 04:20:19,364 - INFO - [TRAIN] epoch=2/160, iter=150/18560, loss=10.831976, lr=0.001009 | ETA 06:10:45
2022-07-25 04:20:31,508 - INFO - [TRAIN] epoch=2/160, iter=160/18560, loss=11.186121, lr=0.001010 | ETA 06:11:45
2022-07-25 04:20:43,986 - INFO - [TRAIN] epoch=2/160, iter=170/18560, loss=10.864419, lr=0.001012 | ETA 06:31:39
2022-07-25 04:20:56,324 - INFO - [TRAIN] epoch=2/160, iter=180/18560, loss=10.672402, lr=0.001013 | ETA 06:10:33
2022-07-25 04:21:08,792 - INFO - [TRAIN] epoch=2/160, iter=190/18560, loss=10.817212, lr=0.001014 | ETA 06:28:05
2022-07-25 04:21:21,247 - INFO - [TRAIN] epoch=2/160, iter=200/18560, loss=10.911943, lr=0.001016 | ETA 06:10:26
2022-07-25 04:21:33,646 - INFO - [TRAIN] epoch=2/160, iter=210/18560, loss=10.254212, lr=0.001018 | ETA 06:23:23
2022-07-25 04:21:45,892 - INFO - [TRAIN] epoch=2/160, iter=220/18560, loss=10.304187, lr=0.001019 | ETA 06:07:08
2022-07-25 04:21:58,036 - INFO - [TRAIN] epoch=2/160, iter=230/18560, loss=9.640083, lr=0.001021 | ETA 06:09:13
2022-07-25 04:22:39,466 - INFO - [TRAIN] epoch=3/160, iter=240/18560, loss=10.208808, lr=0.001023 | ETA 06:50:38
2022-07-25 04:22:51,700 - INFO - [TRAIN] epoch=3/160, iter=250/18560, loss=9.180613, lr=0.001025 | ETA 06:16:53
2022-07-25 04:23:04,040 - INFO - [TRAIN] epoch=3/160, iter=260/18560, loss=9.117454, lr=0.001027 | ETA 06:08:01
2022-07-25 04:23:16,429 - INFO - [TRAIN] epoch=3/160, iter=270/18560, loss=9.669688, lr=0.001029 | ETA 06:13:57
2022-07-25 04:23:28,701 - INFO - [TRAIN] epoch=3/160, iter=280/18560, loss=9.379323, lr=0.001031 | ETA 06:13:15
2022-07-25 04:23:41,048 - INFO - [TRAIN] epoch=3/160, iter=290/18560, loss=9.100562, lr=0.001034 | ETA 06:13:39
2022-07-25 04:23:53,382 - INFO - [TRAIN] epoch=3/160, iter=300/18560, loss=9.426084, lr=0.001036 | ETA 06:11:53
2022-07-25 04:24:05,817 - INFO - [TRAIN] epoch=3/160, iter=310/18560, loss=9.325964, lr=0.001038 | ETA 06:11:14
2022-07-25 04:24:17,984 - INFO - [TRAIN] epoch=3/160, iter=320/18560, loss=9.042095, lr=0.001041 | ETA 06:09:02
2022-07-25 04:24:30,106 - INFO - [TRAIN] epoch=3/160, iter=330/18560, loss=8.840905, lr=0.001044 | ETA 06:06:36
2022-07-25 04:24:42,229 - INFO - [TRAIN] epoch=3/160, iter=340/18560, loss=8.892619, lr=0.001046 | ETA 06:13:13
2022-07-25 04:25:24,527 - INFO - [TRAIN] epoch=4/160, iter=350/18560, loss=8.946115, lr=0.001049 | ETA 44:10:31
2022-07-25 04:25:36,920 - INFO - [TRAIN] epoch=4/160, iter=360/18560, loss=8.495484, lr=0.001052 | ETA 06:15:53
2022-07-25 04:25:49,155 - INFO - [TRAIN] epoch=4/160, iter=370/18560, loss=8.018198, lr=0.001055 | ETA 06:15:31
2022-07-25 04:26:01,500 - INFO - [TRAIN] epoch=4/160, iter=380/18560, loss=8.260845, lr=0.001058 | ETA 06:14:32
2022-07-25 04:26:13,690 - INFO - [TRAIN] epoch=4/160, iter=390/18560, loss=8.860928, lr=0.001061 | ETA 06:09:15
2022-07-25 04:26:25,833 - INFO - [TRAIN] epoch=4/160, iter=400/18560, loss=8.445970, lr=0.001064 | ETA 06:12:25
2022-07-25 04:26:37,957 - INFO - [TRAIN] epoch=4/160, iter=410/18560, loss=7.831439, lr=0.001067 | ETA 06:06:49
2022-07-25 04:26:50,383 - INFO - [TRAIN] epoch=4/160, iter=420/18560, loss=7.939265, lr=0.001071 | ETA 06:16:37
2022-07-25 04:27:02,727 - INFO - [TRAIN] epoch=4/160, iter=430/18560, loss=7.715253, lr=0.001074 | ETA 06:06:37
2022-07-25 04:27:14,794 - INFO - [TRAIN] epoch=4/160, iter=440/18560, loss=8.015933, lr=0.001077 | ETA 06:00:57
2022-07-25 04:27:26,958 - INFO - [TRAIN] epoch=4/160, iter=450/18560, loss=8.175361, lr=0.001081 | ETA 06:06:39
2022-07-25 04:27:39,085 - INFO - [TRAIN] epoch=4/160, iter=460/18560, loss=7.739380, lr=0.001085 | ETA 06:02:06
2022-07-25 04:28:20,075 - INFO - [TRAIN] epoch=5/160, iter=470/18560, loss=7.857746, lr=0.001088 | ETA 08:22:24
2022-07-25 04:28:32,327 - INFO - [TRAIN] epoch=5/160, iter=480/18560, loss=7.416232, lr=0.001092 | ETA 06:07:54
2022-07-25 04:28:44,477 - INFO - [TRAIN] epoch=5/160, iter=490/18560, loss=7.361291, lr=0.001096 | ETA 06:10:12
2022-07-25 04:28:56,619 - INFO - [TRAIN] epoch=5/160, iter=500/18560, loss=7.564575, lr=0.001100 | ETA 06:05:49
2022-07-25 04:29:08,716 - INFO - [TRAIN] epoch=5/160, iter=510/18560, loss=7.819630, lr=0.001104 | ETA 06:04:37
2022-07-25 04:29:20,923 - INFO - [TRAIN] epoch=5/160, iter=520/18560, loss=7.624175, lr=0.001108 | ETA 06:06:55
2022-07-25 04:29:33,260 - INFO - [TRAIN] epoch=5/160, iter=530/18560, loss=7.719909, lr=0.001112 | ETA 06:11:08
2022-07-25 04:29:45,398 - INFO - [TRAIN] epoch=5/160, iter=540/18560, loss=7.787444, lr=0.001117 | ETA 05:59:11
2022-07-25 04:29:57,595 - INFO - [TRAIN] epoch=5/160, iter=550/18560, loss=7.495294, lr=0.001121 | ETA 06:06:16
2022-07-25 04:30:09,834 - INFO - [TRAIN] epoch=5/160, iter=560/18560, loss=7.704396, lr=0.001125 | ETA 06:09:35
2022-07-25 04:30:22,050 - INFO - [TRAIN] epoch=5/160, iter=570/18560, loss=6.924171, lr=0.001130 | ETA 06:08:56
2022-07-25 04:30:34,140 - INFO - [TRAIN] epoch=5/160, iter=580/18560, loss=7.622643, lr=0.001134 | ETA 06:02:03
2022-07-25 04:30:34,331 - INFO - Push model to checkpoint output_nospa/epoch_5
2022-07-25 04:31:14,513 - INFO - [TRAIN] epoch=6/160, iter=590/18560, loss=7.479352, lr=0.001139 | ETA 06:14:35
2022-07-25 04:31:26,639 - INFO - [TRAIN] epoch=6/160, iter=600/18560, loss=7.077827, lr=0.001144 | ETA 06:06:28
2022-07-25 04:31:38,931 - INFO - [TRAIN] epoch=6/160, iter=610/18560, loss=7.185942, lr=0.001149 | ETA 06:04:40
2022-07-25 04:31:51,428 - INFO - [TRAIN] epoch=6/160, iter=620/18560, loss=7.413507, lr=0.001153 | ETA 06:12:44
2022-07-25 04:32:03,775 - INFO - [TRAIN] epoch=6/160, iter=630/18560, loss=7.248437, lr=0.001158 | ETA 06:16:48
2022-07-25 04:32:16,035 - INFO - [TRAIN] epoch=6/160, iter=640/18560, loss=7.049243, lr=0.001164 | ETA 06:10:09
2022-07-25 04:32:28,295 - INFO - [TRAIN] epoch=6/160, iter=650/18560, loss=7.130634, lr=0.001169 | ETA 06:10:04
2022-07-25 04:32:40,498 - INFO - [TRAIN] epoch=6/160, iter=660/18560, loss=7.534462, lr=0.001174 | ETA 06:05:49
2022-07-25 04:32:52,804 - INFO - [TRAIN] epoch=6/160, iter=670/18560, loss=7.101891, lr=0.001179 | ETA 06:05:43
2022-07-25 04:33:05,213 - INFO - [TRAIN] epoch=6/160, iter=680/18560, loss=6.846934, lr=0.001184 | ETA 06:10:07
2022-07-25 04:33:17,516 - INFO - [TRAIN] epoch=6/160, iter=690/18560, loss=7.170656, lr=0.001190 | ETA 06:02:18
2022-07-25 04:33:57,375 - INFO - [TRAIN] epoch=7/160, iter=700/18560, loss=7.251814, lr=0.001195 | ETA 14:38:07
2022-07-25 04:34:09,621 - INFO - [TRAIN] epoch=7/160, iter=710/18560, loss=6.626792, lr=0.001201 | ETA 06:00:38
2022-07-25 04:34:21,894 - INFO - [TRAIN] epoch=7/160, iter=720/18560, loss=6.532502, lr=0.001207 | ETA 06:09:42
2022-07-25 04:34:34,179 - INFO - [TRAIN] epoch=7/160, iter=730/18560, loss=6.656493, lr=0.001212 | ETA 06:04:54
2022-07-25 04:34:46,609 - INFO - [TRAIN] epoch=7/160, iter=740/18560, loss=6.992539, lr=0.001218 | ETA 06:05:16
2022-07-25 04:34:58,737 - INFO - [TRAIN] epoch=7/160, iter=750/18560, loss=6.536272, lr=0.001224 | ETA 05:57:22
2022-07-25 04:35:10,981 - INFO - [TRAIN] epoch=7/160, iter=760/18560, loss=6.742303, lr=0.001230 | ETA 06:09:41
2022-07-25 04:35:23,212 - INFO - [TRAIN] epoch=7/160, iter=770/18560, loss=7.027552, lr=0.001236 | ETA 06:00:25
2022-07-25 04:35:35,577 - INFO - [TRAIN] epoch=7/160, iter=780/18560, loss=6.724438, lr=0.001242 | ETA 06:01:07
2022-07-25 04:35:47,757 - INFO - [TRAIN] epoch=7/160, iter=790/18560, loss=6.837561, lr=0.001248 | ETA 06:03:39
2022-07-25 04:36:00,048 - INFO - [TRAIN] epoch=7/160, iter=800/18560, loss=6.446725, lr=0.001255 | ETA 06:04:47
2022-07-25 04:36:12,297 - INFO - [TRAIN] epoch=7/160, iter=810/18560, loss=6.520721, lr=0.001261 | ETA 06:09:00
2022-07-25 04:36:54,945 - INFO - [TRAIN] epoch=8/160, iter=820/18560, loss=7.098329, lr=0.001268 | ETA 06:37:53
2022-07-25 04:37:07,162 - INFO - [TRAIN] epoch=8/160, iter=830/18560, loss=6.431383, lr=0.001274 | ETA 06:01:19
2022-07-25 04:37:19,611 - INFO - [TRAIN] epoch=8/160, iter=840/18560, loss=6.072237, lr=0.001281 | ETA 06:12:26
2022-07-25 04:37:32,248 - INFO - [TRAIN] epoch=8/160, iter=850/18560, loss=6.710091, lr=0.001287 | ETA 06:23:36
2022-07-25 04:37:44,669 - INFO - [TRAIN] epoch=8/160, iter=860/18560, loss=6.502271, lr=0.001294 | ETA 06:05:54
2022-07-25 04:37:57,158 - INFO - [TRAIN] epoch=8/160, iter=870/18560, loss=6.349365, lr=0.001301 | ETA 06:25:08
2022-07-25 04:38:09,713 - INFO - [TRAIN] epoch=8/160, iter=880/18560, loss=6.165489, lr=0.001308 | ETA 06:21:49
2022-07-25 04:38:22,289 - INFO - [TRAIN] epoch=8/160, iter=890/18560, loss=6.418764, lr=0.001315 | ETA 06:19:02
2022-07-25 04:38:34,617 - INFO - [TRAIN] epoch=8/160, iter=900/18560, loss=6.520349, lr=0.001322 | ETA 05:55:20
2022-07-25 04:38:47,149 - INFO - [TRAIN] epoch=8/160, iter=910/18560, loss=6.479019, lr=0.001329 | ETA 05:55:25
2022-07-25 04:38:59,724 - INFO - [TRAIN] epoch=8/160, iter=920/18560, loss=6.142260, lr=0.001336 | ETA 06:06:08
2022-07-25 04:39:47,792 - INFO - [TRAIN] epoch=9/160, iter=930/18560, loss=6.018819, lr=0.001343 | ETA 49:55:49
2022-07-25 04:40:00,339 - INFO - [TRAIN] epoch=9/160, iter=940/18560, loss=6.261985, lr=0.001351 | ETA 06:02:14
2022-07-25 04:40:12,748 - INFO - [TRAIN] epoch=9/160, iter=950/18560, loss=6.331125, lr=0.001358 | ETA 06:02:00
2022-07-25 04:40:25,364 - INFO - [TRAIN] epoch=9/160, iter=960/18560, loss=6.191347, lr=0.001365 | ETA 06:07:56
2022-07-25 04:40:37,640 - INFO - [TRAIN] epoch=9/160, iter=970/18560, loss=6.231148, lr=0.001373 | ETA 06:01:13
2022-07-25 04:40:49,936 - INFO - [TRAIN] epoch=9/160, iter=980/18560, loss=6.216935, lr=0.001381 | ETA 06:02:00
2022-07-25 04:41:02,453 - INFO - [TRAIN] epoch=9/160, iter=990/18560, loss=5.925360, lr=0.001388 | ETA 05:59:18
2022-07-25 04:41:14,808 - INFO - [TRAIN] epoch=9/160, iter=1000/18560, loss=6.272386, lr=0.001396 | ETA 06:00:27
2022-07-25 04:41:27,062 - INFO - [TRAIN] epoch=9/160, iter=1010/18560, loss=6.776674, lr=0.001404 | ETA 05:56:42
2022-07-25 04:41:39,398 - INFO - [TRAIN] epoch=9/160, iter=1020/18560, loss=6.256032, lr=0.001412 | ETA 06:03:20
2022-07-25 04:41:51,808 - INFO - [TRAIN] epoch=9/160, iter=1030/18560, loss=6.276560, lr=0.001420 | ETA 06:01:19
2022-07-25 04:42:04,335 - INFO - [TRAIN] epoch=9/160, iter=1040/18560, loss=6.037384, lr=0.001428 | ETA 05:57:26
2022-07-25 04:42:50,674 - INFO - [TRAIN] epoch=10/160, iter=1050/18560, loss=6.785346, lr=0.001436 | ETA 08:34:56
2022-07-25 04:43:03,035 - INFO - [TRAIN] epoch=10/160, iter=1060/18560, loss=6.244035, lr=0.001444 | ETA 05:55:19
2022-07-25 04:43:15,361 - INFO - [TRAIN] epoch=10/160, iter=1070/18560, loss=5.987175, lr=0.001453 | ETA 06:06:59
2022-07-25 04:43:27,859 - INFO - [TRAIN] epoch=10/160, iter=1080/18560, loss=6.100882, lr=0.001461 | ETA 05:59:08
2022-07-25 04:43:40,656 - INFO - [TRAIN] epoch=10/160, iter=1090/18560, loss=6.706870, lr=0.001469 | ETA 06:13:37
2022-07-25 04:43:53,007 - INFO - [TRAIN] epoch=10/160, iter=1100/18560, loss=5.554276, lr=0.001478 | ETA 05:59:11
2022-07-25 04:44:05,261 - INFO - [TRAIN] epoch=10/160, iter=1110/18560, loss=6.027318, lr=0.001487 | ETA 05:54:51
2022-07-25 04:44:17,721 - INFO - [TRAIN] epoch=10/160, iter=1120/18560, loss=6.405028, lr=0.001495 | ETA 06:03:52
2022-07-25 04:44:29,839 - INFO - [TRAIN] epoch=10/160, iter=1130/18560, loss=6.336456, lr=0.001504 | ETA 05:53:15
2022-07-25 04:44:42,236 - INFO - [TRAIN] epoch=10/160, iter=1140/18560, loss=5.938675, lr=0.001513 | ETA 05:57:49
2022-07-25 04:44:54,505 - INFO - [TRAIN] epoch=10/160, iter=1150/18560, loss=6.202242, lr=0.001522 | ETA 05:52:03
2022-07-25 04:45:06,872 - INFO - [TRAIN] epoch=10/160, iter=1160/18560, loss=6.348836, lr=0.001530 | ETA 06:00:38
2022-07-25 04:45:07,100 - INFO - Push model to checkpoint output_nospa/epoch_10
2022-07-25 04:45:53,722 - INFO - [TRAIN] epoch=11/160, iter=1170/18560, loss=6.066553, lr=0.001539 | ETA 06:05:53
2022-07-25 04:46:06,180 - INFO - [TRAIN] epoch=11/160, iter=1180/18560, loss=5.693972, lr=0.001549 | ETA 06:14:04
2022-07-25 04:46:18,418 - INFO - [TRAIN] epoch=11/160, iter=1190/18560, loss=5.688772, lr=0.001558 | ETA 05:53:05
2022-07-25 04:46:30,863 - INFO - [TRAIN] epoch=11/160, iter=1200/18560, loss=6.092091, lr=0.001567 | ETA 06:01:15
2022-07-25 04:46:43,203 - INFO - [TRAIN] epoch=11/160, iter=1210/18560, loss=5.698970, lr=0.001576 | ETA 05:56:46
2022-07-25 04:46:55,796 - INFO - [TRAIN] epoch=11/160, iter=1220/18560, loss=5.789239, lr=0.001586 | ETA 06:04:56
2022-07-25 04:47:08,435 - INFO - [TRAIN] epoch=11/160, iter=1230/18560, loss=6.218905, lr=0.001595 | ETA 06:07:40
2022-07-25 04:47:21,076 - INFO - [TRAIN] epoch=11/160, iter=1240/18560, loss=6.295112, lr=0.001604 | ETA 06:03:17
2022-07-25 04:47:33,917 - INFO - [TRAIN] epoch=11/160, iter=1250/18560, loss=6.218201, lr=0.001614 | ETA 06:12:26
2022-07-25 04:47:46,224 - INFO - [TRAIN] epoch=11/160, iter=1260/18560, loss=5.797707, lr=0.001624 | ETA 05:55:09
2022-07-25 04:47:58,827 - INFO - [TRAIN] epoch=11/160, iter=1270/18560, loss=5.756060, lr=0.001633 | ETA 05:53:52
2022-07-25 04:48:39,545 - INFO - [TRAIN] epoch=12/160, iter=1280/18560, loss=6.091652, lr=0.001643 | ETA 14:26:54
2022-07-25 04:48:51,887 - INFO - [TRAIN] epoch=12/160, iter=1290/18560, loss=5.803539, lr=0.001653 | ETA 06:01:22
2022-07-25 04:49:04,282 - INFO - [TRAIN] epoch=12/160, iter=1300/18560, loss=5.827503, lr=0.001663 | ETA 06:00:45
2022-07-25 04:49:16,808 - INFO - [TRAIN] epoch=12/160, iter=1310/18560, loss=5.975407, lr=0.001673 | ETA 05:58:04
2022-07-25 04:49:29,307 - INFO - [TRAIN] epoch=12/160, iter=1320/18560, loss=5.681089, lr=0.001683 | ETA 06:01:19
2022-07-25 04:49:41,709 - INFO - [TRAIN] epoch=12/160, iter=1330/18560, loss=5.580436, lr=0.001693 | ETA 06:00:55
2022-07-25 04:49:54,034 - INFO - [TRAIN] epoch=12/160, iter=1340/18560, loss=5.640012, lr=0.001703 | ETA 05:50:31
2022-07-25 04:50:06,380 - INFO - [TRAIN] epoch=12/160, iter=1350/18560, loss=5.702037, lr=0.001714 | ETA 05:55:51
2022-07-25 04:50:18,840 - INFO - [TRAIN] epoch=12/160, iter=1360/18560, loss=6.058827, lr=0.001724 | ETA 06:00:50
2022-07-25 04:50:31,244 - INFO - [TRAIN] epoch=12/160, iter=1370/18560, loss=5.693331, lr=0.001734 | ETA 05:48:42
2022-07-25 04:50:43,680 - INFO - [TRAIN] epoch=12/160, iter=1380/18560, loss=5.685148, lr=0.001745 | ETA 05:55:14
2022-07-25 04:50:56,061 - INFO - [TRAIN] epoch=12/160, iter=1390/18560, loss=5.711348, lr=0.001755 | ETA 05:49:34
2022-07-25 04:51:40,058 - INFO - [TRAIN] epoch=13/160, iter=1400/18560, loss=5.886821, lr=0.001766 | ETA 06:38:36
2022-07-25 04:51:52,628 - INFO - [TRAIN] epoch=13/160, iter=1410/18560, loss=5.597925, lr=0.001776 | ETA 05:56:21
2022-07-25 04:52:05,090 - INFO - [TRAIN] epoch=13/160, iter=1420/18560, loss=5.784278, lr=0.001787 | ETA 06:07:11
2022-07-25 04:52:17,640 - INFO - [TRAIN] epoch=13/160, iter=1430/18560, loss=5.728940, lr=0.001798 | ETA 05:49:48
2022-07-25 04:52:30,021 - INFO - [TRAIN] epoch=13/160, iter=1440/18560, loss=5.729424, lr=0.001809 | ETA 05:55:40
2022-07-25 04:52:42,717 - INFO - [TRAIN] epoch=13/160, iter=1450/18560, loss=5.155701, lr=0.001820 | ETA 05:57:54
2022-07-25 04:52:55,321 - INFO - [TRAIN] epoch=13/160, iter=1460/18560, loss=5.745545, lr=0.001831 | ETA 06:06:14
2022-07-25 04:53:07,644 - INFO - [TRAIN] epoch=13/160, iter=1470/18560, loss=5.723907, lr=0.001842 | ETA 05:45:44
2022-07-25 04:53:19,843 - INFO - [TRAIN] epoch=13/160, iter=1480/18560, loss=5.757863, lr=0.001853 | ETA 05:43:15
2022-07-25 04:53:32,130 - INFO - [TRAIN] epoch=13/160, iter=1490/18560, loss=5.861184, lr=0.001864 | ETA 05:51:25
2022-07-25 04:53:44,711 - INFO - [TRAIN] epoch=13/160, iter=1500/18560, loss=5.418906, lr=0.001875 | ETA 06:05:41
2022-07-25 04:54:32,337 - INFO - [TRAIN] epoch=14/160, iter=1510/18560, loss=5.610175, lr=0.001887 | ETA 48:22:56
2022-07-25 04:54:45,043 - INFO - [TRAIN] epoch=14/160, iter=1520/18560, loss=5.702305, lr=0.001898 | ETA 05:52:57
2022-07-25 04:54:57,345 - INFO - [TRAIN] epoch=14/160, iter=1530/18560, loss=5.269685, lr=0.001910 | ETA 05:52:49
2022-07-25 04:55:09,623 - INFO - [TRAIN] epoch=14/160, iter=1540/18560, loss=5.514036, lr=0.001921 | ETA 05:40:01
2022-07-25 04:55:21,859 - INFO - [TRAIN] epoch=14/160, iter=1550/18560, loss=5.551262, lr=0.001933 | ETA 05:47:37
2022-07-25 04:55:34,071 - INFO - [TRAIN] epoch=14/160, iter=1560/18560, loss=5.849044, lr=0.001944 | ETA 05:55:10
2022-07-25 04:55:46,524 - INFO - [TRAIN] epoch=14/160, iter=1570/18560, loss=5.436214, lr=0.001956 | ETA 05:57:35
2022-07-25 04:55:59,017 - INFO - [TRAIN] epoch=14/160, iter=1580/18560, loss=5.791935, lr=0.001968 | ETA 05:56:17
2022-07-25 04:56:11,474 - INFO - [TRAIN] epoch=14/160, iter=1590/18560, loss=5.454055, lr=0.001980 | ETA 05:47:53
2022-07-25 04:56:23,724 - INFO - [TRAIN] epoch=14/160, iter=1600/18560, loss=5.641341, lr=0.001991 | ETA 05:47:36
2022-07-25 04:56:36,189 - INFO - [TRAIN] epoch=14/160, iter=1610/18560, loss=5.425739, lr=0.002003 | ETA 05:46:20
2022-07-25 04:56:48,516 - INFO - [TRAIN] epoch=14/160, iter=1620/18560, loss=5.217917, lr=0.002015 | ETA 05:52:37
2022-07-25 04:57:31,377 - INFO - [TRAIN] epoch=15/160, iter=1630/18560, loss=5.590543, lr=0.002028 | ETA 08:10:57
2022-07-25 04:57:43,826 - INFO - [TRAIN] epoch=15/160, iter=1640/18560, loss=5.655483, lr=0.002040 | ETA 05:49:43
2022-07-25 04:57:56,315 - INFO - [TRAIN] epoch=15/160, iter=1650/18560, loss=5.497939, lr=0.002052 | ETA 05:49:46
2022-07-25 04:58:08,817 - INFO - [TRAIN] epoch=15/160, iter=1660/18560, loss=5.306262, lr=0.002064 | ETA 05:56:19
2022-07-25 04:58:21,066 - INFO - [TRAIN] epoch=15/160, iter=1670/18560, loss=5.411847, lr=0.002076 | ETA 05:46:57
2022-07-25 04:58:33,800 - INFO - [TRAIN] epoch=15/160, iter=1680/18560, loss=5.145459, lr=0.002089 | ETA 05:48:53
2022-07-25 04:58:46,363 - INFO - [TRAIN] epoch=15/160, iter=1690/18560, loss=5.093674, lr=0.002101 | ETA 05:48:41
2022-07-25 04:58:58,839 - INFO - [TRAIN] epoch=15/160, iter=1700/18560, loss=5.282515, lr=0.002114 | ETA 05:52:13
2022-07-25 04:59:11,134 - INFO - [TRAIN] epoch=15/160, iter=1710/18560, loss=5.457100, lr=0.002126 | ETA 05:44:52
2022-07-25 04:59:23,291 - INFO - [TRAIN] epoch=15/160, iter=1720/18560, loss=5.420542, lr=0.002139 | ETA 05:36:53
2022-07-25 04:59:35,429 - INFO - [TRAIN] epoch=15/160, iter=1730/18560, loss=5.043057, lr=0.002152 | ETA 05:41:26
2022-07-25 04:59:48,046 - INFO - [TRAIN] epoch=15/160, iter=1740/18560, loss=5.612023, lr=0.002164 | ETA 05:54:40
2022-07-25 04:59:48,235 - INFO - Push model to checkpoint output_nospa/epoch_15
2022-07-25 05:00:35,179 - INFO - [TRAIN] epoch=16/160, iter=1750/18560, loss=5.525722, lr=0.002177 | ETA 05:49:54
2022-07-25 05:00:47,452 - INFO - [TRAIN] epoch=16/160, iter=1760/18560, loss=5.443100, lr=0.002190 | ETA 05:43:58
2022-07-25 05:01:00,045 - INFO - [TRAIN] epoch=16/160, iter=1770/18560, loss=5.321769, lr=0.002203 | ETA 05:48:43
2022-07-25 05:01:12,323 - INFO - [TRAIN] epoch=16/160, iter=1780/18560, loss=5.506633, lr=0.002216 | ETA 05:41:02
2022-07-25 05:01:24,730 - INFO - [TRAIN] epoch=16/160, iter=1790/18560, loss=5.388914, lr=0.002229 | ETA 05:49:27
2022-07-25 05:01:36,990 - INFO - [TRAIN] epoch=16/160, iter=1800/18560, loss=5.452415, lr=0.002242 | ETA 05:41:53
2022-07-25 05:01:49,455 - INFO - [TRAIN] epoch=16/160, iter=1810/18560, loss=5.321888, lr=0.002255 | ETA 05:46:37
2022-07-25 05:02:01,951 - INFO - [TRAIN] epoch=16/160, iter=1820/18560, loss=5.354416, lr=0.002269 | ETA 05:52:32
2022-07-25 05:02:14,578 - INFO - [TRAIN] epoch=16/160, iter=1830/18560, loss=5.135136, lr=0.002282 | ETA 05:47:51
2022-07-25 05:02:27,043 - INFO - [TRAIN] epoch=16/160, iter=1840/18560, loss=5.463547, lr=0.002295 | ETA 05:51:05
2022-07-25 05:02:39,486 - INFO - [TRAIN] epoch=16/160, iter=1850/18560, loss=5.200702, lr=0.002309 | ETA 05:34:25
2022-07-25 05:03:24,456 - INFO - [TRAIN] epoch=17/160, iter=1860/18560, loss=5.400869, lr=0.002322 | ETA 15:10:34
2022-07-25 05:03:36,666 - INFO - [TRAIN] epoch=17/160, iter=1870/18560, loss=5.101751, lr=0.002336 | ETA 05:38:26
2022-07-25 05:03:48,809 - INFO - [TRAIN] epoch=17/160, iter=1880/18560, loss=5.168215, lr=0.002349 | ETA 05:40:26
2022-07-25 05:04:01,261 - INFO - [TRAIN] epoch=17/160, iter=1890/18560, loss=5.156201, lr=0.002363 | ETA 05:43:34
2022-07-25 05:04:13,465 - INFO - [TRAIN] epoch=17/160, iter=1900/18560, loss=5.617813, lr=0.002376 | ETA 05:36:34
2022-07-25 05:04:25,910 - INFO - [TRAIN] epoch=17/160, iter=1910/18560, loss=5.031548, lr=0.002390 | ETA 05:41:10
2022-07-25 05:04:38,108 - INFO - [TRAIN] epoch=17/160, iter=1920/18560, loss=5.416697, lr=0.002404 | ETA 05:40:10
2022-07-25 05:04:50,399 - INFO - [TRAIN] epoch=17/160, iter=1930/18560, loss=5.526572, lr=0.002418 | ETA 05:39:21
2022-07-25 05:05:02,477 - INFO - [TRAIN] epoch=17/160, iter=1940/18560, loss=5.482546, lr=0.002432 | ETA 05:35:36
2022-07-25 05:05:14,522 - INFO - [TRAIN] epoch=17/160, iter=1950/18560, loss=5.156214, lr=0.002446 | ETA 05:33:34
2022-07-25 05:05:26,911 - INFO - [TRAIN] epoch=17/160, iter=1960/18560, loss=5.071408, lr=0.002460 | ETA 05:46:16
2022-07-25 05:05:39,406 - INFO - [TRAIN] epoch=17/160, iter=1970/18560, loss=5.251765, lr=0.002474 | ETA 05:43:20
2022-07-25 05:06:27,980 - INFO - [TRAIN] epoch=18/160, iter=1980/18560, loss=5.417059, lr=0.002488 | ETA 06:17:30
2022-07-25 05:06:40,079 - INFO - [TRAIN] epoch=18/160, iter=1990/18560, loss=4.924671, lr=0.002502 | ETA 05:25:14
2022-07-25 05:06:52,334 - INFO - [TRAIN] epoch=18/160, iter=2000/18560, loss=5.107340, lr=0.002516 | ETA 05:35:51
2022-07-25 05:07:04,470 - INFO - [TRAIN] epoch=18/160, iter=2010/18560, loss=5.097789, lr=0.002531 | ETA 05:34:50
2022-07-25 05:07:16,829 - INFO - [TRAIN] epoch=18/160, iter=2020/18560, loss=5.210081, lr=0.002545 | ETA 05:34:45
2022-07-25 05:07:29,060 - INFO - [TRAIN] epoch=18/160, iter=2030/18560, loss=5.440179, lr=0.002559 | ETA 05:36:59
2022-07-25 05:07:41,328 - INFO - [TRAIN] epoch=18/160, iter=2040/18560, loss=5.357552, lr=0.002574 | ETA 05:40:04
2022-07-25 05:07:53,705 - INFO - [TRAIN] epoch=18/160, iter=2050/18560, loss=5.261802, lr=0.002588 | ETA 05:36:08
2022-07-25 05:08:06,135 - INFO - [TRAIN] epoch=18/160, iter=2060/18560, loss=5.255150, lr=0.002603 | ETA 05:42:31
2022-07-25 05:08:18,405 - INFO - [TRAIN] epoch=18/160, iter=2070/18560, loss=5.200585, lr=0.002617 | ETA 05:38:46
2022-07-25 05:08:30,743 - INFO - [TRAIN] epoch=18/160, iter=2080/18560, loss=4.958689, lr=0.002632 | ETA 05:37:12
2022-07-25 05:09:20,350 - INFO - [TRAIN] epoch=19/160, iter=2090/18560, loss=5.644729, lr=0.002647 | ETA 48:35:33
2022-07-25 05:09:32,758 - INFO - [TRAIN] epoch=19/160, iter=2100/18560, loss=5.069749, lr=0.002661 | ETA 05:37:31
2022-07-25 05:09:45,070 - INFO - [TRAIN] epoch=19/160, iter=2110/18560, loss=4.826160, lr=0.002676 | ETA 05:35:54
2022-07-25 05:09:57,379 - INFO - [TRAIN] epoch=19/160, iter=2120/18560, loss=4.954620, lr=0.002691 | ETA 05:39:01
2022-07-25 05:10:09,847 - INFO - [TRAIN] epoch=19/160, iter=2130/18560, loss=5.527720, lr=0.002706 | ETA 05:42:53
2022-07-25 05:10:22,400 - INFO - [TRAIN] epoch=19/160, iter=2140/18560, loss=4.959599, lr=0.002721 | ETA 05:42:29
2022-07-25 05:10:34,845 - INFO - [TRAIN] epoch=19/160, iter=2150/18560, loss=4.966246, lr=0.002736 | ETA 05:41:19
2022-07-25 05:10:47,283 - INFO - [TRAIN] epoch=19/160, iter=2160/18560, loss=5.449213, lr=0.002751 | ETA 05:42:08
2022-07-25 05:10:59,806 - INFO - [TRAIN] epoch=19/160, iter=2170/18560, loss=5.305823, lr=0.002766 | ETA 05:40:38
2022-07-25 05:11:12,191 - INFO - [TRAIN] epoch=19/160, iter=2180/18560, loss=5.174602, lr=0.002781 | ETA 05:37:17
2022-07-25 05:11:24,572 - INFO - [TRAIN] epoch=19/160, iter=2190/18560, loss=5.238507, lr=0.002796 | ETA 05:35:51
2022-07-25 05:11:36,998 - INFO - [TRAIN] epoch=19/160, iter=2200/18560, loss=5.347973, lr=0.002812 | ETA 05:41:13
2022-07-25 05:12:18,434 - INFO - [TRAIN] epoch=20/160, iter=2210/18560, loss=5.514114, lr=0.002827 | ETA 07:45:57
2022-07-25 05:12:30,726 - INFO - [TRAIN] epoch=20/160, iter=2220/18560, loss=4.963869, lr=0.002842 | ETA 05:35:20
2022-07-25 05:12:42,990 - INFO - [TRAIN] epoch=20/160, iter=2230/18560, loss=4.970032, lr=0.002858 | ETA 05:32:00
2022-07-25 05:12:55,286 - INFO - [TRAIN] epoch=20/160, iter=2240/18560, loss=5.195066, lr=0.002873 | ETA 05:36:00
2022-07-25 05:13:07,656 - INFO - [TRAIN] epoch=20/160, iter=2250/18560, loss=5.025692, lr=0.002889 | ETA 05:35:36
2022-07-25 05:13:19,990 - INFO - [TRAIN] epoch=20/160, iter=2260/18560, loss=5.167059, lr=0.002904 | ETA 05:38:03
2022-07-25 05:13:32,386 - INFO - [TRAIN] epoch=20/160, iter=2270/18560, loss=5.016919, lr=0.002920 | ETA 05:32:30
2022-07-25 05:13:44,606 - INFO - [TRAIN] epoch=20/160, iter=2280/18560, loss=5.225919, lr=0.002935 | ETA 05:32:41
2022-07-25 05:13:56,998 - INFO - [TRAIN] epoch=20/160, iter=2290/18560, loss=5.299324, lr=0.002951 | ETA 05:31:53
2022-07-25 05:14:09,274 - INFO - [TRAIN] epoch=20/160, iter=2300/18560, loss=5.130039, lr=0.002967 | ETA 05:32:48
2022-07-25 05:14:21,563 - INFO - [TRAIN] epoch=20/160, iter=2310/18560, loss=4.902270, lr=0.002983 | ETA 05:34:00
2022-07-25 05:14:33,961 - INFO - [TRAIN] epoch=20/160, iter=2320/18560, loss=5.126493, lr=0.002998 | ETA 05:40:43
2022-07-25 05:14:34,160 - INFO - Push model to checkpoint output_nospa/epoch_20
2022-07-25 05:15:16,057 - INFO - [TRAIN] epoch=21/160, iter=2330/18560, loss=5.382560, lr=0.003014 | ETA 05:40:04
2022-07-25 05:15:28,457 - INFO - [TRAIN] epoch=21/160, iter=2340/18560, loss=4.969278, lr=0.003030 | ETA 05:36:19
2022-07-25 05:15:40,815 - INFO - [TRAIN] epoch=21/160, iter=2350/18560, loss=4.707936, lr=0.003046 | ETA 05:29:23
2022-07-25 05:15:53,054 - INFO - [TRAIN] epoch=21/160, iter=2360/18560, loss=5.201287, lr=0.003062 | ETA 05:27:40
2022-07-25 05:16:05,403 - INFO - [TRAIN] epoch=21/160, iter=2370/18560, loss=4.794383, lr=0.003078 | ETA 05:34:40
2022-07-25 05:16:17,668 - INFO - [TRAIN] epoch=21/160, iter=2380/18560, loss=4.763452, lr=0.003094 | ETA 05:30:13
2022-07-25 05:16:29,977 - INFO - [TRAIN] epoch=21/160, iter=2390/18560, loss=5.117810, lr=0.003110 | ETA 05:23:54
2022-07-25 05:16:42,246 - INFO - [TRAIN] epoch=21/160, iter=2400/18560, loss=5.277654, lr=0.003126 | ETA 05:34:49
2022-07-25 05:16:54,575 - INFO - [TRAIN] epoch=21/160, iter=2410/18560, loss=5.033292, lr=0.003143 | ETA 05:29:29
2022-07-25 05:17:06,845 - INFO - [TRAIN] epoch=21/160, iter=2420/18560, loss=4.942200, lr=0.003159 | ETA 05:30:47
2022-07-25 05:17:19,095 - INFO - [TRAIN] epoch=21/160, iter=2430/18560, loss=4.833821, lr=0.003175 | ETA 05:25:47
2022-07-25 05:17:58,914 - INFO - [TRAIN] epoch=22/160, iter=2440/18560, loss=5.245251, lr=0.003191 | ETA 13:13:04
2022-07-25 05:18:11,361 - INFO - [TRAIN] epoch=22/160, iter=2450/18560, loss=5.316954, lr=0.003208 | ETA 05:33:00
2022-07-25 05:18:23,690 - INFO - [TRAIN] epoch=22/160, iter=2460/18560, loss=4.857258, lr=0.003224 | ETA 05:33:15
2022-07-25 05:18:36,064 - INFO - [TRAIN] epoch=22/160, iter=2470/18560, loss=5.066523, lr=0.003241 | ETA 05:26:38
2022-07-25 05:18:48,379 - INFO - [TRAIN] epoch=22/160, iter=2480/18560, loss=5.268293, lr=0.003257 | ETA 05:27:03
2022-07-25 05:19:00,761 - INFO - [TRAIN] epoch=22/160, iter=2490/18560, loss=4.722056, lr=0.003274 | ETA 05:34:21
2022-07-25 05:19:13,025 - INFO - [TRAIN] epoch=22/160, iter=2500/18560, loss=5.012048, lr=0.003290 | ETA 05:29:44
2022-07-25 05:19:25,336 - INFO - [TRAIN] epoch=22/160, iter=2510/18560, loss=4.810682, lr=0.003307 | ETA 05:26:09
2022-07-25 05:19:37,600 - INFO - [TRAIN] epoch=22/160, iter=2520/18560, loss=5.374000, lr=0.003324 | ETA 05:29:42
2022-07-25 05:19:49,811 - INFO - [TRAIN] epoch=22/160, iter=2530/18560, loss=4.869002, lr=0.003340 | ETA 05:23:09
2022-07-25 05:20:02,170 - INFO - [TRAIN] epoch=22/160, iter=2540/18560, loss=5.040904, lr=0.003357 | ETA 05:28:22
2022-07-25 05:20:14,507 - INFO - [TRAIN] epoch=22/160, iter=2550/18560, loss=4.845549, lr=0.003374 | ETA 05:28:48
2022-07-25 05:20:55,130 - INFO - [TRAIN] epoch=23/160, iter=2560/18560, loss=4.904437, lr=0.003390 | ETA 05:52:12
2022-07-25 05:21:07,405 - INFO - [TRAIN] epoch=23/160, iter=2570/18560, loss=4.552978, lr=0.003407 | ETA 05:31:52
2022-07-25 05:21:19,783 - INFO - [TRAIN] epoch=23/160, iter=2580/18560, loss=4.947739, lr=0.003424 | ETA 05:28:52
2022-07-25 05:21:32,040 - INFO - [TRAIN] epoch=23/160, iter=2590/18560, loss=5.033132, lr=0.003441 | ETA 05:27:06
2022-07-25 05:21:44,478 - INFO - [TRAIN] epoch=23/160, iter=2600/18560,
loss=5.243205, lr=0.003458 | ETA 05:29:18 2022-07-25 05:21:56,979 - INFO - [TRAIN] epoch=23/160, iter=2610/18560, loss=4.743608, lr=0.003475 | ETA 05:30:35 2022-07-25 05:22:09,307 - INFO - [TRAIN] epoch=23/160, iter=2620/18560, loss=4.746920, lr=0.003492 | ETA 05:23:34 2022-07-25 05:22:21,594 - INFO - [TRAIN] epoch=23/160, iter=2630/18560, loss=5.214561, lr=0.003509 | ETA 05:19:03 2022-07-25 05:22:33,980 - INFO - [TRAIN] epoch=23/160, iter=2640/18560, loss=4.871253, lr=0.003526 | ETA 05:30:37 2022-07-25 05:22:46,386 - INFO - [TRAIN] epoch=23/160, iter=2650/18560, loss=4.814797, lr=0.003543 | ETA 05:31:56 2022-07-25 05:22:58,663 - INFO - [TRAIN] epoch=23/160, iter=2660/18560, loss=4.745772, lr=0.003561 | ETA 05:25:56 2022-07-25 05:23:38,211 - INFO - [TRAIN] epoch=24/160, iter=2670/18560, loss=4.986017, lr=0.003578 | ETA 35:31:03 2022-07-25 05:23:50,689 - INFO - [TRAIN] epoch=24/160, iter=2680/18560, loss=5.020557, lr=0.003595 | ETA 05:34:39 2022-07-25 05:24:03,244 - INFO - [TRAIN] epoch=24/160, iter=2690/18560, loss=5.084521, lr=0.003612 | ETA 05:33:59 2022-07-25 05:24:15,561 - INFO - [TRAIN] epoch=24/160, iter=2700/18560, loss=4.560833, lr=0.003630 | ETA 05:20:08 2022-07-25 05:24:27,948 - INFO - [TRAIN] epoch=24/160, iter=2710/18560, loss=4.807232, lr=0.003647 | ETA 05:28:41 2022-07-25 05:24:40,164 - INFO - [TRAIN] epoch=24/160, iter=2720/18560, loss=4.946429, lr=0.003664 | ETA 05:20:20 2022-07-25 05:24:52,470 - INFO - [TRAIN] epoch=24/160, iter=2730/18560, loss=4.674865, lr=0.003682 | ETA 05:24:45 2022-07-25 05:25:04,748 - INFO - [TRAIN] epoch=24/160, iter=2740/18560, loss=4.815550, lr=0.003699 | ETA 05:25:18 2022-07-25 05:25:17,127 - INFO - [TRAIN] epoch=24/160, iter=2750/18560, loss=4.919056, lr=0.003717 | ETA 05:27:08 2022-07-25 05:25:29,392 - INFO - [TRAIN] epoch=24/160, iter=2760/18560, loss=5.359981, lr=0.003734 | ETA 05:20:29 2022-07-25 05:25:41,736 - INFO - [TRAIN] epoch=24/160, iter=2770/18560, loss=4.714920, lr=0.003752 | ETA 05:31:13 2022-07-25 
05:25:54,024 - INFO - [TRAIN] epoch=24/160, iter=2780/18560, loss=4.496359, lr=0.003769 | ETA 05:21:48 2022-07-25 05:26:34,015 - INFO - [TRAIN] epoch=25/160, iter=2790/18560, loss=4.976104, lr=0.003787 | ETA 07:17:22 2022-07-25 05:26:46,263 - INFO - [TRAIN] epoch=25/160, iter=2800/18560, loss=4.653023, lr=0.003804 | ETA 05:16:49 2022-07-25 05:26:58,528 - INFO - [TRAIN] epoch=25/160, iter=2810/18560, loss=4.288504, lr=0.003822 | ETA 05:18:00 2022-07-25 05:27:11,045 - INFO - [TRAIN] epoch=25/160, iter=2820/18560, loss=4.914642, lr=0.003840 | ETA 05:28:04 2022-07-25 05:27:23,408 - INFO - [TRAIN] epoch=25/160, iter=2830/18560, loss=4.807224, lr=0.003857 | ETA 05:22:27 2022-07-25 05:27:35,749 - INFO - [TRAIN] epoch=25/160, iter=2840/18560, loss=4.665361, lr=0.003875 | ETA 05:24:30 2022-07-25 05:27:48,065 - INFO - [TRAIN] epoch=25/160, iter=2850/18560, loss=4.660753, lr=0.003893 | ETA 05:27:40 2022-07-25 05:28:00,380 - INFO - [TRAIN] epoch=25/160, iter=2860/18560, loss=4.722737, lr=0.003911 | ETA 05:18:11 2022-07-25 05:28:12,682 - INFO - [TRAIN] epoch=25/160, iter=2870/18560, loss=4.899332, lr=0.003929 | ETA 05:22:42 2022-07-25 05:28:24,964 - INFO - [TRAIN] epoch=25/160, iter=2880/18560, loss=5.111460, lr=0.003946 | ETA 05:19:42 2022-07-25 05:28:37,268 - INFO - [TRAIN] epoch=25/160, iter=2890/18560, loss=4.738038, lr=0.003964 | ETA 05:23:30 2022-07-25 05:28:49,563 - INFO - [TRAIN] epoch=25/160, iter=2900/18560, loss=5.043152, lr=0.003982 | ETA 05:17:39 2022-07-25 05:28:49,744 - INFO - Push model to checkpoint output_nospa/epoch_25 2022-07-25 05:29:29,446 - INFO - [TRAIN] epoch=26/160, iter=2910/18560, loss=5.101226, lr=0.004000 | ETA 05:26:38 2022-07-25 05:29:41,703 - INFO - [TRAIN] epoch=26/160, iter=2920/18560, loss=4.339637, lr=0.004018 | ETA 05:15:50 2022-07-25 05:29:54,103 - INFO - [TRAIN] epoch=26/160, iter=2930/18560, loss=4.565109, lr=0.004036 | ETA 05:22:05 2022-07-25 05:30:06,559 - INFO - [TRAIN] epoch=26/160, iter=2940/18560, loss=4.918642, lr=0.004054 | ETA 
05:25:48 2022-07-25 05:30:18,828 - INFO - [TRAIN] epoch=26/160, iter=2950/18560, loss=4.918286, lr=0.004072 | ETA 05:18:32 2022-07-25 05:30:31,075 - INFO - [TRAIN] epoch=26/160, iter=2960/18560, loss=4.650028, lr=0.004090 | ETA 05:18:48 2022-07-25 05:30:43,447 - INFO - [TRAIN] epoch=26/160, iter=2970/18560, loss=4.779720, lr=0.004108 | ETA 05:18:28 2022-07-25 05:30:55,805 - INFO - [TRAIN] epoch=26/160, iter=2980/18560, loss=4.883163, lr=0.004126 | ETA 05:21:58 2022-07-25 05:31:08,133 - INFO - [TRAIN] epoch=26/160, iter=2990/18560, loss=5.251983, lr=0.004145 | ETA 05:21:18 2022-07-25 05:31:20,487 - INFO - [TRAIN] epoch=26/160, iter=3000/18560, loss=5.072005, lr=0.004163 | ETA 05:17:16 2022-07-25 05:31:32,759 - INFO - [TRAIN] epoch=26/160, iter=3010/18560, loss=4.769956, lr=0.004181 | ETA 05:15:40 2022-07-25 05:32:13,203 - INFO - [TRAIN] epoch=27/160, iter=3020/18560, loss=4.809365, lr=0.004199 | ETA 12:53:38 2022-07-25 05:32:25,672 - INFO - [TRAIN] epoch=27/160, iter=3030/18560, loss=5.027112, lr=0.004217 | ETA 05:18:41 2022-07-25 05:32:38,011 - INFO - [TRAIN] epoch=27/160, iter=3040/18560, loss=4.576209, lr=0.004236 | ETA 05:19:04 2022-07-25 05:32:50,208 - INFO - [TRAIN] epoch=27/160, iter=3050/18560, loss=4.510291, lr=0.004254 | ETA 05:18:44 2022-07-25 05:33:02,622 - INFO - [TRAIN] epoch=27/160, iter=3060/18560, loss=4.840169, lr=0.004272 | ETA 05:22:51 2022-07-25 05:33:15,023 - INFO - [TRAIN] epoch=27/160, iter=3070/18560, loss=4.466704, lr=0.004291 | ETA 05:18:13 2022-07-25 05:33:27,489 - INFO - [TRAIN] epoch=27/160, iter=3080/18560, loss=4.525309, lr=0.004309 | ETA 05:20:46 2022-07-25 05:33:39,904 - INFO - [TRAIN] epoch=27/160, iter=3090/18560, loss=4.630618, lr=0.004327 | ETA 05:20:09 2022-07-25 05:33:52,287 - INFO - [TRAIN] epoch=27/160, iter=3100/18560, loss=4.874683, lr=0.004346 | ETA 05:21:01 2022-07-25 05:34:04,614 - INFO - [TRAIN] epoch=27/160, iter=3110/18560, loss=4.682116, lr=0.004364 | ETA 05:14:58 2022-07-25 05:34:17,017 - INFO - [TRAIN] 
epoch=27/160, iter=3120/18560, loss=4.836331, lr=0.004383 | ETA 05:15:45 2022-07-25 05:34:29,211 - INFO - [TRAIN] epoch=27/160, iter=3130/18560, loss=4.861477, lr=0.004401 | ETA 05:14:18 2022-07-25 05:35:08,962 - INFO - [TRAIN] epoch=28/160, iter=3140/18560, loss=4.601635, lr=0.004420 | ETA 05:43:44 2022-07-25 05:35:21,373 - INFO - [TRAIN] epoch=28/160, iter=3150/18560, loss=4.761238, lr=0.004438 | ETA 05:16:43 2022-07-25 05:35:33,838 - INFO - [TRAIN] epoch=28/160, iter=3160/18560, loss=4.556197, lr=0.004457 | ETA 05:20:35 2022-07-25 05:35:46,273 - INFO - [TRAIN] epoch=28/160, iter=3170/18560, loss=4.776131, lr=0.004475 | ETA 05:16:08 2022-07-25 05:35:58,595 - INFO - [TRAIN] epoch=28/160, iter=3180/18560, loss=4.840267, lr=0.004494 | ETA 05:16:25 2022-07-25 05:36:11,032 - INFO - [TRAIN] epoch=28/160, iter=3190/18560, loss=4.615585, lr=0.004512 | ETA 05:18:05 2022-07-25 05:36:23,268 - INFO - [TRAIN] epoch=28/160, iter=3200/18560, loss=5.052625, lr=0.004531 | ETA 05:15:09 2022-07-25 05:36:35,550 - INFO - [TRAIN] epoch=28/160, iter=3210/18560, loss=5.029736, lr=0.004549 | ETA 05:13:03 2022-07-25 05:36:47,888 - INFO - [TRAIN] epoch=28/160, iter=3220/18560, loss=4.685531, lr=0.004568 | ETA 05:09:20 2022-07-25 05:37:00,237 - INFO - [TRAIN] epoch=28/160, iter=3230/18560, loss=4.413394, lr=0.004587 | ETA 05:15:03 2022-07-25 05:37:12,719 - INFO - [TRAIN] epoch=28/160, iter=3240/18560, loss=4.534029, lr=0.004605 | ETA 05:17:08 2022-07-25 05:37:53,610 - INFO - [TRAIN] epoch=29/160, iter=3250/18560, loss=4.472505, lr=0.004624 | ETA 35:29:16 2022-07-25 05:38:06,171 - INFO - [TRAIN] epoch=29/160, iter=3260/18560, loss=4.548792, lr=0.004643 | ETA 05:20:15 2022-07-25 05:38:18,563 - INFO - [TRAIN] epoch=29/160, iter=3270/18560, loss=4.390855, lr=0.004661 | ETA 05:18:32 2022-07-25 05:38:30,940 - INFO - [TRAIN] epoch=29/160, iter=3280/18560, loss=4.172726, lr=0.004680 | ETA 05:14:01 2022-07-25 05:38:43,365 - INFO - [TRAIN] epoch=29/160, iter=3290/18560, loss=4.699131, lr=0.004699 | 
ETA 05:13:54 2022-07-25 05:38:55,738 - INFO - [TRAIN] epoch=29/160, iter=3300/18560, loss=4.583724, lr=0.004718 | ETA 05:15:23 2022-07-25 05:39:08,082 - INFO - [TRAIN] epoch=29/160, iter=3310/18560, loss=4.658811, lr=0.004736 | ETA 05:08:39 2022-07-25 05:39:20,292 - INFO - [TRAIN] epoch=29/160, iter=3320/18560, loss=4.635574, lr=0.004755 | ETA 05:07:19 2022-07-25 05:39:32,639 - INFO - [TRAIN] epoch=29/160, iter=3330/18560, loss=5.186536, lr=0.004774 | ETA 05:15:06 2022-07-25 05:39:44,992 - INFO - [TRAIN] epoch=29/160, iter=3340/18560, loss=4.856126, lr=0.004793 | ETA 05:17:03 2022-07-25 05:39:57,413 - INFO - [TRAIN] epoch=29/160, iter=3350/18560, loss=4.575888, lr=0.004811 | ETA 05:09:37 2022-07-25 05:40:09,691 - INFO - [TRAIN] epoch=29/160, iter=3360/18560, loss=4.520822, lr=0.004830 | ETA 05:15:21 2022-07-25 05:40:50,877 - INFO - [TRAIN] epoch=30/160, iter=3370/18560, loss=4.626788, lr=0.004849 | ETA 07:14:09 2022-07-25 05:41:03,249 - INFO - [TRAIN] epoch=30/160, iter=3380/18560, loss=4.513009, lr=0.004868 | ETA 05:12:36 2022-07-25 05:41:15,657 - INFO - [TRAIN] epoch=30/160, iter=3390/18560, loss=4.664869, lr=0.004887 | ETA 05:10:49 2022-07-25 05:41:27,986 - INFO - [TRAIN] epoch=30/160, iter=3400/18560, loss=4.826841, lr=0.004906 | ETA 05:11:13 2022-07-25 05:41:40,460 - INFO - [TRAIN] epoch=30/160, iter=3410/18560, loss=4.782374, lr=0.004925 | ETA 05:12:58 2022-07-25 05:41:52,781 - INFO - [TRAIN] epoch=30/160, iter=3420/18560, loss=4.257309, lr=0.004943 | ETA 05:11:15 2022-07-25 05:42:05,101 - INFO - [TRAIN] epoch=30/160, iter=3430/18560, loss=4.588744, lr=0.004962 | ETA 05:12:42 2022-07-25 05:42:17,435 - INFO - [TRAIN] epoch=30/160, iter=3440/18560, loss=4.544825, lr=0.004981 | ETA 05:07:11 2022-07-25 05:42:29,695 - INFO - [TRAIN] epoch=30/160, iter=3450/18560, loss=4.701871, lr=0.005000 | ETA 05:14:38 2022-07-25 05:42:41,997 - INFO - [TRAIN] epoch=30/160, iter=3460/18560, loss=4.637210, lr=0.005019 | ETA 05:12:09 2022-07-25 05:42:54,312 - INFO - [TRAIN] 
epoch=30/160, iter=3470/18560, loss=4.514697, lr=0.005038 | ETA 05:10:28 2022-07-25 05:43:06,649 - INFO - [TRAIN] epoch=30/160, iter=3480/18560, loss=4.814762, lr=0.005057 | ETA 05:08:13 2022-07-25 05:43:06,669 - INFO - Pop model from output_nospa/epoch_5 2022-07-25 05:43:06,828 - INFO - Push model to checkpoint output_nospa/epoch_30 2022-07-25 05:43:47,438 - INFO - [TRAIN] epoch=31/160, iter=3490/18560, loss=4.628254, lr=0.005076 | ETA 05:15:39 2022-07-25 05:43:59,737 - INFO - [TRAIN] epoch=31/160, iter=3500/18560, loss=4.695711, lr=0.005095 | ETA 05:09:37 2022-07-25 05:44:12,081 - INFO - [TRAIN] epoch=31/160, iter=3510/18560, loss=4.117195, lr=0.005114 | ETA 05:10:16 2022-07-25 05:44:24,387 - INFO - [TRAIN] epoch=31/160, iter=3520/18560, loss=4.921402, lr=0.005133 | ETA 05:03:34 2022-07-25 05:44:36,740 - INFO - [TRAIN] epoch=31/160, iter=3530/18560, loss=4.198396, lr=0.005152 | ETA 05:11:54 2022-07-25 05:44:49,075 - INFO - [TRAIN] epoch=31/160, iter=3540/18560, loss=4.466108, lr=0.005171 | ETA 05:05:37 2022-07-25 05:45:01,549 - INFO - [TRAIN] epoch=31/160, iter=3550/18560, loss=4.557206, lr=0.005190 | ETA 05:05:53 2022-07-25 05:45:13,982 - INFO - [TRAIN] epoch=31/160, iter=3560/18560, loss=4.832565, lr=0.005209 | ETA 05:09:38 2022-07-25 05:45:26,271 - INFO - [TRAIN] epoch=31/160, iter=3570/18560, loss=4.478818, lr=0.005228 | ETA 05:03:30 2022-07-25 05:45:38,547 - INFO - [TRAIN] epoch=31/160, iter=3580/18560, loss=4.750787, lr=0.005247 | ETA 05:08:13 2022-07-25 05:45:50,964 - INFO - [TRAIN] epoch=31/160, iter=3590/18560, loss=4.620765, lr=0.005266 | ETA 05:13:25 2022-07-25 05:46:31,290 - INFO - [TRAIN] epoch=32/160, iter=3600/18560, loss=4.864517, lr=0.005285 | ETA 12:24:36 2022-07-25 05:46:43,661 - INFO - [TRAIN] epoch=32/160, iter=3610/18560, loss=4.358212, lr=0.005304 | ETA 05:11:24 2022-07-25 05:46:56,084 - INFO - [TRAIN] epoch=32/160, iter=3620/18560, loss=4.701913, lr=0.005323 | ETA 05:07:59 2022-07-25 05:47:08,491 - INFO - [TRAIN] epoch=32/160, 
iter=3630/18560, loss=4.348534, lr=0.005342 | ETA 05:12:25 2022-07-25 05:47:20,802 - INFO - [TRAIN] epoch=32/160, iter=3640/18560, loss=4.646991, lr=0.005361 | ETA 05:00:44 2022-07-25 05:47:33,201 - INFO - [TRAIN] epoch=32/160, iter=3650/18560, loss=4.409439, lr=0.005380 | ETA 05:07:21 2022-07-25 05:47:45,465 - INFO - [TRAIN] epoch=32/160, iter=3660/18560, loss=4.266485, lr=0.005399 | ETA 05:01:42 2022-07-25 05:47:57,772 - INFO - [TRAIN] epoch=32/160, iter=3670/18560, loss=4.641358, lr=0.005418 | ETA 05:08:56 2022-07-25 05:48:10,085 - INFO - [TRAIN] epoch=32/160, iter=3680/18560, loss=4.784405, lr=0.005437 | ETA 05:03:12 2022-07-25 05:48:22,362 - INFO - [TRAIN] epoch=32/160, iter=3690/18560, loss=4.680830, lr=0.005456 | ETA 05:05:20 2022-07-25 05:48:34,769 - INFO - [TRAIN] epoch=32/160, iter=3700/18560, loss=4.642642, lr=0.005475 | ETA 05:03:22 2022-07-25 05:48:47,110 - INFO - [TRAIN] epoch=32/160, iter=3710/18560, loss=4.685710, lr=0.005494 | ETA 05:08:46 2022-07-25 05:49:26,922 - INFO - [TRAIN] epoch=33/160, iter=3720/18560, loss=4.621867, lr=0.005513 | ETA 05:25:40 2022-07-25 05:49:39,296 - INFO - [TRAIN] epoch=33/160, iter=3730/18560, loss=4.437127, lr=0.005532 | ETA 05:05:16 2022-07-25 05:49:51,643 - INFO - [TRAIN] epoch=33/160, iter=3740/18560, loss=4.314080, lr=0.005551 | ETA 05:01:16 2022-07-25 05:50:04,135 - INFO - [TRAIN] epoch=33/160, iter=3750/18560, loss=4.454548, lr=0.005570 | ETA 05:06:04 2022-07-25 05:50:16,556 - INFO - [TRAIN] epoch=33/160, iter=3760/18560, loss=4.559933, lr=0.005589 | ETA 05:02:47 2022-07-25 05:50:28,935 - INFO - [TRAIN] epoch=33/160, iter=3770/18560, loss=4.467214, lr=0.005609 | ETA 05:06:18 2022-07-25 05:50:41,528 - INFO - [TRAIN] epoch=33/160, iter=3780/18560, loss=4.448409, lr=0.005628 | ETA 05:08:10 2022-07-25 05:50:53,946 - INFO - [TRAIN] epoch=33/160, iter=3790/18560, loss=4.571214, lr=0.005647 | ETA 05:06:11 2022-07-25 05:51:06,312 - INFO - [TRAIN] epoch=33/160, iter=3800/18560, loss=4.420339, lr=0.005666 | ETA 05:03:58 
2022-07-25 05:51:18,829 - INFO - [TRAIN] epoch=33/160, iter=3810/18560, loss=4.586186, lr=0.005685 | ETA 05:07:07 2022-07-25 05:51:31,186 - INFO - [TRAIN] epoch=33/160, iter=3820/18560, loss=4.369352, lr=0.005704 | ETA 05:03:18 2022-07-25 05:52:10,591 - INFO - [TRAIN] epoch=34/160, iter=3830/18560, loss=4.397270, lr=0.005723 | ETA 32:43:02 2022-07-25 05:52:22,972 - INFO - [TRAIN] epoch=34/160, iter=3840/18560, loss=4.449230, lr=0.005742 | ETA 05:11:10 2022-07-25 05:52:35,429 - INFO - [TRAIN] epoch=34/160, iter=3850/18560, loss=4.498663, lr=0.005761 | ETA 05:10:13 2022-07-25 05:52:47,889 - INFO - [TRAIN] epoch=34/160, iter=3860/18560, loss=4.516539, lr=0.005780 | ETA 05:04:04 2022-07-25 05:53:00,141 - INFO - [TRAIN] epoch=34/160, iter=3870/18560, loss=5.000632, lr=0.005799 | ETA 04:58:29 2022-07-25 05:53:12,436 - INFO - [TRAIN] epoch=34/160, iter=3880/18560, loss=4.588281, lr=0.005818 | ETA 05:00:23 2022-07-25 05:53:24,702 - INFO - [TRAIN] epoch=34/160, iter=3890/18560, loss=4.165200, lr=0.005837 | ETA 04:55:07 2022-07-25 05:53:37,075 - INFO - [TRAIN] epoch=34/160, iter=3900/18560, loss=4.545746, lr=0.005856 | ETA 04:54:17 2022-07-25 05:53:49,443 - INFO - [TRAIN] epoch=34/160, iter=3910/18560, loss=4.678275, lr=0.005875 | ETA 04:56:14 2022-07-25 05:54:01,678 - INFO - [TRAIN] epoch=34/160, iter=3920/18560, loss=4.721560, lr=0.005894 | ETA 04:52:34 2022-07-25 05:54:13,976 - INFO - [TRAIN] epoch=34/160, iter=3930/18560, loss=4.483090, lr=0.005913 | ETA 05:04:11 2022-07-25 05:54:26,328 - INFO - [TRAIN] epoch=34/160, iter=3940/18560, loss=4.523378, lr=0.005932 | ETA 04:59:37 2022-07-25 05:55:07,237 - INFO - [TRAIN] epoch=35/160, iter=3950/18560, loss=4.525955, lr=0.005951 | ETA 06:55:31 2022-07-25 05:55:19,598 - INFO - [TRAIN] epoch=35/160, iter=3960/18560, loss=4.160894, lr=0.005969 | ETA 05:08:50 2022-07-25 05:55:31,882 - INFO - [TRAIN] epoch=35/160, iter=3970/18560, loss=4.218462, lr=0.005988 | ETA 04:59:32 2022-07-25 05:55:44,121 - INFO - [TRAIN] epoch=35/160, 
iter=3980/18560, loss=4.590217, lr=0.006007 | ETA 05:01:54 2022-07-25 05:55:56,551 - INFO - [TRAIN] epoch=35/160, iter=3990/18560, loss=4.653259, lr=0.006026 | ETA 05:00:40 2022-07-25 05:56:08,832 - INFO - [TRAIN] epoch=35/160, iter=4000/18560, loss=4.576318, lr=0.006045 | ETA 04:57:27 2022-07-25 05:56:21,420 - INFO - [TRAIN] epoch=35/160, iter=4010/18560, loss=4.504233, lr=0.006064 | ETA 05:03:50 2022-07-25 05:56:33,681 - INFO - [TRAIN] epoch=35/160, iter=4020/18560, loss=4.342125, lr=0.006083 | ETA 04:55:03 2022-07-25 05:56:46,017 - INFO - [TRAIN] epoch=35/160, iter=4030/18560, loss=4.580776, lr=0.006102 | ETA 04:55:40 2022-07-25 05:56:58,249 - INFO - [TRAIN] epoch=35/160, iter=4040/18560, loss=4.702483, lr=0.006121 | ETA 04:56:52 2022-07-25 05:57:10,470 - INFO - [TRAIN] epoch=35/160, iter=4050/18560, loss=4.580773, lr=0.006140 | ETA 04:57:40 2022-07-25 05:57:22,886 - INFO - [TRAIN] epoch=35/160, iter=4060/18560, loss=4.596805, lr=0.006158 | ETA 05:04:52 2022-07-25 05:57:22,907 - INFO - Pop model from output_nospa/epoch_10 2022-07-25 05:57:23,065 - INFO - Push model to checkpoint output_nospa/epoch_35 2022-07-25 05:58:04,079 - INFO - [TRAIN] epoch=36/160, iter=4070/18560, loss=4.510313, lr=0.006177 | ETA 05:07:38 2022-07-25 05:58:16,496 - INFO - [TRAIN] epoch=36/160, iter=4080/18560, loss=4.283119, lr=0.006196 | ETA 04:58:37 2022-07-25 05:58:28,980 - INFO - [TRAIN] epoch=36/160, iter=4090/18560, loss=4.106238, lr=0.006215 | ETA 05:02:30 2022-07-25 05:58:41,365 - INFO - [TRAIN] epoch=36/160, iter=4100/18560, loss=4.388396, lr=0.006234 | ETA 05:00:19 2022-07-25 05:58:53,603 - INFO - [TRAIN] epoch=36/160, iter=4110/18560, loss=4.488912, lr=0.006252 | ETA 04:53:13 2022-07-25 05:59:05,968 - INFO - [TRAIN] epoch=36/160, iter=4120/18560, loss=4.074219, lr=0.006271 | ETA 04:51:32 2022-07-25 05:59:18,333 - INFO - [TRAIN] epoch=36/160, iter=4130/18560, loss=4.193227, lr=0.006290 | ETA 04:57:43 2022-07-25 05:59:30,714 - INFO - [TRAIN] epoch=36/160, iter=4140/18560, 
loss=4.546161, lr=0.006309 | ETA 04:57:40 2022-07-25 05:59:43,011 - INFO - [TRAIN] epoch=36/160, iter=4150/18560, loss=4.405211, lr=0.006327 | ETA 04:53:17 2022-07-25 05:59:55,298 - INFO - [TRAIN] epoch=36/160, iter=4160/18560, loss=4.436981, lr=0.006346 | ETA 04:59:24 2022-07-25 06:00:07,729 - INFO - [TRAIN] epoch=36/160, iter=4170/18560, loss=4.642524, lr=0.006365 | ETA 04:58:27 2022-07-25 06:00:47,441 - INFO - [TRAIN] epoch=37/160, iter=4180/18560, loss=4.309879, lr=0.006384 | ETA 11:46:16 2022-07-25 06:00:59,885 - INFO - [TRAIN] epoch=37/160, iter=4190/18560, loss=4.865289, lr=0.006402 | ETA 04:58:24 2022-07-25 06:01:12,326 - INFO - [TRAIN] epoch=37/160, iter=4200/18560, loss=4.241851, lr=0.006421 | ETA 05:01:24 2022-07-25 06:01:24,638 - INFO - [TRAIN] epoch=37/160, iter=4210/18560, loss=4.334075, lr=0.006439 | ETA 04:54:24 2022-07-25 06:01:36,933 - INFO - [TRAIN] epoch=37/160, iter=4220/18560, loss=4.566226, lr=0.006458 | ETA 04:51:35 2022-07-25 06:01:49,309 - INFO - [TRAIN] epoch=37/160, iter=4230/18560, loss=4.374681, lr=0.006477 | ETA 04:56:07 2022-07-25 06:02:01,638 - INFO - [TRAIN] epoch=37/160, iter=4240/18560, loss=4.280169, lr=0.006495 | ETA 04:46:31 2022-07-25 06:02:14,028 - INFO - [TRAIN] epoch=37/160, iter=4250/18560, loss=4.428597, lr=0.006514 | ETA 05:00:54 2022-07-25 06:02:26,375 - INFO - [TRAIN] epoch=37/160, iter=4260/18560, loss=4.421861, lr=0.006532 | ETA 04:55:32 2022-07-25 06:02:38,755 - INFO - [TRAIN] epoch=37/160, iter=4270/18560, loss=4.902889, lr=0.006551 | ETA 04:54:20 2022-07-25 06:02:51,284 - INFO - [TRAIN] epoch=37/160, iter=4280/18560, loss=4.507574, lr=0.006569 | ETA 04:59:13 2022-07-25 06:03:03,545 - INFO - [TRAIN] epoch=37/160, iter=4290/18560, loss=4.414595, lr=0.006588 | ETA 04:49:12 2022-07-25 06:03:45,493 - INFO - [TRAIN] epoch=38/160, iter=4300/18560, loss=4.304848, lr=0.006606 | ETA 05:18:28 2022-07-25 06:03:57,872 - INFO - [TRAIN] epoch=38/160, iter=4310/18560, loss=4.070238, lr=0.006625 | ETA 04:58:50 2022-07-25 
06:04:10,298 - INFO - [TRAIN] epoch=38/160, iter=4320/18560, loss=4.138834, lr=0.006643 | ETA 04:51:05 2022-07-25 06:04:22,436 - INFO - [TRAIN] epoch=38/160, iter=4330/18560, loss=4.579633, lr=0.006662 | ETA 04:46:28 2022-07-25 06:04:34,615 - INFO - [TRAIN] epoch=38/160, iter=4340/18560, loss=4.163866, lr=0.006680 | ETA 04:53:29 2022-07-25 06:04:46,873 - INFO - [TRAIN] epoch=38/160, iter=4350/18560, loss=4.130135, lr=0.006698 | ETA 04:50:49 2022-07-25 06:04:59,175 - INFO - [TRAIN] epoch=38/160, iter=4360/18560, loss=4.447231, lr=0.006717 | ETA 04:55:16 2022-07-25 06:05:11,498 - INFO - [TRAIN] epoch=38/160, iter=4370/18560, loss=4.374534, lr=0.006735 | ETA 04:54:12 2022-07-25 06:05:23,931 - INFO - [TRAIN] epoch=38/160, iter=4380/18560, loss=4.440705, lr=0.006753 | ETA 04:46:16 2022-07-25 06:05:36,210 - INFO - [TRAIN] epoch=38/160, iter=4390/18560, loss=4.273660, lr=0.006772 | ETA 04:49:29 2022-07-25 06:05:48,666 - INFO - [TRAIN] epoch=38/160, iter=4400/18560, loss=4.293000, lr=0.006790 | ETA 04:55:02 2022-07-25 06:06:29,499 - INFO - [TRAIN] epoch=39/160, iter=4410/18560, loss=4.500637, lr=0.006808 | ETA 32:51:50 2022-07-25 06:06:41,906 - INFO - [TRAIN] epoch=39/160, iter=4420/18560, loss=4.387714, lr=0.006826 | ETA 05:05:00 2022-07-25 06:06:54,333 - INFO - [TRAIN] epoch=39/160, iter=4430/18560, loss=4.077034, lr=0.006844 | ETA 04:51:37 2022-07-25 06:07:06,685 - INFO - [TRAIN] epoch=39/160, iter=4440/18560, loss=4.499964, lr=0.006863 | ETA 04:50:12 2022-07-25 06:07:18,872 - INFO - [TRAIN] epoch=39/160, iter=4450/18560, loss=4.504192, lr=0.006881 | ETA 04:42:38 2022-07-25 06:07:31,138 - INFO - [TRAIN] epoch=39/160, iter=4460/18560, loss=4.278764, lr=0.006899 | ETA 04:48:22 2022-07-25 06:07:43,461 - INFO - [TRAIN] epoch=39/160, iter=4470/18560, loss=4.072549, lr=0.006917 | ETA 04:46:23 2022-07-25 06:07:55,799 - INFO - [TRAIN] epoch=39/160, iter=4480/18560, loss=4.429876, lr=0.006935 | ETA 04:51:06 2022-07-25 06:08:08,097 - INFO - [TRAIN] epoch=39/160, iter=4490/18560, 
loss=4.159875, lr=0.006953 | ETA 04:51:27 2022-07-25 06:08:20,455 - INFO - [TRAIN] epoch=39/160, iter=4500/18560, loss=4.214557, lr=0.006971 | ETA 04:46:19 2022-07-25 06:08:32,874 - INFO - [TRAIN] epoch=39/160, iter=4510/18560, loss=4.248653, lr=0.006989 | ETA 04:47:22 2022-07-25 06:08:45,053 - INFO - [TRAIN] epoch=39/160, iter=4520/18560, loss=4.120313, lr=0.007007 | ETA 04:42:20 2022-07-25 06:09:26,105 - INFO - [TRAIN] epoch=40/160, iter=4530/18560, loss=4.372016, lr=0.007025 | ETA 06:36:44 2022-07-25 06:09:38,346 - INFO - [TRAIN] epoch=40/160, iter=4540/18560, loss=4.292796, lr=0.007043 | ETA 04:50:38 2022-07-25 06:09:50,793 - INFO - [TRAIN] epoch=40/160, iter=4550/18560, loss=4.082600, lr=0.007061 | ETA 04:48:24 2022-07-25 06:10:03,266 - INFO - [TRAIN] epoch=40/160, iter=4560/18560, loss=4.143644, lr=0.007079 | ETA 04:51:21 2022-07-25 06:10:15,568 - INFO - [TRAIN] epoch=40/160, iter=4570/18560, loss=4.607028, lr=0.007096 | ETA 04:50:32 2022-07-25 06:10:27,893 - INFO - [TRAIN] epoch=40/160, iter=4580/18560, loss=4.108280, lr=0.007114 | ETA 04:49:08 2022-07-25 06:10:40,280 - INFO - [TRAIN] epoch=40/160, iter=4590/18560, loss=4.093552, lr=0.007132 | ETA 04:50:28 2022-07-25 06:10:52,436 - INFO - [TRAIN] epoch=40/160, iter=4600/18560, loss=4.556025, lr=0.007150 | ETA 04:43:48 2022-07-25 06:11:04,694 - INFO - [TRAIN] epoch=40/160, iter=4610/18560, loss=4.368289, lr=0.007167 | ETA 04:51:04 2022-07-25 06:11:17,101 - INFO - [TRAIN] epoch=40/160, iter=4620/18560, loss=4.651396, lr=0.007185 | ETA 04:49:59 2022-07-25 06:11:29,423 - INFO - [TRAIN] epoch=40/160, iter=4630/18560, loss=4.206117, lr=0.007203 | ETA 04:46:04 2022-07-25 06:11:41,783 - INFO - [TRAIN] epoch=40/160, iter=4640/18560, loss=4.230233, lr=0.007220 | ETA 04:48:41 2022-07-25 06:11:41,804 - INFO - Pop model from output_nospa/epoch_15 2022-07-25 06:11:41,954 - INFO - Push model to checkpoint output_nospa/epoch_40 2022-07-25 06:12:22,401 - INFO - [TRAIN] epoch=41/160, iter=4650/18560, loss=4.215172, 
lr=0.007238 | ETA 04:51:16 2022-07-25 06:12:34,672 - INFO - [TRAIN] epoch=41/160, iter=4660/18560, loss=4.003377, lr=0.007255 | ETA 04:45:20 2022-07-25 06:12:47,014 - INFO - [TRAIN] epoch=41/160, iter=4670/18560, loss=4.006104, lr=0.007273 | ETA 04:42:38 2022-07-25 06:12:59,343 - INFO - [TRAIN] epoch=41/160, iter=4680/18560, loss=4.076667, lr=0.007290 | ETA 04:45:28 2022-07-25 06:13:11,761 - INFO - [TRAIN] epoch=41/160, iter=4690/18560, loss=4.217104, lr=0.007308 | ETA 04:43:49 2022-07-25 06:13:24,083 - INFO - [TRAIN] epoch=41/160, iter=4700/18560, loss=4.255187, lr=0.007325 | ETA 04:45:38 2022-07-25 06:13:36,568 - INFO - [TRAIN] epoch=41/160, iter=4710/18560, loss=4.360073, lr=0.007343 | ETA 04:42:42 2022-07-25 06:13:48,892 - INFO - [TRAIN] epoch=41/160, iter=4720/18560, loss=4.660443, lr=0.007360 | ETA 04:40:13 2022-07-25 06:14:01,256 - INFO - [TRAIN] epoch=41/160, iter=4730/18560, loss=4.772773, lr=0.007377 | ETA 04:45:17 2022-07-25 06:14:13,634 - INFO - [TRAIN] epoch=41/160, iter=4740/18560, loss=4.238328, lr=0.007395 | ETA 04:42:11 2022-07-25 06:14:26,042 - INFO - [TRAIN] epoch=41/160, iter=4750/18560, loss=4.513209, lr=0.007412 | ETA 04:43:32 2022-07-25 06:15:06,860 - INFO - [TRAIN] epoch=42/160, iter=4760/18560, loss=4.246543, lr=0.007429 | ETA 11:37:02 2022-07-25 06:15:19,363 - INFO - [TRAIN] epoch=42/160, iter=4770/18560, loss=4.185734, lr=0.007446 | ETA 04:50:41 2022-07-25 06:15:31,688 - INFO - [TRAIN] epoch=42/160, iter=4780/18560, loss=3.914614, lr=0.007463 | ETA 04:42:39 2022-07-25 06:15:44,062 - INFO - [TRAIN] epoch=42/160, iter=4790/18560, loss=4.010086, lr=0.007481 | ETA 04:42:00 2022-07-25 06:15:56,368 - INFO - [TRAIN] epoch=42/160, iter=4800/18560, loss=4.446351, lr=0.007498 | ETA 04:43:44 2022-07-25 06:16:08,926 - INFO - [TRAIN] epoch=42/160, iter=4810/18560, loss=4.125952, lr=0.007515 | ETA 04:40:21 2022-07-25 06:16:21,293 - INFO - [TRAIN] epoch=42/160, iter=4820/18560, loss=3.975454, lr=0.007532 | ETA 04:47:01 2022-07-25 06:16:33,524 - INFO - 
[TRAIN] epoch=42/160, iter=4830/18560, loss=4.620260, lr=0.007549 | ETA 04:42:11 2022-07-25 06:16:45,801 - INFO - [TRAIN] epoch=42/160, iter=4840/18560, loss=4.182938, lr=0.007566 | ETA 04:41:33 2022-07-25 06:16:58,267 - INFO - [TRAIN] epoch=42/160, iter=4850/18560, loss=4.294937, lr=0.007583 | ETA 04:43:15 2022-07-25 06:17:10,638 - INFO - [TRAIN] epoch=42/160, iter=4860/18560, loss=4.251455, lr=0.007599 | ETA 04:40:05 2022-07-25 06:17:23,111 - INFO - [TRAIN] epoch=42/160, iter=4870/18560, loss=4.548231, lr=0.007616 | ETA 04:41:45 2022-07-25 06:18:03,838 - INFO - [TRAIN] epoch=43/160, iter=4880/18560, loss=4.517368, lr=0.007633 | ETA 05:03:07 2022-07-25 06:18:16,283 - INFO - [TRAIN] epoch=43/160, iter=4890/18560, loss=4.172319, lr=0.007650 | ETA 04:45:06 2022-07-25 06:18:28,728 - INFO - [TRAIN] epoch=43/160, iter=4900/18560, loss=3.907953, lr=0.007666 | ETA 04:45:10 2022-07-25 06:18:41,079 - INFO - [TRAIN] epoch=43/160, iter=4910/18560, loss=4.372966, lr=0.007683 | ETA 04:42:36 2022-07-25 06:18:53,438 - INFO - [TRAIN] epoch=43/160, iter=4920/18560, loss=4.178549, lr=0.007700 | ETA 04:42:03 2022-07-25 06:19:05,809 - INFO - [TRAIN] epoch=43/160, iter=4930/18560, loss=3.925505, lr=0.007716 | ETA 04:39:18 2022-07-25 06:19:18,279 - INFO - [TRAIN] epoch=43/160, iter=4940/18560, loss=4.254939, lr=0.007733 | ETA 04:46:43 2022-07-25 06:19:30,669 - INFO - [TRAIN] epoch=43/160, iter=4950/18560, loss=4.108193, lr=0.007749 | ETA 04:44:03 2022-07-25 06:19:42,977 - INFO - [TRAIN] epoch=43/160, iter=4960/18560, loss=4.158488, lr=0.007766 | ETA 04:42:16 2022-07-25 06:19:55,337 - INFO - [TRAIN] epoch=43/160, iter=4970/18560, loss=4.239687, lr=0.007782 | ETA 04:39:01 2022-07-25 06:20:07,726 - INFO - [TRAIN] epoch=43/160, iter=4980/18560, loss=4.251700, lr=0.007799 | ETA 04:38:18 2022-07-25 06:20:47,848 - INFO - [TRAIN] epoch=44/160, iter=4990/18560, loss=4.099160, lr=0.007815 | ETA 31:07:16 2022-07-25 06:21:00,089 - INFO - [TRAIN] epoch=44/160, iter=5000/18560, loss=4.270223, 
lr=0.007831 | ETA 04:40:34 2022-07-25 06:21:12,459 - INFO - [TRAIN] epoch=44/160, iter=5010/18560, loss=4.034685, lr=0.007848 | ETA 04:39:56 2022-07-25 06:21:24,837 - INFO - [TRAIN] epoch=44/160, iter=5020/18560, loss=4.005969, lr=0.007864 | ETA 04:36:05 2022-07-25 06:21:37,153 - INFO - [TRAIN] epoch=44/160, iter=5030/18560, loss=4.513103, lr=0.007880 | ETA 04:38:36 2022-07-25 06:21:49,543 - INFO - [TRAIN] epoch=44/160, iter=5040/18560, loss=4.004920, lr=0.007896 | ETA 04:36:21 2022-07-25 06:22:01,890 - INFO - [TRAIN] epoch=44/160, iter=5050/18560, loss=4.260841, lr=0.007912 | ETA 04:40:28 2022-07-25 06:22:14,096 - INFO - [TRAIN] epoch=44/160, iter=5060/18560, loss=3.996894, lr=0.007928 | ETA 04:34:18 2022-07-25 06:22:26,462 - INFO - [TRAIN] epoch=44/160, iter=5070/18560, loss=4.441635, lr=0.007944 | ETA 04:36:25 2022-07-25 06:22:38,760 - INFO - [TRAIN] epoch=44/160, iter=5080/18560, loss=4.050540, lr=0.007960 | ETA 04:36:34 2022-07-25 06:22:51,078 - INFO - [TRAIN] epoch=44/160, iter=5090/18560, loss=4.235694, lr=0.007976 | ETA 04:34:25 2022-07-25 06:23:03,442 - INFO - [TRAIN] epoch=44/160, iter=5100/18560, loss=4.115136, lr=0.007992 | ETA 04:38:27 2022-07-25 06:23:44,327 - INFO - [TRAIN] epoch=45/160, iter=5110/18560, loss=4.081525, lr=0.008008 | ETA 06:21:11 2022-07-25 06:23:56,672 - INFO - [TRAIN] epoch=45/160, iter=5120/18560, loss=4.070845, lr=0.008024 | ETA 04:39:08 2022-07-25 06:24:09,170 - INFO - [TRAIN] epoch=45/160, iter=5130/18560, loss=4.114677, lr=0.008040 | ETA 04:41:17 2022-07-25 06:24:21,487 - INFO - [TRAIN] epoch=45/160, iter=5140/18560, loss=4.490505, lr=0.008055 | ETA 04:35:29 2022-07-25 06:24:33,855 - INFO - [TRAIN] epoch=45/160, iter=5150/18560, loss=4.588013, lr=0.008071 | ETA 04:34:09 2022-07-25 06:24:46,263 - INFO - [TRAIN] epoch=45/160, iter=5160/18560, loss=4.124907, lr=0.008086 | ETA 04:35:50 2022-07-25 06:24:58,592 - INFO - [TRAIN] epoch=45/160, iter=5170/18560, loss=4.125379, lr=0.008102 | ETA 04:38:48 2022-07-25 06:25:10,963 - INFO - 
[TRAIN] epoch=45/160, iter=5180/18560, loss=4.274485, lr=0.008118 | ETA 04:33:47 2022-07-25 06:25:23,298 - INFO - [TRAIN] epoch=45/160, iter=5190/18560, loss=4.207156, lr=0.008133 | ETA 04:37:46 2022-07-25 06:25:35,628 - INFO - [TRAIN] epoch=45/160, iter=5200/18560, loss=4.120395, lr=0.008148 | ETA 04:35:43 2022-07-25 06:25:47,912 - INFO - [TRAIN] epoch=45/160, iter=5210/18560, loss=4.252621, lr=0.008164 | ETA 04:36:02 2022-07-25 06:26:00,367 - INFO - [TRAIN] epoch=45/160, iter=5220/18560, loss=4.238445, lr=0.008179 | ETA 04:42:18 2022-07-25 06:26:00,390 - INFO - Pop model from output_nospa/epoch_20 2022-07-25 06:26:00,548 - INFO - Push model to checkpoint output_nospa/epoch_45 2022-07-25 06:26:39,776 - INFO - [TRAIN] epoch=46/160, iter=5230/18560, loss=4.067903, lr=0.008194 | ETA 04:46:13 2022-07-25 06:26:52,168 - INFO - [TRAIN] epoch=46/160, iter=5240/18560, loss=4.285806, lr=0.008210 | ETA 04:30:37 2022-07-25 06:27:04,525 - INFO - [TRAIN] epoch=46/160, iter=5250/18560, loss=4.267852, lr=0.008225 | ETA 04:33:27 2022-07-25 06:27:16,896 - INFO - [TRAIN] epoch=46/160, iter=5260/18560, loss=4.625574, lr=0.008240 | ETA 04:35:26 2022-07-25 06:27:29,301 - INFO - [TRAIN] epoch=46/160, iter=5270/18560, loss=4.127513, lr=0.008255 | ETA 04:33:15 2022-07-25 06:27:41,558 - INFO - [TRAIN] epoch=46/160, iter=5280/18560, loss=4.226106, lr=0.008270 | ETA 04:32:01 2022-07-25 06:27:53,814 - INFO - [TRAIN] epoch=46/160, iter=5290/18560, loss=4.161022, lr=0.008285 | ETA 04:28:27 2022-07-25 06:28:06,153 - INFO - [TRAIN] epoch=46/160, iter=5300/18560, loss=4.201695, lr=0.008300 | ETA 04:31:48 2022-07-25 06:28:18,448 - INFO - [TRAIN] epoch=46/160, iter=5310/18560, loss=4.364436, lr=0.008315 | ETA 04:29:51 2022-07-25 06:28:30,803 - INFO - [TRAIN] epoch=46/160, iter=5320/18560, loss=4.119272, lr=0.008330 | ETA 04:32:26 2022-07-25 06:28:43,142 - INFO - [TRAIN] epoch=46/160, iter=5330/18560, loss=4.136035, lr=0.008344 | ETA 04:34:55 2022-07-25 06:29:24,240 - INFO - [TRAIN] epoch=47/160, 
iter=5340/18560, loss=4.466090, lr=0.008359 | ETA 11:10:59 2022-07-25 06:29:36,633 - INFO - [TRAIN] epoch=47/160, iter=5350/18560, loss=4.025458, lr=0.008374 | ETA 04:35:18 2022-07-25 06:29:49,138 - INFO - [TRAIN] epoch=47/160, iter=5360/18560, loss=4.170816, lr=0.008388 | ETA 04:36:54 2022-07-25 06:30:01,583 - INFO - [TRAIN] epoch=47/160, iter=5370/18560, loss=4.036781, lr=0.008403 | ETA 04:33:59 2022-07-25 06:30:13,989 - INFO - [TRAIN] epoch=47/160, iter=5380/18560, loss=4.212564, lr=0.008418 | ETA 04:30:13 2022-07-25 06:30:26,267 - INFO - [TRAIN] epoch=47/160, iter=5390/18560, loss=4.049428, lr=0.008432 | ETA 04:27:34 2022-07-25 06:30:38,635 - INFO - [TRAIN] epoch=47/160, iter=5400/18560, loss=4.028275, lr=0.008446 | ETA 04:32:30 2022-07-25 06:30:50,932 - INFO - [TRAIN] epoch=47/160, iter=5410/18560, loss=4.258045, lr=0.008461 | ETA 04:28:44 2022-07-25 06:31:03,344 - INFO - [TRAIN] epoch=47/160, iter=5420/18560, loss=4.171978, lr=0.008475 | ETA 04:30:22 2022-07-25 06:31:15,638 - INFO - [TRAIN] epoch=47/160, iter=5430/18560, loss=4.365320, lr=0.008489 | ETA 04:24:29 2022-07-25 06:31:27,915 - INFO - [TRAIN] epoch=47/160, iter=5440/18560, loss=3.958411, lr=0.008504 | ETA 04:25:13 2022-07-25 06:31:40,417 - INFO - [TRAIN] epoch=47/160, iter=5450/18560, loss=4.144155, lr=0.008518 | ETA 04:33:21 2022-07-25 06:32:20,891 - INFO - [TRAIN] epoch=48/160, iter=5460/18560, loss=4.208887, lr=0.008532 | ETA 04:58:36 2022-07-25 06:32:33,320 - INFO - [TRAIN] epoch=48/160, iter=5470/18560, loss=3.875103, lr=0.008546 | ETA 04:31:09 2022-07-25 06:32:45,757 - INFO - [TRAIN] epoch=48/160, iter=5480/18560, loss=4.056496, lr=0.008560 | ETA 04:32:07 2022-07-25 06:32:58,083 - INFO - [TRAIN] epoch=48/160, iter=5490/18560, loss=4.147155, lr=0.008574 | ETA 04:31:58 2022-07-25 06:33:10,518 - INFO - [TRAIN] epoch=48/160, iter=5500/18560, loss=4.217813, lr=0.008588 | ETA 04:30:00 2022-07-25 06:33:22,945 - INFO - [TRAIN] epoch=48/160, iter=5510/18560, loss=4.119701, lr=0.008602 | ETA 04:27:31 
2022-07-25 06:33:35,293 - INFO - [TRAIN] epoch=48/160, iter=5520/18560, loss=4.132248, lr=0.008615 | ETA 04:30:16 2022-07-25 06:33:47,707 - INFO - [TRAIN] epoch=48/160, iter=5530/18560, loss=4.418298, lr=0.008629 | ETA 04:24:56 2022-07-25 06:34:00,080 - INFO - [TRAIN] epoch=48/160, iter=5540/18560, loss=4.317847, lr=0.008643 | ETA 04:29:38 2022-07-25 06:34:12,557 - INFO - [TRAIN] epoch=48/160, iter=5550/18560, loss=3.930685, lr=0.008656 | ETA 04:32:01 2022-07-25 06:34:25,034 - INFO - [TRAIN] epoch=48/160, iter=5560/18560, loss=4.099585, lr=0.008670 | ETA 04:31:13 2022-07-25 06:35:05,846 - INFO - [TRAIN] epoch=49/160, iter=5570/18560, loss=4.089684, lr=0.008683 | ETA 30:06:25 2022-07-25 06:35:18,098 - INFO - [TRAIN] epoch=49/160, iter=5580/18560, loss=4.340945, lr=0.008697 | ETA 04:29:01 2022-07-25 06:35:30,502 - INFO - [TRAIN] epoch=49/160, iter=5590/18560, loss=3.898925, lr=0.008710 | ETA 04:25:17 2022-07-25 06:35:42,728 - INFO - [TRAIN] epoch=49/160, iter=5600/18560, loss=3.750471, lr=0.008723 | ETA 04:21:01 2022-07-25 06:35:55,109 - INFO - [TRAIN] epoch=49/160, iter=5610/18560, loss=4.229265, lr=0.008737 | ETA 04:24:31 2022-07-25 06:36:07,434 - INFO - [TRAIN] epoch=49/160, iter=5620/18560, loss=4.315656, lr=0.008750 | ETA 04:30:06 2022-07-25 06:36:19,939 - INFO - [TRAIN] epoch=49/160, iter=5630/18560, loss=3.747283, lr=0.008763 | ETA 04:32:20 2022-07-25 06:36:32,318 - INFO - [TRAIN] epoch=49/160, iter=5640/18560, loss=4.268133, lr=0.008776 | ETA 04:27:16 2022-07-25 06:36:44,810 - INFO - [TRAIN] epoch=49/160, iter=5650/18560, loss=4.636257, lr=0.008789 | ETA 04:29:14 2022-07-25 06:36:57,108 - INFO - [TRAIN] epoch=49/160, iter=5660/18560, loss=4.352783, lr=0.008802 | ETA 04:27:34 2022-07-25 06:37:09,498 - INFO - [TRAIN] epoch=49/160, iter=5670/18560, loss=4.252547, lr=0.008815 | ETA 04:28:54 2022-07-25 06:37:21,911 - INFO - [TRAIN] epoch=49/160, iter=5680/18560, loss=4.041420, lr=0.008828 | ETA 04:28:41 2022-07-25 06:38:01,337 - INFO - [TRAIN] epoch=50/160, 
iter=5690/18560, loss=4.314657, lr=0.008841 | ETA 05:54:40 2022-07-25 06:38:13,802 - INFO - [TRAIN] epoch=50/160, iter=5700/18560, loss=3.983416, lr=0.008853 | ETA 04:27:00 2022-07-25 06:38:26,221 - INFO - [TRAIN] epoch=50/160, iter=5710/18560, loss=3.842910, lr=0.008866 | ETA 04:24:17 2022-07-25 06:38:38,494 - INFO - [TRAIN] epoch=50/160, iter=5720/18560, loss=3.961211, lr=0.008879 | ETA 04:23:51 2022-07-25 06:38:50,831 - INFO - [TRAIN] epoch=50/160, iter=5730/18560, loss=4.036771, lr=0.008891 | ETA 04:24:06 2022-07-25 06:39:03,077 - INFO - [TRAIN] epoch=50/160, iter=5740/18560, loss=4.060882, lr=0.008904 | ETA 04:21:42 2022-07-25 06:39:15,441 - INFO - [TRAIN] epoch=50/160, iter=5750/18560, loss=3.969548, lr=0.008916 | ETA 04:22:49 2022-07-25 06:39:27,918 - INFO - [TRAIN] epoch=50/160, iter=5760/18560, loss=3.929360, lr=0.008928 | ETA 04:25:06 2022-07-25 06:39:40,289 - INFO - [TRAIN] epoch=50/160, iter=5770/18560, loss=4.223801, lr=0.008941 | ETA 04:20:51 2022-07-25 06:39:52,620 - INFO - [TRAIN] epoch=50/160, iter=5780/18560, loss=3.824294, lr=0.008953 | ETA 04:22:54 2022-07-25 06:40:04,977 - INFO - [TRAIN] epoch=50/160, iter=5790/18560, loss=4.286222, lr=0.008965 | ETA 04:23:11 2022-07-25 06:40:17,237 - INFO - [TRAIN] epoch=50/160, iter=5800/18560, loss=4.461258, lr=0.008977 | ETA 04:27:34 2022-07-25 06:40:17,257 - INFO - Pop model from output_nospa/epoch_25 2022-07-25 06:40:17,416 - INFO - Push model to checkpoint output_nospa/epoch_50 2022-07-25 06:40:58,652 - INFO - [TRAIN] epoch=51/160, iter=5810/18560, loss=4.175005, lr=0.008989 | ETA 04:24:35 2022-07-25 06:41:10,993 - INFO - [TRAIN] epoch=51/160, iter=5820/18560, loss=3.902872, lr=0.009001 | ETA 04:23:57 2022-07-25 06:41:23,481 - INFO - [TRAIN] epoch=51/160, iter=5830/18560, loss=4.402008, lr=0.009013 | ETA 04:24:11 2022-07-25 06:41:35,874 - INFO - [TRAIN] epoch=51/160, iter=5840/18560, loss=4.099384, lr=0.009025 | ETA 04:17:40 2022-07-25 06:41:48,111 - INFO - [TRAIN] epoch=51/160, iter=5850/18560, 
loss=3.871269, lr=0.009037 | ETA 04:18:49 2022-07-25 06:42:00,567 - INFO - [TRAIN] epoch=51/160, iter=5860/18560, loss=4.086847, lr=0.009049 | ETA 04:28:41 2022-07-25 06:42:12,874 - INFO - [TRAIN] epoch=51/160, iter=5870/18560, loss=3.943651, lr=0.009060 | ETA 04:19:42 2022-07-25 06:42:25,206 - INFO - [TRAIN] epoch=51/160, iter=5880/18560, loss=3.911516, lr=0.009072 | ETA 04:18:09 2022-07-25 06:42:37,604 - INFO - [TRAIN] epoch=51/160, iter=5890/18560, loss=4.069645, lr=0.009084 | ETA 04:16:53 2022-07-25 06:42:50,038 - INFO - [TRAIN] epoch=51/160, iter=5900/18560, loss=4.110803, lr=0.009095 | ETA 04:23:05 2022-07-25 06:43:02,269 - INFO - [TRAIN] epoch=51/160, iter=5910/18560, loss=3.880143, lr=0.009106 | ETA 04:18:38 2022-07-25 06:43:44,022 - INFO - [TRAIN] epoch=52/160, iter=5920/18560, loss=4.127276, lr=0.009118 | ETA 10:48:43 2022-07-25 06:43:56,460 - INFO - [TRAIN] epoch=52/160, iter=5930/18560, loss=4.060628, lr=0.009129 | ETA 04:23:30 2022-07-25 06:44:08,937 - INFO - [TRAIN] epoch=52/160, iter=5940/18560, loss=3.739293, lr=0.009140 | ETA 04:24:25 2022-07-25 06:44:21,323 - INFO - [TRAIN] epoch=52/160, iter=5950/18560, loss=4.017100, lr=0.009152 | ETA 04:20:22 2022-07-25 06:44:33,699 - INFO - [TRAIN] epoch=52/160, iter=5960/18560, loss=4.142966, lr=0.009163 | ETA 04:19:50 2022-07-25 06:44:46,174 - INFO - [TRAIN] epoch=52/160, iter=5970/18560, loss=4.186204, lr=0.009174 | ETA 04:25:10 2022-07-25 06:44:58,595 - INFO - [TRAIN] epoch=52/160, iter=5980/18560, loss=3.708496, lr=0.009185 | ETA 04:20:03 2022-07-25 06:45:10,779 - INFO - [TRAIN] epoch=52/160, iter=5990/18560, loss=4.229110, lr=0.009196 | ETA 04:12:47 2022-07-25 06:45:23,079 - INFO - [TRAIN] epoch=52/160, iter=6000/18560, loss=4.056833, lr=0.009206 | ETA 04:14:40 2022-07-25 06:45:35,349 - INFO - [TRAIN] epoch=52/160, iter=6010/18560, loss=4.131886, lr=0.009217 | ETA 04:19:53 2022-07-25 06:45:47,822 - INFO - [TRAIN] epoch=52/160, iter=6020/18560, loss=4.023422, lr=0.009228 | ETA 04:19:57 2022-07-25 
06:46:00,129 - INFO - [TRAIN] epoch=52/160, iter=6030/18560, loss=4.198933, lr=0.009238 | ETA 04:19:33 2022-07-25 06:46:42,061 - INFO - [TRAIN] epoch=53/160, iter=6040/18560, loss=4.222940, lr=0.009249 | ETA 04:40:59 2022-07-25 06:46:54,532 - INFO - [TRAIN] epoch=53/160, iter=6050/18560, loss=4.009160, lr=0.009260 | ETA 04:17:22 2022-07-25 06:47:06,877 - INFO - [TRAIN] epoch=53/160, iter=6060/18560, loss=4.086727, lr=0.009270 | ETA 04:17:36 2022-07-25 06:47:19,300 - INFO - [TRAIN] epoch=53/160, iter=6070/18560, loss=4.160235, lr=0.009280 | ETA 04:19:31 2022-07-25 06:47:31,585 - INFO - [TRAIN] epoch=53/160, iter=6080/18560, loss=4.268186, lr=0.009291 | ETA 04:15:23 2022-07-25 06:47:43,939 - INFO - [TRAIN] epoch=53/160, iter=6090/18560, loss=3.806924, lr=0.009301 | ETA 04:15:26 2022-07-25 06:47:56,379 - INFO - [TRAIN] epoch=53/160, iter=6100/18560, loss=3.985843, lr=0.009311 | ETA 04:16:52 2022-07-25 06:48:08,817 - INFO - [TRAIN] epoch=53/160, iter=6110/18560, loss=4.173857, lr=0.009321 | ETA 04:19:43 2022-07-25 06:48:21,182 - INFO - [TRAIN] epoch=53/160, iter=6120/18560, loss=4.266058, lr=0.009331 | ETA 04:16:38 2022-07-25 06:48:33,537 - INFO - [TRAIN] epoch=53/160, iter=6130/18560, loss=4.125471, lr=0.009341 | ETA 04:17:25 2022-07-25 06:48:45,910 - INFO - [TRAIN] epoch=53/160, iter=6140/18560, loss=4.111471, lr=0.009351 | ETA 04:13:39 2022-07-25 06:49:25,927 - INFO - [TRAIN] epoch=54/160, iter=6150/18560, loss=3.849030, lr=0.009361 | ETA 28:05:58 2022-07-25 06:49:38,423 - INFO - [TRAIN] epoch=54/160, iter=6160/18560, loss=4.001868, lr=0.009371 | ETA 04:17:55 2022-07-25 06:49:50,871 - INFO - [TRAIN] epoch=54/160, iter=6170/18560, loss=3.729883, lr=0.009380 | ETA 04:17:39 2022-07-25 06:50:03,233 - INFO - [TRAIN] epoch=54/160, iter=6180/18560, loss=3.710024, lr=0.009390 | ETA 04:21:00 2022-07-25 06:50:15,565 - INFO - [TRAIN] epoch=54/160, iter=6190/18560, loss=4.221656, lr=0.009399 | ETA 04:16:09 2022-07-25 06:50:28,015 - INFO - [TRAIN] epoch=54/160, iter=6200/18560, 
loss=3.884149, lr=0.009409 | ETA 04:19:06 2022-07-25 06:50:40,428 - INFO - [TRAIN] epoch=54/160, iter=6210/18560, loss=3.955944, lr=0.009418 | ETA 04:15:19 2022-07-25 06:50:52,936 - INFO - [TRAIN] epoch=54/160, iter=6220/18560, loss=4.006731, lr=0.009428 | ETA 04:19:32 2022-07-25 06:51:05,367 - INFO - [TRAIN] epoch=54/160, iter=6230/18560, loss=4.137516, lr=0.009437 | ETA 04:12:42 2022-07-25 06:51:17,675 - INFO - [TRAIN] epoch=54/160, iter=6240/18560, loss=3.927157, lr=0.009446 | ETA 04:17:05 2022-07-25 06:51:29,956 - INFO - [TRAIN] epoch=54/160, iter=6250/18560, loss=3.940304, lr=0.009455 | ETA 04:14:39 2022-07-25 06:51:42,224 - INFO - [TRAIN] epoch=54/160, iter=6260/18560, loss=3.729076, lr=0.009464 | ETA 04:10:58 2022-07-25 06:52:23,002 - INFO - [TRAIN] epoch=55/160, iter=6270/18560, loss=4.130880, lr=0.009473 | ETA 05:45:02 2022-07-25 06:52:35,364 - INFO - [TRAIN] epoch=55/160, iter=6280/18560, loss=3.784554, lr=0.009482 | ETA 04:10:51 2022-07-25 06:52:47,639 - INFO - [TRAIN] epoch=55/160, iter=6290/18560, loss=3.838620, lr=0.009491 | ETA 04:07:30 2022-07-25 06:53:00,000 - INFO - [TRAIN] epoch=55/160, iter=6300/18560, loss=4.024345, lr=0.009500 | ETA 04:08:52 2022-07-25 06:53:12,360 - INFO - [TRAIN] epoch=55/160, iter=6310/18560, loss=3.948501, lr=0.009508 | ETA 04:12:47 2022-07-25 06:53:24,792 - INFO - [TRAIN] epoch=55/160, iter=6320/18560, loss=3.779711, lr=0.009517 | ETA 04:22:44 2022-07-25 06:53:37,095 - INFO - [TRAIN] epoch=55/160, iter=6330/18560, loss=3.772200, lr=0.009525 | ETA 04:08:38 2022-07-25 06:53:49,318 - INFO - [TRAIN] epoch=55/160, iter=6340/18560, loss=3.839296, lr=0.009534 | ETA 04:10:38 2022-07-25 06:54:01,640 - INFO - [TRAIN] epoch=55/160, iter=6350/18560, loss=4.005944, lr=0.009542 | ETA 04:12:01 2022-07-25 06:54:13,888 - INFO - [TRAIN] epoch=55/160, iter=6360/18560, loss=4.080070, lr=0.009551 | ETA 04:11:22 2022-07-25 06:54:26,167 - INFO - [TRAIN] epoch=55/160, iter=6370/18560, loss=4.051747, lr=0.009559 | ETA 04:11:02 2022-07-25 
06:54:38,615 - INFO - [TRAIN] epoch=55/160, iter=6380/18560, loss=3.968426, lr=0.009567 | ETA 04:10:18 2022-07-25 06:54:38,638 - INFO - Pop model from output_nospa/epoch_30 2022-07-25 06:54:38,812 - INFO - Push model to checkpoint output_nospa/epoch_55 2022-07-25 06:55:19,425 - INFO - [TRAIN] epoch=56/160, iter=6390/18560, loss=4.024467, lr=0.009575 | ETA 04:21:03 2022-07-25 06:55:31,751 - INFO - [TRAIN] epoch=56/160, iter=6400/18560, loss=3.634674, lr=0.009583 | ETA 04:08:38 2022-07-25 06:55:44,060 - INFO - [TRAIN] epoch=56/160, iter=6410/18560, loss=3.791634, lr=0.009591 | ETA 04:08:39 2022-07-25 06:55:56,376 - INFO - [TRAIN] epoch=56/160, iter=6420/18560, loss=4.106755, lr=0.009599 | ETA 04:12:59 2022-07-25 06:56:08,563 - INFO - [TRAIN] epoch=56/160, iter=6430/18560, loss=4.133889, lr=0.009607 | ETA 04:05:46 2022-07-25 06:56:20,871 - INFO - [TRAIN] epoch=56/160, iter=6440/18560, loss=3.796361, lr=0.009615 | ETA 04:06:23 2022-07-25 06:56:33,255 - INFO - [TRAIN] epoch=56/160, iter=6450/18560, loss=3.997476, lr=0.009622 | ETA 04:07:11 2022-07-25 06:56:45,517 - INFO - [TRAIN] epoch=56/160, iter=6460/18560, loss=4.165869, lr=0.009630 | ETA 04:06:29 2022-07-25 06:56:57,882 - INFO - [TRAIN] epoch=56/160, iter=6470/18560, loss=3.801121, lr=0.009638 | ETA 04:11:57 2022-07-25 06:57:10,081 - INFO - [TRAIN] epoch=56/160, iter=6480/18560, loss=3.586289, lr=0.009645 | ETA 04:03:48 2022-07-25 06:57:22,426 - INFO - [TRAIN] epoch=56/160, iter=6490/18560, loss=3.827870, lr=0.009652 | ETA 04:05:25 2022-07-25 06:58:05,885 - INFO - [TRAIN] epoch=57/160, iter=6500/18560, loss=4.045943, lr=0.009660 | ETA 10:35:13 2022-07-25 06:58:18,087 - INFO - [TRAIN] epoch=57/160, iter=6510/18560, loss=4.014391, lr=0.009667 | ETA 04:05:04 2022-07-25 06:58:30,491 - INFO - [TRAIN] epoch=57/160, iter=6520/18560, loss=3.846237, lr=0.009674 | ETA 04:05:58 2022-07-25 06:58:42,809 - INFO - [TRAIN] epoch=57/160, iter=6530/18560, loss=4.045128, lr=0.009681 | ETA 04:09:02 2022-07-25 06:58:55,040 - INFO - 
[TRAIN] epoch=57/160, iter=6540/18560, loss=3.968484, lr=0.009688 | ETA 04:07:45 2022-07-25 06:59:07,308 - INFO - [TRAIN] epoch=57/160, iter=6550/18560, loss=3.751822, lr=0.009695 | ETA 04:04:00 2022-07-25 06:59:19,562 - INFO - [TRAIN] epoch=57/160, iter=6560/18560, loss=3.845474, lr=0.009702 | ETA 04:06:38 2022-07-25 06:59:31,805 - INFO - [TRAIN] epoch=57/160, iter=6570/18560, loss=3.964663, lr=0.009709 | ETA 04:03:53 2022-07-25 06:59:44,410 - INFO - [TRAIN] epoch=57/160, iter=6580/18560, loss=4.093597, lr=0.009715 | ETA 04:13:09 2022-07-25 06:59:56,826 - INFO - [TRAIN] epoch=57/160, iter=6590/18560, loss=3.830796, lr=0.009722 | ETA 04:05:55 2022-07-25 07:00:09,134 - INFO - [TRAIN] epoch=57/160, iter=6600/18560, loss=4.251573, lr=0.009729 | ETA 04:05:30 2022-07-25 07:00:21,307 - INFO - [TRAIN] epoch=57/160, iter=6610/18560, loss=3.911423, lr=0.009735 | ETA 04:03:51 2022-07-25 07:01:07,752 - INFO - [TRAIN] epoch=58/160, iter=6620/18560, loss=3.921067, lr=0.009741 | ETA 04:34:26 2022-07-25 07:01:20,038 - INFO - [TRAIN] epoch=58/160, iter=6630/18560, loss=3.818917, lr=0.009748 | ETA 04:06:56 2022-07-25 07:01:32,303 - INFO - [TRAIN] epoch=58/160, iter=6640/18560, loss=3.956653, lr=0.009754 | ETA 04:02:25 2022-07-25 07:01:44,671 - INFO - [TRAIN] epoch=58/160, iter=6650/18560, loss=4.255893, lr=0.009760 | ETA 04:03:13 2022-07-25 07:01:56,902 - INFO - [TRAIN] epoch=58/160, iter=6660/18560, loss=3.687881, lr=0.009766 | ETA 04:04:49 2022-07-25 07:02:09,206 - INFO - [TRAIN] epoch=58/160, iter=6670/18560, loss=3.817792, lr=0.009772 | ETA 04:02:31 2022-07-25 07:02:21,608 - INFO - [TRAIN] epoch=58/160, iter=6680/18560, loss=3.655906, lr=0.009778 | ETA 04:05:03 2022-07-25 07:02:34,045 - INFO - [TRAIN] epoch=58/160, iter=6690/18560, loss=4.207960, lr=0.009784 | ETA 04:03:55 2022-07-25 07:02:46,350 - INFO - [TRAIN] epoch=58/160, iter=6700/18560, loss=3.798245, lr=0.009790 | ETA 03:59:36 2022-07-25 07:02:58,844 - INFO - [TRAIN] epoch=58/160, iter=6710/18560, loss=3.796510, 
lr=0.009796 | ETA 04:08:36 2022-07-25 07:03:11,310 - INFO - [TRAIN] epoch=58/160, iter=6720/18560, loss=3.764214, lr=0.009801 | ETA 04:05:50 2022-07-25 07:03:56,489 - INFO - [TRAIN] epoch=59/160, iter=6730/18560, loss=3.827596, lr=0.009807 | ETA 30:56:04 2022-07-25 07:04:08,738 - INFO - [TRAIN] epoch=59/160, iter=6740/18560, loss=3.660039, lr=0.009812 | ETA 04:04:36 2022-07-25 07:04:21,072 - INFO - [TRAIN] epoch=59/160, iter=6750/18560, loss=3.685110, lr=0.009818 | ETA 04:02:49 2022-07-25 07:04:33,592 - INFO - [TRAIN] epoch=59/160, iter=6760/18560, loss=3.983809, lr=0.009823 | ETA 04:06:44 2022-07-25 07:04:45,871 - INFO - [TRAIN] epoch=59/160, iter=6770/18560, loss=3.885081, lr=0.009828 | ETA 04:00:30 2022-07-25 07:04:58,197 - INFO - [TRAIN] epoch=59/160, iter=6780/18560, loss=3.727970, lr=0.009833 | ETA 04:04:05 2022-07-25 07:05:10,484 - INFO - [TRAIN] epoch=59/160, iter=6790/18560, loss=3.909018, lr=0.009839 | ETA 04:00:14 2022-07-25 07:05:22,830 - INFO - [TRAIN] epoch=59/160, iter=6800/18560, loss=3.900843, lr=0.009844 | ETA 04:02:35 2022-07-25 07:05:35,181 - INFO - [TRAIN] epoch=59/160, iter=6810/18560, loss=3.848152, lr=0.009848 | ETA 04:00:18 2022-07-25 07:05:47,492 - INFO - [TRAIN] epoch=59/160, iter=6820/18560, loss=3.800946, lr=0.009853 | ETA 04:00:49 2022-07-25 07:05:59,753 - INFO - [TRAIN] epoch=59/160, iter=6830/18560, loss=4.242165, lr=0.009858 | ETA 03:58:41 2022-07-25 07:06:12,021 - INFO - [TRAIN] epoch=59/160, iter=6840/18560, loss=3.891388, lr=0.009863 | ETA 03:58:48 2022-07-25 07:06:58,414 - INFO - [TRAIN] epoch=60/160, iter=6850/18560, loss=4.040818, lr=0.009867 | ETA 05:44:54 2022-07-25 07:07:10,764 - INFO - [TRAIN] epoch=60/160, iter=6860/18560, loss=3.879434, lr=0.009872 | ETA 04:03:07 2022-07-25 07:07:23,131 - INFO - [TRAIN] epoch=60/160, iter=6870/18560, loss=3.588185, lr=0.009876 | ETA 03:58:28 2022-07-25 07:07:35,402 - INFO - [TRAIN] epoch=60/160, iter=6880/18560, loss=3.802164, lr=0.009881 | ETA 03:57:33 2022-07-25 07:07:47,675 - INFO - 
[TRAIN] epoch=60/160, iter=6890/18560, loss=4.068771, lr=0.009885 | ETA 04:00:16 2022-07-25 07:07:59,827 - INFO - [TRAIN] epoch=60/160, iter=6900/18560, loss=4.001153, lr=0.009889 | ETA 03:56:47 2022-07-25 07:08:12,253 - INFO - [TRAIN] epoch=60/160, iter=6910/18560, loss=3.981517, lr=0.009894 | ETA 04:02:28 2022-07-25 07:08:24,547 - INFO - [TRAIN] epoch=60/160, iter=6920/18560, loss=4.181551, lr=0.009898 | ETA 03:56:53 2022-07-25 07:08:36,887 - INFO - [TRAIN] epoch=60/160, iter=6930/18560, loss=3.884090, lr=0.009902 | ETA 04:00:17 2022-07-25 07:08:49,476 - INFO - [TRAIN] epoch=60/160, iter=6940/18560, loss=3.859790, lr=0.009906 | ETA 03:59:17 2022-07-25 07:09:02,089 - INFO - [TRAIN] epoch=60/160, iter=6950/18560, loss=3.699925, lr=0.009909 | ETA 03:59:45 2022-07-25 07:09:14,334 - INFO - [TRAIN] epoch=60/160, iter=6960/18560, loss=3.621717, lr=0.009913 | ETA 03:53:09 2022-07-25 07:09:14,367 - INFO - Pop model from output_nospa/epoch_35 2022-07-25 07:09:14,633 - INFO - Push model to checkpoint output_nospa/epoch_60 2022-07-25 07:09:58,428 - INFO - [TRAIN] epoch=61/160, iter=6970/18560, loss=3.902296, lr=0.009917 | ETA 04:05:46 2022-07-25 07:10:11,101 - INFO - [TRAIN] epoch=61/160, iter=6980/18560, loss=3.699350, lr=0.009920 | ETA 03:57:24 2022-07-25 07:10:23,585 - INFO - [TRAIN] epoch=61/160, iter=6990/18560, loss=3.715292, lr=0.009924 | ETA 03:59:57 2022-07-25 07:10:36,024 - INFO - [TRAIN] epoch=61/160, iter=7000/18560, loss=4.217429, lr=0.009927 | ETA 04:11:57 2022-07-25 07:10:48,944 - INFO - [TRAIN] epoch=61/160, iter=7010/18560, loss=3.745039, lr=0.009931 | ETA 04:13:06 2022-07-25 07:11:01,426 - INFO - [TRAIN] epoch=61/160, iter=7020/18560, loss=3.639243, lr=0.009934 | ETA 03:57:03 2022-07-25 07:11:13,979 - INFO - [TRAIN] epoch=61/160, iter=7030/18560, loss=4.003915, lr=0.009937 | ETA 03:57:46 2022-07-25 07:11:26,459 - INFO - [TRAIN] epoch=61/160, iter=7040/18560, loss=3.651049, lr=0.009940 | ETA 03:56:42 2022-07-25 07:11:38,709 - INFO - [TRAIN] epoch=61/160, 
iter=7050/18560, loss=3.805628, lr=0.009943 | ETA 03:53:22 2022-07-25 07:11:51,220 - INFO - [TRAIN] epoch=61/160, iter=7060/18560, loss=4.053917, lr=0.009946 | ETA 03:55:06 2022-07-25 07:12:03,558 - INFO - [TRAIN] epoch=61/160, iter=7070/18560, loss=3.791664, lr=0.009949 | ETA 03:55:01 2022-07-25 07:12:53,770 - INFO - [TRAIN] epoch=62/160, iter=7080/18560, loss=4.067451, lr=0.009952 | ETA 11:34:02 2022-07-25 07:13:06,388 - INFO - [TRAIN] epoch=62/160, iter=7090/18560, loss=3.706248, lr=0.009955 | ETA 04:01:42 2022-07-25 07:13:19,025 - INFO - [TRAIN] epoch=62/160, iter=7100/18560, loss=3.828600, lr=0.009958 | ETA 03:56:34 2022-07-25 07:13:31,477 - INFO - [TRAIN] epoch=62/160, iter=7110/18560, loss=3.774476, lr=0.009960 | ETA 03:57:04 2022-07-25 07:13:44,016 - INFO - [TRAIN] epoch=62/160, iter=7120/18560, loss=3.670839, lr=0.009963 | ETA 03:59:17 2022-07-25 07:13:56,382 - INFO - [TRAIN] epoch=62/160, iter=7130/18560, loss=3.612015, lr=0.009965 | ETA 03:50:26 2022-07-25 07:14:08,880 - INFO - [TRAIN] epoch=62/160, iter=7140/18560, loss=3.975489, lr=0.009967 | ETA 04:00:47 2022-07-25 07:14:21,437 - INFO - [TRAIN] epoch=62/160, iter=7150/18560, loss=3.765720, lr=0.009970 | ETA 04:00:10 2022-07-25 07:14:33,770 - INFO - [TRAIN] epoch=62/160, iter=7160/18560, loss=4.086346, lr=0.009972 | ETA 03:54:56 2022-07-25 07:14:46,222 - INFO - [TRAIN] epoch=62/160, iter=7170/18560, loss=4.005125, lr=0.009974 | ETA 03:55:31 2022-07-25 07:14:58,849 - INFO - [TRAIN] epoch=62/160, iter=7180/18560, loss=3.821580, lr=0.009976 | ETA 03:56:12 2022-07-25 07:15:11,230 - INFO - [TRAIN] epoch=62/160, iter=7190/18560, loss=3.652469, lr=0.009978 | ETA 03:52:37 2022-07-25 07:16:02,555 - INFO - [TRAIN] epoch=63/160, iter=7200/18560, loss=3.875351, lr=0.009980 | ETA 04:28:26 2022-07-25 07:16:15,141 - INFO - [TRAIN] epoch=63/160, iter=7210/18560, loss=3.934776, lr=0.009981 | ETA 03:56:51 2022-07-25 07:16:27,475 - INFO - [TRAIN] epoch=63/160, iter=7220/18560, loss=3.641396, lr=0.009983 | ETA 03:50:44 
2022-07-25 07:16:40,086 - INFO - [TRAIN] epoch=63/160, iter=7230/18560, loss=4.156403, lr=0.009985 | ETA 04:02:53 2022-07-25 07:16:52,445 - INFO - [TRAIN] epoch=63/160, iter=7240/18560, loss=3.676079, lr=0.009986 | ETA 03:52:10 2022-07-25 07:17:04,923 - INFO - [TRAIN] epoch=63/160, iter=7250/18560, loss=3.638185, lr=0.009988 | ETA 03:57:50 2022-07-25 07:17:17,525 - INFO - [TRAIN] epoch=63/160, iter=7260/18560, loss=3.712736, lr=0.009989 | ETA 03:52:02 2022-07-25 07:17:30,037 - INFO - [TRAIN] epoch=63/160, iter=7270/18560, loss=3.702601, lr=0.009990 | ETA 03:59:03 2022-07-25 07:17:42,231 - INFO - [TRAIN] epoch=63/160, iter=7280/18560, loss=4.152292, lr=0.009992 | ETA 03:48:17 2022-07-25 07:17:54,566 - INFO - [TRAIN] epoch=63/160, iter=7290/18560, loss=3.846712, lr=0.009993 | ETA 03:51:58 2022-07-25 07:18:06,796 - INFO - [TRAIN] epoch=63/160, iter=7300/18560, loss=4.053350, lr=0.009994 | ETA 03:48:45 2022-07-25 07:19:00,231 - INFO - [TRAIN] epoch=64/160, iter=7310/18560, loss=3.721095, lr=0.009995 | ETA 36:00:50 2022-07-25 07:19:14,171 - INFO - [TRAIN] epoch=64/160, iter=7320/18560, loss=3.903940, lr=0.009996 | ETA 03:57:29 2022-07-25 07:19:26,600 - INFO - [TRAIN] epoch=64/160, iter=7330/18560, loss=3.705169, lr=0.009996 | ETA 03:54:21 2022-07-25 07:19:39,032 - INFO - [TRAIN] epoch=64/160, iter=7340/18560, loss=3.720328, lr=0.009997 | ETA 03:50:45 2022-07-25 07:19:51,140 - INFO - [TRAIN] epoch=64/160, iter=7350/18560, loss=4.000013, lr=0.009998 | ETA 03:46:37 2022-07-25 07:20:03,540 - INFO - [TRAIN] epoch=64/160, iter=7360/18560, loss=3.863085, lr=0.009998 | ETA 03:52:08 2022-07-25 07:20:15,876 - INFO - [TRAIN] epoch=64/160, iter=7370/18560, loss=3.750392, lr=0.009999 | ETA 03:50:41 2022-07-25 07:20:28,134 - INFO - [TRAIN] epoch=64/160, iter=7380/18560, loss=3.753921, lr=0.009999 | ETA 03:49:17 2022-07-25 07:20:40,417 - INFO - [TRAIN] epoch=64/160, iter=7390/18560, loss=3.949673, lr=0.010000 | ETA 03:48:17 2022-07-25 07:20:52,629 - INFO - [TRAIN] epoch=64/160, 
iter=7400/18560, loss=3.465935, lr=0.010000 | ETA 03:45:43 2022-07-25 07:21:04,861 - INFO - [TRAIN] epoch=64/160, iter=7410/18560, loss=3.409297, lr=0.010000 | ETA 03:46:58 2022-07-25 07:21:17,047 - INFO - [TRAIN] epoch=64/160, iter=7420/18560, loss=3.667473, lr=0.010000 | ETA 03:42:11 2022-07-25 07:22:08,029 - INFO - [TRAIN] epoch=65/160, iter=7430/18560, loss=4.085933, lr=0.010000 | ETA 05:38:46 2022-07-25 07:22:20,412 - INFO - [TRAIN] epoch=65/160, iter=7440/18560, loss=3.923178, lr=0.010000 | ETA 03:50:49 2022-07-25 07:22:32,725 - INFO - [TRAIN] epoch=65/160, iter=7450/18560, loss=3.741284, lr=0.010000 | ETA 03:47:37 2022-07-25 07:22:45,499 - INFO - [TRAIN] epoch=65/160, iter=7460/18560, loss=3.636309, lr=0.010000 | ETA 03:57:20 2022-07-25 07:22:57,801 - INFO - [TRAIN] epoch=65/160, iter=7470/18560, loss=4.050445, lr=0.010000 | ETA 03:48:29 2022-07-25 07:23:10,362 - INFO - [TRAIN] epoch=65/160, iter=7480/18560, loss=3.571111, lr=0.009999 | ETA 03:50:09 2022-07-25 07:23:23,026 - INFO - [TRAIN] epoch=65/160, iter=7490/18560, loss=3.383254, lr=0.009999 | ETA 03:51:32 2022-07-25 07:23:35,346 - INFO - [TRAIN] epoch=65/160, iter=7500/18560, loss=3.943060, lr=0.009999 | ETA 03:51:53 2022-07-25 07:23:47,987 - INFO - [TRAIN] epoch=65/160, iter=7510/18560, loss=4.043374, lr=0.009999 | ETA 03:46:53 2022-07-25 07:24:00,520 - INFO - [TRAIN] epoch=65/160, iter=7520/18560, loss=3.737384, lr=0.009998 | ETA 03:52:54 2022-07-25 07:24:13,030 - INFO - [TRAIN] epoch=65/160, iter=7530/18560, loss=3.948769, lr=0.009998 | ETA 03:46:41 2022-07-25 07:24:25,552 - INFO - [TRAIN] epoch=65/160, iter=7540/18560, loss=3.592710, lr=0.009997 | ETA 03:44:45 2022-07-25 07:24:25,594 - INFO - Pop model from output_nospa/epoch_40 2022-07-25 07:24:25,878 - INFO - Push model to checkpoint output_nospa/epoch_65 2022-07-25 07:25:16,605 - INFO - [TRAIN] epoch=66/160, iter=7550/18560, loss=4.041247, lr=0.009997 | ETA 04:05:10 2022-07-25 07:25:29,133 - INFO - [TRAIN] epoch=66/160, iter=7560/18560, 
loss=3.664671, lr=0.009996 | ETA 03:49:58 2022-07-25 07:25:41,848 - INFO - [TRAIN] epoch=66/160, iter=7570/18560, loss=3.403407, lr=0.009996 | ETA 03:54:43 2022-07-25 07:25:54,546 - INFO - [TRAIN] epoch=66/160, iter=7580/18560, loss=3.491250, lr=0.009995 | ETA 03:49:14 2022-07-25 07:26:07,064 - INFO - [TRAIN] epoch=66/160, iter=7590/18560, loss=3.736528, lr=0.009995 | ETA 03:50:17 2022-07-25 07:26:19,656 - INFO - [TRAIN] epoch=66/160, iter=7600/18560, loss=3.408975, lr=0.009994 | ETA 03:44:24 2022-07-25 07:26:31,989 - INFO - [TRAIN] epoch=66/160, iter=7610/18560, loss=3.979898, lr=0.009993 | ETA 03:46:50 2022-07-25 07:26:44,352 - INFO - [TRAIN] epoch=66/160, iter=7620/18560, loss=3.982568, lr=0.009992 | ETA 03:45:29 2022-07-25 07:26:56,841 - INFO - [TRAIN] epoch=66/160, iter=7630/18560, loss=3.658856, lr=0.009992 | ETA 03:47:09 2022-07-25 07:27:09,507 - INFO - [TRAIN] epoch=66/160, iter=7640/18560, loss=3.922019, lr=0.009991 | ETA 03:50:50 2022-07-25 07:27:21,960 - INFO - [TRAIN] epoch=66/160, iter=7650/18560, loss=3.765006, lr=0.009990 | ETA 03:44:53 2022-07-25 07:28:10,549 - INFO - [TRAIN] epoch=67/160, iter=7660/18560, loss=3.748764, lr=0.009989 | ETA 10:39:28 2022-07-25 07:28:22,996 - INFO - [TRAIN] epoch=67/160, iter=7670/18560, loss=3.807586, lr=0.009988 | ETA 03:43:25 2022-07-25 07:28:35,637 - INFO - [TRAIN] epoch=67/160, iter=7680/18560, loss=3.514031, lr=0.009987 | ETA 03:46:06 2022-07-25 07:28:48,271 - INFO - [TRAIN] epoch=67/160, iter=7690/18560, loss=3.687646, lr=0.009986 | ETA 03:48:16 2022-07-25 07:29:00,628 - INFO - [TRAIN] epoch=67/160, iter=7700/18560, loss=3.782401, lr=0.009985 | ETA 03:46:06 2022-07-25 07:29:12,840 - INFO - [TRAIN] epoch=67/160, iter=7710/18560, loss=3.501688, lr=0.009984 | ETA 03:41:10 2022-07-25 07:29:25,049 - INFO - [TRAIN] epoch=67/160, iter=7720/18560, loss=3.686124, lr=0.009983 | ETA 03:39:25 2022-07-25 07:29:37,300 - INFO - [TRAIN] epoch=67/160, iter=7730/18560, loss=4.006091, lr=0.009982 | ETA 03:40:37 2022-07-25 
07:29:50,053 - INFO - [TRAIN] epoch=67/160, iter=7740/18560, loss=4.059728, lr=0.009980 | ETA 03:58:40 2022-07-25 07:30:02,841 - INFO - [TRAIN] epoch=67/160, iter=7750/18560, loss=3.861947, lr=0.009979 | ETA 03:46:40 2022-07-25 07:30:15,367 - INFO - [TRAIN] epoch=67/160, iter=7760/18560, loss=3.811999, lr=0.009978 | ETA 03:43:18 2022-07-25 07:30:27,885 - INFO - [TRAIN] epoch=67/160, iter=7770/18560, loss=3.849421, lr=0.009976 | ETA 03:45:20 2022-07-25 07:31:17,987 - INFO - [TRAIN] epoch=68/160, iter=7780/18560, loss=3.528620, lr=0.009975 | ETA 04:07:23 2022-07-25 07:31:30,510 - INFO - [TRAIN] epoch=68/160, iter=7790/18560, loss=3.599963, lr=0.009974 | ETA 03:46:55 2022-07-25 07:31:43,167 - INFO - [TRAIN] epoch=68/160, iter=7800/18560, loss=3.545373, lr=0.009972 | ETA 03:45:51 2022-07-25 07:31:55,426 - INFO - [TRAIN] epoch=68/160, iter=7810/18560, loss=3.713701, lr=0.009971 | ETA 03:40:52 2022-07-25 07:32:07,915 - INFO - [TRAIN] epoch=68/160, iter=7820/18560, loss=3.683744, lr=0.009969 | ETA 03:46:27 2022-07-25 07:32:20,334 - INFO - [TRAIN] epoch=68/160, iter=7830/18560, loss=3.567487, lr=0.009967 | ETA 03:39:58 2022-07-25 07:32:32,676 - INFO - [TRAIN] epoch=68/160, iter=7840/18560, loss=3.570051, lr=0.009966 | ETA 03:38:53 2022-07-25 07:32:45,152 - INFO - [TRAIN] epoch=68/160, iter=7850/18560, loss=3.818120, lr=0.009964 | ETA 03:45:14 2022-07-25 07:32:57,825 - INFO - [TRAIN] epoch=68/160, iter=7860/18560, loss=3.600209, lr=0.009962 | ETA 03:48:22 2022-07-25 07:33:10,238 - INFO - [TRAIN] epoch=68/160, iter=7870/18560, loss=3.831952, lr=0.009961 | ETA 03:37:21 2022-07-25 07:33:22,738 - INFO - [TRAIN] epoch=68/160, iter=7880/18560, loss=3.781591, lr=0.009959 | ETA 03:41:48 2022-07-25 07:34:15,728 - INFO - [TRAIN] epoch=69/160, iter=7890/18560, loss=3.746680, lr=0.009957 | ETA 33:50:07 2022-07-25 07:34:28,324 - INFO - [TRAIN] epoch=69/160, iter=7900/18560, loss=3.852843, lr=0.009955 | ETA 03:46:27 2022-07-25 07:34:40,709 - INFO - [TRAIN] epoch=69/160, iter=7910/18560, 
loss=3.805971, lr=0.009953 | ETA 03:37:53 2022-07-25 07:34:53,056 - INFO - [TRAIN] epoch=69/160, iter=7920/18560, loss=3.546639, lr=0.009951 | ETA 03:37:42 2022-07-25 07:35:05,390 - INFO - [TRAIN] epoch=69/160, iter=7930/18560, loss=3.965891, lr=0.009949 | ETA 03:40:34 2022-07-25 07:35:17,753 - INFO - [TRAIN] epoch=69/160, iter=7940/18560, loss=3.636496, lr=0.009947 | ETA 03:37:49 2022-07-25 07:35:30,440 - INFO - [TRAIN] epoch=69/160, iter=7950/18560, loss=3.756960, lr=0.009945 | ETA 03:48:05 2022-07-25 07:35:43,162 - INFO - [TRAIN] epoch=69/160, iter=7960/18560, loss=3.771576, lr=0.009943 | ETA 03:42:45 2022-07-25 07:35:55,506 - INFO - [TRAIN] epoch=69/160, iter=7970/18560, loss=3.850642, lr=0.009941 | ETA 03:36:02 2022-07-25 07:36:08,239 - INFO - [TRAIN] epoch=69/160, iter=7980/18560, loss=3.976571, lr=0.009939 | ETA 03:43:31 2022-07-25 07:36:20,832 - INFO - [TRAIN] epoch=69/160, iter=7990/18560, loss=3.574166, lr=0.009937 | ETA 03:43:11 2022-07-25 07:36:33,482 - INFO - [TRAIN] epoch=69/160, iter=8000/18560, loss=3.325819, lr=0.009934 | ETA 03:38:20 2022-07-25 07:37:17,845 - INFO - [TRAIN] epoch=70/160, iter=8010/18560, loss=3.796519, lr=0.009932 | ETA 05:07:55 2022-07-25 07:37:30,638 - INFO - [TRAIN] epoch=70/160, iter=8020/18560, loss=3.620840, lr=0.009930 | ETA 03:49:07 2022-07-25 07:37:43,149 - INFO - [TRAIN] epoch=70/160, iter=8030/18560, loss=3.585376, lr=0.009927 | ETA 03:37:01 2022-07-25 07:37:55,697 - INFO - [TRAIN] epoch=70/160, iter=8040/18560, loss=3.352866, lr=0.009925 | ETA 03:37:22 2022-07-25 07:38:07,979 - INFO - [TRAIN] epoch=70/160, iter=8050/18560, loss=3.764906, lr=0.009922 | ETA 03:35:10 2022-07-25 07:38:20,459 - INFO - [TRAIN] epoch=70/160, iter=8060/18560, loss=3.405602, lr=0.009920 | ETA 03:36:42 2022-07-25 07:38:32,740 - INFO - [TRAIN] epoch=70/160, iter=8070/18560, loss=3.178940, lr=0.009917 | ETA 03:33:15 2022-07-25 07:38:45,182 - INFO - [TRAIN] epoch=70/160, iter=8080/18560, loss=3.716255, lr=0.009915 | ETA 03:40:08 2022-07-25 
07:38:57,440 - INFO - [TRAIN] epoch=70/160, iter=8090/18560, loss=3.752544, lr=0.009912 | ETA 03:33:07 2022-07-25 07:39:09,644 - INFO - [TRAIN] epoch=70/160, iter=8100/18560, loss=3.742054, lr=0.009910 | ETA 03:31:26 2022-07-25 07:39:22,143 - INFO - [TRAIN] epoch=70/160, iter=8110/18560, loss=3.462722, lr=0.009907 | ETA 03:41:40 2022-07-25 07:39:34,553 - INFO - [TRAIN] epoch=70/160, iter=8120/18560, loss=4.009322, lr=0.009904 | ETA 03:33:24 2022-07-25 07:39:34,577 - INFO - Pop model from output_nospa/epoch_45 2022-07-25 07:39:34,776 - INFO - Push model to checkpoint output_nospa/epoch_70 2022-07-25 07:40:22,811 - INFO - [TRAIN] epoch=71/160, iter=8130/18560, loss=3.791151, lr=0.009901 | ETA 03:41:25 2022-07-25 07:40:35,346 - INFO - [TRAIN] epoch=71/160, iter=8140/18560, loss=3.553302, lr=0.009899 | ETA 03:43:01 2022-07-25 07:40:48,002 - INFO - [TRAIN] epoch=71/160, iter=8150/18560, loss=3.543680, lr=0.009896 | ETA 03:44:02 2022-07-25 07:41:00,466 - INFO - [TRAIN] epoch=71/160, iter=8160/18560, loss=3.933927, lr=0.009893 | ETA 03:43:45 2022-07-25 07:41:12,816 - INFO - [TRAIN] epoch=71/160, iter=8170/18560, loss=3.632069, lr=0.009890 | ETA 03:39:43 2022-07-25 07:41:25,183 - INFO - [TRAIN] epoch=71/160, iter=8180/18560, loss=3.536246, lr=0.009887 | ETA 03:32:10 2022-07-25 07:41:37,696 - INFO - [TRAIN] epoch=71/160, iter=8190/18560, loss=3.644002, lr=0.009884 | ETA 03:34:34 2022-07-25 07:41:50,087 - INFO - [TRAIN] epoch=71/160, iter=8200/18560, loss=3.543950, lr=0.009881 | ETA 03:36:47 2022-07-25 07:42:02,612 - INFO - [TRAIN] epoch=71/160, iter=8210/18560, loss=3.548589, lr=0.009878 | ETA 03:35:50 2022-07-25 07:42:15,226 - INFO - [TRAIN] epoch=71/160, iter=8220/18560, loss=3.868059, lr=0.009875 | ETA 03:37:04 2022-07-25 07:42:27,652 - INFO - [TRAIN] epoch=71/160, iter=8230/18560, loss=3.781619, lr=0.009872 | ETA 03:28:25 2022-07-25 07:43:08,173 - INFO - [TRAIN] epoch=72/160, iter=8240/18560, loss=3.796754, lr=0.009868 | ETA 08:36:23 2022-07-25 07:43:20,608 - INFO - 
[TRAIN] epoch=72/160, iter=8250/18560, loss=3.489691, lr=0.009865 | ETA 03:30:14 2022-07-25 07:43:33,096 - INFO - [TRAIN] epoch=72/160, iter=8260/18560, loss=3.436004, lr=0.009862 | ETA 03:32:34 2022-07-25 07:43:45,511 - INFO - [TRAIN] epoch=72/160, iter=8270/18560, loss=3.440431, lr=0.009859 | ETA 03:32:53 2022-07-25 07:43:58,044 - INFO - [TRAIN] epoch=72/160, iter=8280/18560, loss=3.748070, lr=0.009855 | ETA 03:34:54 2022-07-25 07:44:10,473 - INFO - [TRAIN] epoch=72/160, iter=8290/18560, loss=3.578934, lr=0.009852 | ETA 03:30:15 2022-07-25 07:44:22,963 - INFO - [TRAIN] epoch=72/160, iter=8300/18560, loss=3.514444, lr=0.009848 | ETA 03:34:01 2022-07-25 07:44:35,292 - INFO - [TRAIN] epoch=72/160, iter=8310/18560, loss=3.691700, lr=0.009845 | ETA 03:32:11 2022-07-25 07:44:47,545 - INFO - [TRAIN] epoch=72/160, iter=8320/18560, loss=4.093426, lr=0.009841 | ETA 03:28:54 2022-07-25 07:44:59,938 - INFO - [TRAIN] epoch=72/160, iter=8330/18560, loss=3.397266, lr=0.009838 | ETA 03:32:15 2022-07-25 07:45:12,306 - INFO - [TRAIN] epoch=72/160, iter=8340/18560, loss=3.705075, lr=0.009834 | ETA 03:31:17 2022-07-25 07:45:24,640 - INFO - [TRAIN] epoch=72/160, iter=8350/18560, loss=3.709002, lr=0.009831 | ETA 03:29:15 2022-07-25 07:46:04,253 - INFO - [TRAIN] epoch=73/160, iter=8360/18560, loss=3.754094, lr=0.009827 | ETA 03:49:41 2022-07-25 07:46:16,906 - INFO - [TRAIN] epoch=73/160, iter=8370/18560, loss=3.412206, lr=0.009823 | ETA 03:35:28 2022-07-25 07:46:29,321 - INFO - [TRAIN] epoch=73/160, iter=8380/18560, loss=3.374363, lr=0.009820 | ETA 03:30:34 2022-07-25 07:46:41,617 - INFO - [TRAIN] epoch=73/160, iter=8390/18560, loss=3.675907, lr=0.009816 | ETA 03:27:28 2022-07-25 07:46:53,924 - INFO - [TRAIN] epoch=73/160, iter=8400/18560, loss=3.706240, lr=0.009812 | ETA 03:28:07 2022-07-25 07:47:06,325 - INFO - [TRAIN] epoch=73/160, iter=8410/18560, loss=3.519502, lr=0.009808 | ETA 03:31:48 2022-07-25 07:47:18,629 - INFO - [TRAIN] epoch=73/160, iter=8420/18560, loss=3.561496, 
lr=0.009804 | ETA 03:28:34 2022-07-25 07:47:30,915 - INFO - [TRAIN] epoch=73/160, iter=8430/18560, loss=3.703257, lr=0.009800 | ETA 03:25:17 2022-07-25 07:47:43,263 - INFO - [TRAIN] epoch=73/160, iter=8440/18560, loss=3.529137, lr=0.009796 | ETA 03:29:34 2022-07-25 07:47:55,520 - INFO - [TRAIN] epoch=73/160, iter=8450/18560, loss=3.470604, lr=0.009792 | ETA 03:26:46 2022-07-25 07:48:07,861 - INFO - [TRAIN] epoch=73/160, iter=8460/18560, loss=3.344711, lr=0.009788 | ETA 03:27:33 2022-07-25 07:48:48,994 - INFO - [TRAIN] epoch=74/160, iter=8470/18560, loss=3.682595, lr=0.009784 | ETA 23:35:59 2022-07-25 07:49:01,545 - INFO - [TRAIN] epoch=74/160, iter=8480/18560, loss=3.658180, lr=0.009780 | ETA 03:33:29 2022-07-25 07:49:13,925 - INFO - [TRAIN] epoch=74/160, iter=8490/18560, loss=3.361843, lr=0.009776 | ETA 03:28:17 2022-07-25 07:49:26,211 - INFO - [TRAIN] epoch=74/160, iter=8500/18560, loss=3.608003, lr=0.009772 | ETA 03:27:20 2022-07-25 07:49:38,652 - INFO - [TRAIN] epoch=74/160, iter=8510/18560, loss=3.702143, lr=0.009768 | ETA 03:28:35 2022-07-25 07:49:51,125 - INFO - [TRAIN] epoch=74/160, iter=8520/18560, loss=3.744623, lr=0.009763 | ETA 03:31:13 2022-07-25 07:50:03,392 - INFO - [TRAIN] epoch=74/160, iter=8530/18560, loss=3.607784, lr=0.009759 | ETA 03:29:39 2022-07-25 07:50:15,818 - INFO - [TRAIN] epoch=74/160, iter=8540/18560, loss=3.487783, lr=0.009755 | ETA 03:27:59 2022-07-25 07:50:28,214 - INFO - [TRAIN] epoch=74/160, iter=8550/18560, loss=3.737367, lr=0.009750 | ETA 03:22:58 2022-07-25 07:50:40,439 - INFO - [TRAIN] epoch=74/160, iter=8560/18560, loss=3.306135, lr=0.009746 | ETA 03:23:49 2022-07-25 07:50:52,731 - INFO - [TRAIN] epoch=74/160, iter=8570/18560, loss=3.580703, lr=0.009741 | ETA 03:25:40 2022-07-25 07:51:04,795 - INFO - [TRAIN] epoch=74/160, iter=8580/18560, loss=3.683797, lr=0.009737 | ETA 03:20:49 2022-07-25 07:51:45,237 - INFO - [TRAIN] epoch=75/160, iter=8590/18560, loss=3.493112, lr=0.009732 | ETA 04:38:41 2022-07-25 07:51:57,498 - INFO - 
[TRAIN] epoch=75/160, iter=8600/18560, loss=3.670706, lr=0.009728 | ETA 03:24:26 2022-07-25 07:52:09,676 - INFO - [TRAIN] epoch=75/160, iter=8610/18560, loss=3.361343, lr=0.009723 | ETA 03:21:25 2022-07-25 07:52:21,810 - INFO - [TRAIN] epoch=75/160, iter=8620/18560, loss=3.562099, lr=0.009719 | ETA 03:23:38 2022-07-25 07:52:34,049 - INFO - [TRAIN] epoch=75/160, iter=8630/18560, loss=3.438572, lr=0.009714 | ETA 03:24:40 2022-07-25 07:52:46,298 - INFO - [TRAIN] epoch=75/160, iter=8640/18560, loss=3.457123, lr=0.009709 | ETA 03:22:16 2022-07-25 07:52:58,553 - INFO - [TRAIN] epoch=75/160, iter=8650/18560, loss=3.752472, lr=0.009704 | ETA 03:23:09 2022-07-25 07:53:10,810 - INFO - [TRAIN] epoch=75/160, iter=8660/18560, loss=3.850782, lr=0.009700 | ETA 03:22:40 2022-07-25 07:53:23,049 - INFO - [TRAIN] epoch=75/160, iter=8670/18560, loss=3.460026, lr=0.009695 | ETA 03:22:51 2022-07-25 07:53:35,220 - INFO - [TRAIN] epoch=75/160, iter=8680/18560, loss=3.545553, lr=0.009690 | ETA 03:19:43 2022-07-25 07:53:47,499 - INFO - [TRAIN] epoch=75/160, iter=8690/18560, loss=3.473513, lr=0.009685 | ETA 03:23:12 2022-07-25 07:53:59,803 - INFO - [TRAIN] epoch=75/160, iter=8700/18560, loss=3.441169, lr=0.009680 | ETA 03:23:51 2022-07-25 07:53:59,822 - INFO - Pop model from output_nospa/epoch_50 2022-07-25 07:54:00,071 - INFO - Push model to checkpoint output_nospa/epoch_75 2022-07-25 07:54:43,209 - INFO - [TRAIN] epoch=76/160, iter=8710/18560, loss=3.673371, lr=0.009675 | ETA 03:26:27 2022-07-25 07:54:55,598 - INFO - [TRAIN] epoch=76/160, iter=8720/18560, loss=3.397059, lr=0.009670 | ETA 03:19:52 2022-07-25 07:55:07,942 - INFO - [TRAIN] epoch=76/160, iter=8730/18560, loss=3.370057, lr=0.009665 | ETA 03:23:42 2022-07-25 07:55:20,237 - INFO - [TRAIN] epoch=76/160, iter=8740/18560, loss=3.736545, lr=0.009660 | ETA 03:22:09 2022-07-25 07:55:32,363 - INFO - [TRAIN] epoch=76/160, iter=8750/18560, loss=3.680709, lr=0.009655 | ETA 03:17:06 2022-07-25 07:55:44,553 - INFO - [TRAIN] epoch=76/160, 
iter=8760/18560, loss=3.431102, lr=0.009650 | ETA 03:17:01 2022-07-25 07:55:56,814 - INFO - [TRAIN] epoch=76/160, iter=8770/18560, loss=3.488012, lr=0.009644 | ETA 03:17:59 2022-07-25 07:56:09,019 - INFO - [TRAIN] epoch=76/160, iter=8780/18560, loss=3.691606, lr=0.009639 | ETA 03:17:23 2022-07-25 07:56:21,314 - INFO - [TRAIN] epoch=76/160, iter=8790/18560, loss=3.493721, lr=0.009634 | ETA 03:21:29 2022-07-25 07:56:33,509 - INFO - [TRAIN] epoch=76/160, iter=8800/18560, loss=3.738828, lr=0.009629 | ETA 03:18:37 2022-07-25 07:56:45,706 - INFO - [TRAIN] epoch=76/160, iter=8810/18560, loss=3.439663, lr=0.009623 | ETA 03:13:42 2022-07-25 07:57:31,791 - INFO - [TRAIN] epoch=77/160, iter=8820/18560, loss=3.296347, lr=0.009618 | ETA 09:02:54 2022-07-25 07:57:44,176 - INFO - [TRAIN] epoch=77/160, iter=8830/18560, loss=3.569680, lr=0.009612 | ETA 03:24:57 2022-07-25 07:57:56,700 - INFO - [TRAIN] epoch=77/160, iter=8840/18560, loss=3.356465, lr=0.009607 | ETA 03:25:37 2022-07-25 07:58:09,404 - INFO - [TRAIN] epoch=77/160, iter=8850/18560, loss=3.650765, lr=0.009601 | ETA 03:26:14 2022-07-25 07:58:22,022 - INFO - [TRAIN] epoch=77/160, iter=8860/18560, loss=3.457458, lr=0.009596 | ETA 03:20:10 2022-07-25 07:58:34,262 - INFO - [TRAIN] epoch=77/160, iter=8870/18560, loss=3.377877, lr=0.009590 | ETA 03:18:50 2022-07-25 07:58:46,593 - INFO - [TRAIN] epoch=77/160, iter=8880/18560, loss=3.638906, lr=0.009585 | ETA 03:19:52 2022-07-25 07:58:59,092 - INFO - [TRAIN] epoch=77/160, iter=8890/18560, loss=3.744231, lr=0.009579 | ETA 03:18:27 2022-07-25 07:59:11,566 - INFO - [TRAIN] epoch=77/160, iter=8900/18560, loss=3.776976, lr=0.009573 | ETA 03:20:26 2022-07-25 07:59:23,995 - INFO - [TRAIN] epoch=77/160, iter=8910/18560, loss=3.846321, lr=0.009568 | ETA 03:20:45 2022-07-25 07:59:36,452 - INFO - [TRAIN] epoch=77/160, iter=8920/18560, loss=3.390177, lr=0.009562 | ETA 03:21:06 2022-07-25 07:59:48,895 - INFO - [TRAIN] epoch=77/160, iter=8930/18560, loss=3.813742, lr=0.009556 | ETA 03:15:37 
2022-07-25 08:00:38,549 - INFO - [TRAIN] epoch=78/160, iter=8940/18560, loss=3.607627, lr=0.009550 | ETA 03:44:03 2022-07-25 08:00:50,791 - INFO - [TRAIN] epoch=78/160, iter=8950/18560, loss=3.531566, lr=0.009544 | ETA 03:17:56 2022-07-25 08:01:03,261 - INFO - [TRAIN] epoch=78/160, iter=8960/18560, loss=3.460867, lr=0.009538 | ETA 03:20:11 2022-07-25 08:01:15,670 - INFO - [TRAIN] epoch=78/160, iter=8970/18560, loss=3.693913, lr=0.009533 | ETA 03:18:34 2022-07-25 08:01:28,122 - INFO - [TRAIN] epoch=78/160, iter=8980/18560, loss=3.368493, lr=0.009527 | ETA 03:22:17 2022-07-25 08:01:40,361 - INFO - [TRAIN] epoch=78/160, iter=8990/18560, loss=3.546207, lr=0.009521 | ETA 03:13:37 2022-07-25 08:01:52,664 - INFO - [TRAIN] epoch=78/160, iter=9000/18560, loss=3.336472, lr=0.009515 | ETA 03:14:05 2022-07-25 08:02:04,943 - INFO - [TRAIN] epoch=78/160, iter=9010/18560, loss=3.766714, lr=0.009508 | ETA 03:14:48 2022-07-25 08:02:17,260 - INFO - [TRAIN] epoch=78/160, iter=9020/18560, loss=3.400751, lr=0.009502 | ETA 03:14:26 2022-07-25 08:02:29,557 - INFO - [TRAIN] epoch=78/160, iter=9030/18560, loss=3.323532, lr=0.009496 | ETA 03:13:47 2022-07-25 08:02:41,965 - INFO - [TRAIN] epoch=78/160, iter=9040/18560, loss=3.347608, lr=0.009490 | ETA 03:13:52 2022-07-25 08:03:27,529 - INFO - [TRAIN] epoch=79/160, iter=9050/18560, loss=3.526950, lr=0.009484 | ETA 25:15:17 2022-07-25 08:03:39,890 - INFO - [TRAIN] epoch=79/160, iter=9060/18560, loss=3.346579, lr=0.009477 | ETA 03:18:51 2022-07-25 08:03:52,587 - INFO - [TRAIN] epoch=79/160, iter=9070/18560, loss=3.364770, lr=0.009471 | ETA 03:21:55 2022-07-25 08:04:04,980 - INFO - [TRAIN] epoch=79/160, iter=9080/18560, loss=3.562746, lr=0.009465 | ETA 03:13:41 2022-07-25 08:04:17,248 - INFO - [TRAIN] epoch=79/160, iter=9090/18560, loss=3.619535, lr=0.009458 | ETA 03:13:09 2022-07-25 08:04:29,523 - INFO - [TRAIN] epoch=79/160, iter=9100/18560, loss=3.522199, lr=0.009452 | ETA 03:12:01 2022-07-25 08:04:41,755 - INFO - [TRAIN] epoch=79/160, 
iter=9110/18560, loss=3.601765, lr=0.009446 | ETA 03:11:21 2022-07-25 08:04:54,142 - INFO - [TRAIN] epoch=79/160, iter=9120/18560, loss=3.589159, lr=0.009439 | ETA 03:12:06 2022-07-25 08:05:06,484 - INFO - [TRAIN] epoch=79/160, iter=9130/18560, loss=3.667692, lr=0.009433 | ETA 03:16:29 2022-07-25 08:05:18,916 - INFO - [TRAIN] epoch=79/160, iter=9140/18560, loss=3.382593, lr=0.009426 | ETA 03:16:52 2022-07-25 08:05:31,136 - INFO - [TRAIN] epoch=79/160, iter=9150/18560, loss=3.450384, lr=0.009420 | ETA 03:11:14 2022-07-25 08:05:43,377 - INFO - [TRAIN] epoch=79/160, iter=9160/18560, loss=3.445092, lr=0.009413 | ETA 03:10:08 2022-07-25 08:06:29,366 - INFO - [TRAIN] epoch=80/160, iter=9170/18560, loss=3.494115, lr=0.009406 | ETA 04:35:34 2022-07-25 08:06:42,037 - INFO - [TRAIN] epoch=80/160, iter=9180/18560, loss=3.207797, lr=0.009400 | ETA 03:27:39 2022-07-25 08:06:54,708 - INFO - [TRAIN] epoch=80/160, iter=9190/18560, loss=3.674624, lr=0.009393 | ETA 03:28:27 2022-07-25 08:07:07,445 - INFO - [TRAIN] epoch=80/160, iter=9200/18560, loss=3.489638, lr=0.009386 | ETA 03:27:41 2022-07-25 08:07:19,805 - INFO - [TRAIN] epoch=80/160, iter=9210/18560, loss=3.707080, lr=0.009379 | ETA 03:16:04 2022-07-25 08:07:32,105 - INFO - [TRAIN] epoch=80/160, iter=9220/18560, loss=3.358419, lr=0.009373 | ETA 03:09:02 2022-07-25 08:07:44,506 - INFO - [TRAIN] epoch=80/160, iter=9230/18560, loss=3.250311, lr=0.009366 | ETA 03:13:32 2022-07-25 08:07:57,081 - INFO - [TRAIN] epoch=80/160, iter=9240/18560, loss=3.312706, lr=0.009359 | ETA 03:17:22 2022-07-25 08:08:09,615 - INFO - [TRAIN] epoch=80/160, iter=9250/18560, loss=3.819437, lr=0.009352 | ETA 03:11:31 2022-07-25 08:08:22,054 - INFO - [TRAIN] epoch=80/160, iter=9260/18560, loss=3.428592, lr=0.009345 | ETA 03:14:01 2022-07-25 08:08:34,571 - INFO - [TRAIN] epoch=80/160, iter=9270/18560, loss=3.304952, lr=0.009338 | ETA 03:16:58 2022-07-25 08:08:47,095 - INFO - [TRAIN] epoch=80/160, iter=9280/18560, loss=3.647962, lr=0.009331 | ETA 03:13:35 
2022-07-25 08:08:47,120 - INFO - Pop model from output_nospa/epoch_55 2022-07-25 08:08:47,370 - INFO - Push model to checkpoint output_nospa/epoch_80 2022-07-25 08:09:30,444 - INFO - [TRAIN] epoch=81/160, iter=9290/18560, loss=3.634476, lr=0.009324 | ETA 03:21:14 2022-07-25 08:09:42,917 - INFO - [TRAIN] epoch=81/160, iter=9300/18560, loss=3.266754, lr=0.009317 | ETA 03:14:16 2022-07-25 08:09:55,260 - INFO - [TRAIN] epoch=81/160, iter=9310/18560, loss=3.348125, lr=0.009310 | ETA 03:10:12 2022-07-25 08:10:07,757 - INFO - [TRAIN] epoch=81/160, iter=9320/18560, loss=3.321622, lr=0.009302 | ETA 03:11:56 2022-07-25 08:10:20,085 - INFO - [TRAIN] epoch=81/160, iter=9330/18560, loss=3.364274, lr=0.009295 | ETA 03:09:37 2022-07-25 08:10:32,496 - INFO - [TRAIN] epoch=81/160, iter=9340/18560, loss=3.541820, lr=0.009288 | ETA 03:12:05 2022-07-25 08:10:44,899 - INFO - [TRAIN] epoch=81/160, iter=9350/18560, loss=3.455652, lr=0.009281 | ETA 03:11:26 2022-07-25 08:10:57,474 - INFO - [TRAIN] epoch=81/160, iter=9360/18560, loss=3.668101, lr=0.009273 | ETA 03:10:27 2022-07-25 08:11:10,028 - INFO - [TRAIN] epoch=81/160, iter=9370/18560, loss=3.480426, lr=0.009266 | ETA 03:11:30 2022-07-25 08:11:22,474 - INFO - [TRAIN] epoch=81/160, iter=9380/18560, loss=3.444193, lr=0.009259 | ETA 03:10:06 2022-07-25 08:11:34,839 - INFO - [TRAIN] epoch=81/160, iter=9390/18560, loss=3.521509, lr=0.009251 | ETA 03:11:34 2022-07-25 08:12:16,109 - INFO - [TRAIN] epoch=82/160, iter=9400/18560, loss=3.421633, lr=0.009244 | ETA 07:46:18 2022-07-25 08:12:28,574 - INFO - [TRAIN] epoch=82/160, iter=9410/18560, loss=3.351946, lr=0.009236 | ETA 03:09:07 2022-07-25 08:12:41,019 - INFO - [TRAIN] epoch=82/160, iter=9420/18560, loss=3.309750, lr=0.009229 | ETA 03:08:03 2022-07-25 08:12:53,444 - INFO - [TRAIN] epoch=82/160, iter=9430/18560, loss=3.360100, lr=0.009221 | ETA 03:10:02 2022-07-25 08:13:05,843 - INFO - [TRAIN] epoch=82/160, iter=9440/18560, loss=3.237798, lr=0.009214 | ETA 03:07:37 2022-07-25 08:13:18,200 - 
INFO - [TRAIN] epoch=82/160, iter=9450/18560, loss=3.380136, lr=0.009206 | ETA 03:07:36 2022-07-25 08:13:30,648 - INFO - [TRAIN] epoch=82/160, iter=9460/18560, loss=3.471879, lr=0.009198 | ETA 03:08:12 2022-07-25 08:13:43,265 - INFO - [TRAIN] epoch=82/160, iter=9470/18560, loss=3.567660, lr=0.009191 | ETA 03:06:09 2022-07-25 08:13:55,611 - INFO - [TRAIN] epoch=82/160, iter=9480/18560, loss=3.631871, lr=0.009183 | ETA 03:06:10 2022-07-25 08:14:08,049 - INFO - [TRAIN] epoch=82/160, iter=9490/18560, loss=3.388497, lr=0.009175 | ETA 03:08:31 2022-07-25 08:14:20,422 - INFO - [TRAIN] epoch=82/160, iter=9500/18560, loss=3.261516, lr=0.009168 | ETA 03:07:22 2022-07-25 08:14:32,722 - INFO - [TRAIN] epoch=82/160, iter=9510/18560, loss=3.428783, lr=0.009160 | ETA 03:04:39 2022-07-25 08:15:12,394 - INFO - [TRAIN] epoch=83/160, iter=9520/18560, loss=3.510748, lr=0.009152 | ETA 03:23:22 2022-07-25 08:15:24,774 - INFO - [TRAIN] epoch=83/160, iter=9530/18560, loss=3.448099, lr=0.009144 | ETA 03:03:53 2022-07-25 08:15:37,331 - INFO - [TRAIN] epoch=83/160, iter=9540/18560, loss=3.113998, lr=0.009136 | ETA 03:05:13 2022-07-25 08:15:49,642 - INFO - [TRAIN] epoch=83/160, iter=9550/18560, loss=3.706029, lr=0.009128 | ETA 03:04:28 2022-07-25 08:16:01,997 - INFO - [TRAIN] epoch=83/160, iter=9560/18560, loss=3.513803, lr=0.009120 | ETA 03:06:27 2022-07-25 08:16:14,518 - INFO - [TRAIN] epoch=83/160, iter=9570/18560, loss=3.242008, lr=0.009112 | ETA 03:08:08 2022-07-25 08:16:26,861 - INFO - [TRAIN] epoch=83/160, iter=9580/18560, loss=3.341433, lr=0.009104 | ETA 03:03:38 2022-07-25 08:16:39,391 - INFO - [TRAIN] epoch=83/160, iter=9590/18560, loss=3.465596, lr=0.009096 | ETA 03:06:44 2022-07-25 08:16:51,803 - INFO - [TRAIN] epoch=83/160, iter=9600/18560, loss=3.384930, lr=0.009088 | ETA 03:04:47 2022-07-25 08:17:04,329 - INFO - [TRAIN] epoch=83/160, iter=9610/18560, loss=3.380590, lr=0.009080 | ETA 03:05:18 2022-07-25 08:17:16,857 - INFO - [TRAIN] epoch=83/160, iter=9620/18560, loss=3.287804, 
lr=0.009072 | ETA 03:05:54 2022-07-25 08:17:57,600 - INFO - [TRAIN] epoch=84/160, iter=9630/18560, loss=3.493300, lr=0.009063 | ETA 21:18:04 2022-07-25 08:18:09,981 - INFO - [TRAIN] epoch=84/160, iter=9640/18560, loss=3.419723, lr=0.009055 | ETA 03:08:25 2022-07-25 08:18:22,519 - INFO - [TRAIN] epoch=84/160, iter=9650/18560, loss=3.096943, lr=0.009047 | ETA 03:04:19 2022-07-25 08:18:34,867 - INFO - [TRAIN] epoch=84/160, iter=9660/18560, loss=3.267535, lr=0.009039 | ETA 03:01:05 2022-07-25 08:18:47,272 - INFO - [TRAIN] epoch=84/160, iter=9670/18560, loss=3.266222, lr=0.009030 | ETA 03:04:57 2022-07-25 08:18:59,825 - INFO - [TRAIN] epoch=84/160, iter=9680/18560, loss=3.222490, lr=0.009022 | ETA 03:08:37 2022-07-25 08:19:12,252 - INFO - [TRAIN] epoch=84/160, iter=9690/18560, loss=3.271694, lr=0.009014 | ETA 03:04:21 2022-07-25 08:19:24,953 - INFO - [TRAIN] epoch=84/160, iter=9700/18560, loss=3.483917, lr=0.009005 | ETA 03:07:05 2022-07-25 08:19:37,196 - INFO - [TRAIN] epoch=84/160, iter=9710/18560, loss=3.649204, lr=0.008997 | ETA 03:04:35 2022-07-25 08:19:49,732 - INFO - [TRAIN] epoch=84/160, iter=9720/18560, loss=3.309334, lr=0.008988 | ETA 03:03:25 2022-07-25 08:20:02,194 - INFO - [TRAIN] epoch=84/160, iter=9730/18560, loss=3.512726, lr=0.008980 | ETA 03:03:13 2022-07-25 08:20:14,612 - INFO - [TRAIN] epoch=84/160, iter=9740/18560, loss=3.683714, lr=0.008971 | ETA 03:04:47 2022-07-25 08:20:55,900 - INFO - [TRAIN] epoch=85/160, iter=9750/18560, loss=3.298815, lr=0.008962 | ETA 04:11:20 2022-07-25 08:21:08,522 - INFO - [TRAIN] epoch=85/160, iter=9760/18560, loss=3.154660, lr=0.008954 | ETA 03:04:22 2022-07-25 08:21:21,078 - INFO - [TRAIN] epoch=85/160, iter=9770/18560, loss=3.133054, lr=0.008945 | ETA 03:02:59 2022-07-25 08:21:33,701 - INFO - [TRAIN] epoch=85/160, iter=9780/18560, loss=3.120700, lr=0.008937 | ETA 03:06:28 2022-07-25 08:21:46,234 - INFO - [TRAIN] epoch=85/160, iter=9790/18560, loss=3.227604, lr=0.008928 | ETA 03:03:22 2022-07-25 08:21:58,715 - INFO - 
[TRAIN] epoch=85/160, iter=9800/18560, loss=3.241292, lr=0.008919 | ETA 03:00:28 2022-07-25 08:22:11,029 - INFO - [TRAIN] epoch=85/160, iter=9810/18560, loss=3.489306, lr=0.008910 | ETA 03:01:52 2022-07-25 08:22:23,466 - INFO - [TRAIN] epoch=85/160, iter=9820/18560, loss=3.362939, lr=0.008901 | ETA 03:02:35 2022-07-25 08:22:36,106 - INFO - [TRAIN] epoch=85/160, iter=9830/18560, loss=3.315267, lr=0.008893 | ETA 03:06:22 2022-07-25 08:22:48,578 - INFO - [TRAIN] epoch=85/160, iter=9840/18560, loss=3.454982, lr=0.008884 | ETA 02:59:12 2022-07-25 08:23:01,020 - INFO - [TRAIN] epoch=85/160, iter=9850/18560, loss=3.289891, lr=0.008875 | ETA 03:02:15 2022-07-25 08:23:13,665 - INFO - [TRAIN] epoch=85/160, iter=9860/18560, loss=3.262405, lr=0.008866 | ETA 03:04:14 2022-07-25 08:23:13,687 - INFO - Pop model from output_nospa/epoch_60 2022-07-25 08:23:13,861 - INFO - Push model to checkpoint output_nospa/epoch_85 2022-07-25 08:23:54,167 - INFO - [TRAIN] epoch=86/160, iter=9870/18560, loss=3.485555, lr=0.008857 | ETA 03:07:09 2022-07-25 08:24:06,631 - INFO - [TRAIN] epoch=86/160, iter=9880/18560, loss=3.005390, lr=0.008848 | ETA 02:59:28 2022-07-25 08:24:19,072 - INFO - [TRAIN] epoch=86/160, iter=9890/18560, loss=3.243140, lr=0.008839 | ETA 03:01:39 2022-07-25 08:24:31,456 - INFO - [TRAIN] epoch=86/160, iter=9900/18560, loss=3.489470, lr=0.008830 | ETA 02:58:41 2022-07-25 08:24:43,816 - INFO - [TRAIN] epoch=86/160, iter=9910/18560, loss=3.337585, lr=0.008821 | ETA 02:58:05 2022-07-25 08:24:56,176 - INFO - [TRAIN] epoch=86/160, iter=9920/18560, loss=3.114650, lr=0.008812 | ETA 02:57:49 2022-07-25 08:25:08,596 - INFO - [TRAIN] epoch=86/160, iter=9930/18560, loss=3.380211, lr=0.008803 | ETA 02:59:28 2022-07-25 08:25:20,954 - INFO - [TRAIN] epoch=86/160, iter=9940/18560, loss=3.357419, lr=0.008793 | ETA 02:58:54 2022-07-25 08:25:33,285 - INFO - [TRAIN] epoch=86/160, iter=9950/18560, loss=3.134872, lr=0.008784 | ETA 02:56:13 2022-07-25 08:25:45,502 - INFO - [TRAIN] epoch=86/160, 
iter=9960/18560, loss=3.278058, lr=0.008775 | ETA 02:54:31 2022-07-25 08:25:57,978 - INFO - [TRAIN] epoch=86/160, iter=9970/18560, loss=3.447657, lr=0.008766 | ETA 02:58:53 2022-07-25 08:26:39,957 - INFO - [TRAIN] epoch=87/160, iter=9980/18560, loss=3.470152, lr=0.008756 | ETA 07:24:49 2022-07-25 08:26:52,467 - INFO - [TRAIN] epoch=87/160, iter=9990/18560, loss=3.488458, lr=0.008747 | ETA 02:56:32 2022-07-25 08:27:05,072 - INFO - [TRAIN] epoch=87/160, iter=10000/18560, loss=3.261996, lr=0.008738 | ETA 03:03:15 2022-07-25 08:27:17,380 - INFO - [TRAIN] epoch=87/160, iter=10010/18560, loss=3.355085, lr=0.008728 | ETA 02:54:01 2022-07-25 08:27:29,880 - INFO - [TRAIN] epoch=87/160, iter=10020/18560, loss=3.761435, lr=0.008719 | ETA 02:58:12 2022-07-25 08:27:42,269 - INFO - [TRAIN] epoch=87/160, iter=10030/18560, loss=3.266600, lr=0.008710 | ETA 02:59:36 2022-07-25 08:27:54,658 - INFO - [TRAIN] epoch=87/160, iter=10040/18560, loss=3.204396, lr=0.008700 | ETA 02:57:10 2022-07-25 08:28:06,892 - INFO - [TRAIN] epoch=87/160, iter=10050/18560, loss=3.506287, lr=0.008691 | ETA 02:54:12 2022-07-25 08:28:19,232 - INFO - [TRAIN] epoch=87/160, iter=10060/18560, loss=3.437841, lr=0.008681 | ETA 02:53:55 2022-07-25 08:28:31,746 - INFO - [TRAIN] epoch=87/160, iter=10070/18560, loss=3.457854, lr=0.008671 | ETA 02:56:13 2022-07-25 08:28:44,188 - INFO - [TRAIN] epoch=87/160, iter=10080/18560, loss=3.134370, lr=0.008662 | ETA 02:54:10 2022-07-25 08:28:56,679 - INFO - [TRAIN] epoch=87/160, iter=10090/18560, loss=3.619268, lr=0.008652 | ETA 02:57:45 2022-07-25 08:29:38,645 - INFO - [TRAIN] epoch=88/160, iter=10100/18560, loss=3.596011, lr=0.008643 | ETA 03:15:12 2022-07-25 08:29:51,155 - INFO - [TRAIN] epoch=88/160, iter=10110/18560, loss=3.083885, lr=0.008633 | ETA 02:56:53 2022-07-25 08:30:03,533 - INFO - [TRAIN] epoch=88/160, iter=10120/18560, loss=3.215671, lr=0.008623 | ETA 02:54:23 2022-07-25 08:30:16,114 - INFO - [TRAIN] epoch=88/160, iter=10130/18560, loss=3.452597, lr=0.008613 | 
ETA 02:57:24 2022-07-25 08:30:28,708 - INFO - [TRAIN] epoch=88/160, iter=10140/18560, loss=3.391615, lr=0.008604 | ETA 03:00:16 2022-07-25 08:30:41,141 - INFO - [TRAIN] epoch=88/160, iter=10150/18560, loss=3.306890, lr=0.008594 | ETA 02:54:21 2022-07-25 08:30:53,496 - INFO - [TRAIN] epoch=88/160, iter=10160/18560, loss=3.254950, lr=0.008584 | ETA 02:55:42 2022-07-25 08:31:05,982 - INFO - [TRAIN] epoch=88/160, iter=10170/18560, loss=3.634405, lr=0.008574 | ETA 02:57:01 2022-07-25 08:31:18,491 - INFO - [TRAIN] epoch=88/160, iter=10180/18560, loss=3.485415, lr=0.008564 | ETA 02:54:30 2022-07-25 08:31:30,842 - INFO - [TRAIN] epoch=88/160, iter=10190/18560, loss=3.434372, lr=0.008554 | ETA 02:55:53 2022-07-25 08:31:43,284 - INFO - [TRAIN] epoch=88/160, iter=10200/18560, loss=3.360700, lr=0.008545 | ETA 02:51:45 2022-07-25 08:32:24,847 - INFO - [TRAIN] epoch=89/160, iter=10210/18560, loss=3.559737, lr=0.008535 | ETA 19:50:55 2022-07-25 08:32:37,412 - INFO - [TRAIN] epoch=89/160, iter=10220/18560, loss=3.046626, lr=0.008525 | ETA 02:56:39 2022-07-25 08:32:49,845 - INFO - [TRAIN] epoch=89/160, iter=10230/18560, loss=3.228349, lr=0.008515 | ETA 02:54:10 2022-07-25 08:33:02,272 - INFO - [TRAIN] epoch=89/160, iter=10240/18560, loss=3.009584, lr=0.008504 | ETA 02:53:18 2022-07-25 08:33:14,584 - INFO - [TRAIN] epoch=89/160, iter=10250/18560, loss=3.276048, lr=0.008494 | ETA 02:50:20 2022-07-25 08:33:27,115 - INFO - [TRAIN] epoch=89/160, iter=10260/18560, loss=3.281761, lr=0.008484 | ETA 02:55:11 2022-07-25 08:33:39,638 - INFO - [TRAIN] epoch=89/160, iter=10270/18560, loss=3.210946, lr=0.008474 | ETA 02:54:03 2022-07-25 08:33:52,078 - INFO - [TRAIN] epoch=89/160, iter=10280/18560, loss=3.369486, lr=0.008464 | ETA 02:50:00 2022-07-25 08:34:04,483 - INFO - [TRAIN] epoch=89/160, iter=10290/18560, loss=3.630920, lr=0.008454 | ETA 02:49:48 2022-07-25 08:34:16,898 - INFO - [TRAIN] epoch=89/160, iter=10300/18560, loss=3.423745, lr=0.008444 | ETA 02:48:35 2022-07-25 08:34:29,349 - INFO 
- [TRAIN] epoch=89/160, iter=10310/18560, loss=3.355894, lr=0.008433 | ETA 02:51:08 2022-07-25 08:34:41,804 - INFO - [TRAIN] epoch=89/160, iter=10320/18560, loss=3.484310, lr=0.008423 | ETA 02:46:19 2022-07-25 08:35:24,965 - INFO - [TRAIN] epoch=90/160, iter=10330/18560, loss=3.753442, lr=0.008413 | ETA 03:53:47 2022-07-25 08:35:37,503 - INFO - [TRAIN] epoch=90/160, iter=10340/18560, loss=3.485920, lr=0.008403 | ETA 02:51:48 2022-07-25 08:35:50,133 - INFO - [TRAIN] epoch=90/160, iter=10350/18560, loss=3.062736, lr=0.008392 | ETA 02:53:55 2022-07-25 08:36:02,532 - INFO - [TRAIN] epoch=90/160, iter=10360/18560, loss=3.048450, lr=0.008382 | ETA 02:50:01 2022-07-25 08:36:15,154 - INFO - [TRAIN] epoch=90/160, iter=10370/18560, loss=3.277883, lr=0.008371 | ETA 02:51:47 2022-07-25 08:36:27,550 - INFO - [TRAIN] epoch=90/160, iter=10380/18560, loss=2.948320, lr=0.008361 | ETA 02:50:01 2022-07-25 08:36:39,912 - INFO - [TRAIN] epoch=90/160, iter=10390/18560, loss=3.047098, lr=0.008350 | ETA 02:48:40 2022-07-25 08:36:52,280 - INFO - [TRAIN] epoch=90/160, iter=10400/18560, loss=3.526889, lr=0.008340 | ETA 02:49:25 2022-07-25 08:37:04,833 - INFO - [TRAIN] epoch=90/160, iter=10410/18560, loss=3.338371, lr=0.008329 | ETA 02:47:44 2022-07-25 08:37:17,214 - INFO - [TRAIN] epoch=90/160, iter=10420/18560, loss=3.217394, lr=0.008319 | ETA 02:47:01 2022-07-25 08:37:29,726 - INFO - [TRAIN] epoch=90/160, iter=10430/18560, loss=3.352630, lr=0.008308 | ETA 02:48:01 2022-07-25 08:37:42,153 - INFO - [TRAIN] epoch=90/160, iter=10440/18560, loss=3.386920, lr=0.008298 | ETA 02:50:04 2022-07-25 08:37:42,171 - INFO - Pop model from output_nospa/epoch_65 2022-07-25 08:37:42,324 - INFO - Push model to checkpoint output_nospa/epoch_90 2022-07-25 08:38:23,764 - INFO - [TRAIN] epoch=91/160, iter=10450/18560, loss=3.177610, lr=0.008287 | ETA 02:51:08 2022-07-25 08:38:36,222 - INFO - [TRAIN] epoch=91/160, iter=10460/18560, loss=3.300298, lr=0.008277 | ETA 02:48:17 2022-07-25 08:38:48,681 - INFO - [TRAIN] 
epoch=91/160, iter=10470/18560, loss=3.481592, lr=0.008266 | ETA 02:46:10 2022-07-25 08:39:01,139 - INFO - [TRAIN] epoch=91/160, iter=10480/18560, loss=3.368852, lr=0.008255 | ETA 02:50:33 2022-07-25 08:39:13,499 - INFO - [TRAIN] epoch=91/160, iter=10490/18560, loss=3.256306, lr=0.008244 | ETA 02:45:23 2022-07-25 08:39:25,907 - INFO - [TRAIN] epoch=91/160, iter=10500/18560, loss=3.131463, lr=0.008234 | ETA 02:47:43 2022-07-25 08:39:38,394 - INFO - [TRAIN] epoch=91/160, iter=10510/18560, loss=3.287805, lr=0.008223 | ETA 02:47:34 2022-07-25 08:39:50,888 - INFO - [TRAIN] epoch=91/160, iter=10520/18560, loss=3.209331, lr=0.008212 | ETA 02:45:06 2022-07-25 08:40:03,389 - INFO - [TRAIN] epoch=91/160, iter=10530/18560, loss=3.148533, lr=0.008201 | ETA 02:46:09 2022-07-25 08:40:15,916 - INFO - [TRAIN] epoch=91/160, iter=10540/18560, loss=3.198325, lr=0.008190 | ETA 02:45:32 2022-07-25 08:40:28,495 - INFO - [TRAIN] epoch=91/160, iter=10550/18560, loss=3.255211, lr=0.008180 | ETA 02:46:47 2022-07-25 08:41:10,075 - INFO - [TRAIN] epoch=92/160, iter=10560/18560, loss=3.360529, lr=0.008169 | ETA 06:57:03 2022-07-25 08:41:22,619 - INFO - [TRAIN] epoch=92/160, iter=10570/18560, loss=3.211112, lr=0.008158 | ETA 02:45:58 2022-07-25 08:41:35,110 - INFO - [TRAIN] epoch=92/160, iter=10580/18560, loss=3.196652, lr=0.008147 | ETA 02:45:39 2022-07-25 08:41:47,604 - INFO - [TRAIN] epoch=92/160, iter=10590/18560, loss=3.033416, lr=0.008136 | ETA 02:44:49 2022-07-25 08:41:59,965 - INFO - [TRAIN] epoch=92/160, iter=10600/18560, loss=3.353486, lr=0.008125 | ETA 02:44:09 2022-07-25 08:42:12,389 - INFO - [TRAIN] epoch=92/160, iter=10610/18560, loss=3.133088, lr=0.008114 | ETA 02:46:45 2022-07-25 08:42:24,998 - INFO - [TRAIN] epoch=92/160, iter=10620/18560, loss=3.150702, lr=0.008103 | ETA 02:46:51 2022-07-25 08:42:37,326 - INFO - [TRAIN] epoch=92/160, iter=10630/18560, loss=3.312844, lr=0.008092 | ETA 02:42:38 2022-07-25 08:42:49,693 - INFO - [TRAIN] epoch=92/160, iter=10640/18560, 
loss=3.387940, lr=0.008081 | ETA 02:45:25 2022-07-25 08:43:02,127 - INFO - [TRAIN] epoch=92/160, iter=10650/18560, loss=3.210300, lr=0.008070 | ETA 02:42:28 2022-07-25 08:43:14,505 - INFO - [TRAIN] epoch=92/160, iter=10660/18560, loss=3.220086, lr=0.008058 | ETA 02:43:40 2022-07-25 08:43:26,827 - INFO - [TRAIN] epoch=92/160, iter=10670/18560, loss=3.161281, lr=0.008047 | ETA 02:44:40 2022-07-25 08:44:09,027 - INFO - [TRAIN] epoch=93/160, iter=10680/18560, loss=3.395065, lr=0.008036 | ETA 02:56:50 2022-07-25 08:44:21,464 - INFO - [TRAIN] epoch=93/160, iter=10690/18560, loss=3.169246, lr=0.008025 | ETA 02:43:37 2022-07-25 08:44:33,946 - INFO - [TRAIN] epoch=93/160, iter=10700/18560, loss=3.038050, lr=0.008014 | ETA 02:43:17 2022-07-25 08:44:46,272 - INFO - [TRAIN] epoch=93/160, iter=10710/18560, loss=3.237648, lr=0.008002 | ETA 02:41:49 2022-07-25 08:44:58,581 - INFO - [TRAIN] epoch=93/160, iter=10720/18560, loss=3.436042, lr=0.007991 | ETA 02:40:55 2022-07-25 08:45:10,896 - INFO - [TRAIN] epoch=93/160, iter=10730/18560, loss=3.309338, lr=0.007980 | ETA 02:40:09 2022-07-25 08:45:23,321 - INFO - [TRAIN] epoch=93/160, iter=10740/18560, loss=3.477970, lr=0.007968 | ETA 02:40:28 2022-07-25 08:45:35,745 - INFO - [TRAIN] epoch=93/160, iter=10750/18560, loss=3.297312, lr=0.007957 | ETA 02:41:41 2022-07-25 08:45:48,125 - INFO - [TRAIN] epoch=93/160, iter=10760/18560, loss=3.521025, lr=0.007946 | ETA 02:43:02 2022-07-25 08:46:00,506 - INFO - [TRAIN] epoch=93/160, iter=10770/18560, loss=3.193161, lr=0.007934 | ETA 02:35:44 2022-07-25 08:46:12,839 - INFO - [TRAIN] epoch=93/160, iter=10780/18560, loss=3.274135, lr=0.007923 | ETA 02:38:58 2022-07-25 08:46:53,825 - INFO - [TRAIN] epoch=94/160, iter=10790/18560, loss=3.374496, lr=0.007911 | ETA 18:00:04 2022-07-25 08:47:06,846 - INFO - [TRAIN] epoch=94/160, iter=10800/18560, loss=3.032939, lr=0.007900 | ETA 02:43:31 2022-07-25 08:47:19,316 - INFO - [TRAIN] epoch=94/160, iter=10810/18560, loss=3.081813, lr=0.007888 | ETA 02:38:20 
2022-07-25 08:47:31,742 - INFO - [TRAIN] epoch=94/160, iter=10820/18560, loss=3.217890, lr=0.007877 | ETA 02:42:14 2022-07-25 08:47:44,069 - INFO - [TRAIN] epoch=94/160, iter=10830/18560, loss=3.319854, lr=0.007865 | ETA 02:39:55 2022-07-25 08:47:56,410 - INFO - [TRAIN] epoch=94/160, iter=10840/18560, loss=3.347079, lr=0.007854 | ETA 02:39:58 2022-07-25 08:48:08,809 - INFO - [TRAIN] epoch=94/160, iter=10850/18560, loss=3.236895, lr=0.007842 | ETA 02:40:22 2022-07-25 08:48:21,229 - INFO - [TRAIN] epoch=94/160, iter=10860/18560, loss=3.280587, lr=0.007830 | ETA 02:39:52 2022-07-25 08:48:33,719 - INFO - [TRAIN] epoch=94/160, iter=10870/18560, loss=3.312039, lr=0.007819 | ETA 02:39:42 2022-07-25 08:48:46,297 - INFO - [TRAIN] epoch=94/160, iter=10880/18560, loss=3.101091, lr=0.007807 | ETA 02:38:54 2022-07-25 08:48:58,708 - INFO - [TRAIN] epoch=94/160, iter=10890/18560, loss=3.319399, lr=0.007795 | ETA 02:38:42 2022-07-25 08:49:11,237 - INFO - [TRAIN] epoch=94/160, iter=10900/18560, loss=3.223247, lr=0.007784 | ETA 02:41:38 2022-07-25 08:49:51,097 - INFO - [TRAIN] epoch=95/160, iter=10910/18560, loss=2.893342, lr=0.007772 | ETA 03:33:10 2022-07-25 08:50:03,675 - INFO - [TRAIN] epoch=95/160, iter=10920/18560, loss=3.225018, lr=0.007760 | ETA 02:36:08 2022-07-25 08:50:16,212 - INFO - [TRAIN] epoch=95/160, iter=10930/18560, loss=3.053012, lr=0.007748 | ETA 02:40:07 2022-07-25 08:50:28,672 - INFO - [TRAIN] epoch=95/160, iter=10940/18560, loss=2.962200, lr=0.007737 | ETA 02:39:58 2022-07-25 08:50:41,046 - INFO - [TRAIN] epoch=95/160, iter=10950/18560, loss=3.212374, lr=0.007725 | ETA 02:38:25 2022-07-25 08:50:53,392 - INFO - [TRAIN] epoch=95/160, iter=10960/18560, loss=3.207016, lr=0.007713 | ETA 02:36:11 2022-07-25 08:51:05,843 - INFO - [TRAIN] epoch=95/160, iter=10970/18560, loss=2.945694, lr=0.007701 | ETA 02:35:47 2022-07-25 08:51:18,123 - INFO - [TRAIN] epoch=95/160, iter=10980/18560, loss=3.302408, lr=0.007689 | ETA 02:34:17 2022-07-25 08:51:30,585 - INFO - [TRAIN] 
epoch=95/160, iter=10990/18560, loss=3.477212, lr=0.007677 | ETA 02:37:26 2022-07-25 08:51:43,121 - INFO - [TRAIN] epoch=95/160, iter=11000/18560, loss=3.287319, lr=0.007665 | ETA 02:38:20 2022-07-25 08:51:55,412 - INFO - [TRAIN] epoch=95/160, iter=11010/18560, loss=3.215349, lr=0.007654 | ETA 02:33:59 2022-07-25 08:52:07,784 - INFO - [TRAIN] epoch=95/160, iter=11020/18560, loss=3.205257, lr=0.007642 | ETA 02:37:09 2022-07-25 08:52:07,805 - INFO - Pop model from output_nospa/epoch_70 2022-07-25 08:52:08,000 - INFO - Push model to checkpoint output_nospa/epoch_95 2022-07-25 08:52:50,916 - INFO - [TRAIN] epoch=96/160, iter=11030/18560, loss=3.255156, lr=0.007630 | ETA 02:40:25 2022-07-25 08:53:03,350 - INFO - [TRAIN] epoch=96/160, iter=11040/18560, loss=3.249937, lr=0.007618 | ETA 02:35:21 2022-07-25 08:53:15,559 - INFO - [TRAIN] epoch=96/160, iter=11050/18560, loss=3.135648, lr=0.007606 | ETA 02:33:03 2022-07-25 08:53:27,793 - INFO - [TRAIN] epoch=96/160, iter=11060/18560, loss=3.316716, lr=0.007593 | ETA 02:32:23 2022-07-25 08:53:40,492 - INFO - [TRAIN] epoch=96/160, iter=11070/18560, loss=3.051198, lr=0.007581 | ETA 02:41:42 2022-07-25 08:53:52,996 - INFO - [TRAIN] epoch=96/160, iter=11080/18560, loss=3.143267, lr=0.007569 | ETA 02:37:05 2022-07-25 08:54:05,442 - INFO - [TRAIN] epoch=96/160, iter=11090/18560, loss=3.110779, lr=0.007557 | ETA 02:35:43 2022-07-25 08:54:17,924 - INFO - [TRAIN] epoch=96/160, iter=11100/18560, loss=3.248771, lr=0.007545 | ETA 02:35:17 2022-07-25 08:54:30,291 - INFO - [TRAIN] epoch=96/160, iter=11110/18560, loss=3.324015, lr=0.007533 | ETA 02:32:18 2022-07-25 08:54:42,534 - INFO - [TRAIN] epoch=96/160, iter=11120/18560, loss=3.046066, lr=0.007521 | ETA 02:30:42 2022-07-25 08:54:54,650 - INFO - [TRAIN] epoch=96/160, iter=11130/18560, loss=3.240249, lr=0.007509 | ETA 02:29:59 2022-07-25 08:55:35,689 - INFO - [TRAIN] epoch=97/160, iter=11140/18560, loss=3.485696, lr=0.007496 | ETA 06:14:03 2022-07-25 08:55:48,124 - INFO - [TRAIN] 
epoch=97/160, iter=11150/18560, loss=3.488059, lr=0.007484 | ETA 02:30:46 2022-07-25 08:56:00,387 - INFO - [TRAIN] epoch=97/160, iter=11160/18560, loss=2.985596, lr=0.007472 | ETA 02:29:15 2022-07-25 08:56:12,756 - INFO - [TRAIN] epoch=97/160, iter=11170/18560, loss=3.075811, lr=0.007460 | ETA 02:32:31 2022-07-25 08:56:24,987 - INFO - [TRAIN] epoch=97/160, iter=11180/18560, loss=3.227405, lr=0.007447 | ETA 02:31:00 2022-07-25 08:56:37,231 - INFO - [TRAIN] epoch=97/160, iter=11190/18560, loss=3.037826, lr=0.007435 | ETA 02:31:21 2022-07-25 08:56:49,690 - INFO - [TRAIN] epoch=97/160, iter=11200/18560, loss=3.350542, lr=0.007423 | ETA 02:32:23 2022-07-25 08:57:02,113 - INFO - [TRAIN] epoch=97/160, iter=11210/18560, loss=3.170082, lr=0.007410 | ETA 02:32:51 2022-07-25 08:57:14,405 - INFO - [TRAIN] epoch=97/160, iter=11220/18560, loss=3.284231, lr=0.007398 | ETA 02:29:05 2022-07-25 08:57:26,611 - INFO - [TRAIN] epoch=97/160, iter=11230/18560, loss=3.361252, lr=0.007386 | ETA 02:28:39 2022-07-25 08:57:38,936 - INFO - [TRAIN] epoch=97/160, iter=11240/18560, loss=3.075322, lr=0.007373 | ETA 02:31:27 2022-07-25 08:57:51,189 - INFO - [TRAIN] epoch=97/160, iter=11250/18560, loss=3.180089, lr=0.007361 | ETA 02:31:11 2022-07-25 08:58:33,550 - INFO - [TRAIN] epoch=98/160, iter=11260/18560, loss=3.137568, lr=0.007348 | ETA 02:46:39 2022-07-25 08:58:46,067 - INFO - [TRAIN] epoch=98/160, iter=11270/18560, loss=3.187177, lr=0.007336 | ETA 02:31:49 2022-07-25 08:58:58,567 - INFO - [TRAIN] epoch=98/160, iter=11280/18560, loss=2.775687, lr=0.007323 | ETA 02:30:12 2022-07-25 08:59:10,836 - INFO - [TRAIN] epoch=98/160, iter=11290/18560, loss=3.148010, lr=0.007311 | ETA 02:27:30 2022-07-25 08:59:23,000 - INFO - [TRAIN] epoch=98/160, iter=11300/18560, loss=2.960816, lr=0.007298 | ETA 02:25:23 2022-07-25 08:59:35,255 - INFO - [TRAIN] epoch=98/160, iter=11310/18560, loss=3.098032, lr=0.007286 | ETA 02:28:08 2022-07-25 08:59:47,641 - INFO - [TRAIN] epoch=98/160, iter=11320/18560, 
loss=3.187270, lr=0.007273 | ETA 02:30:57 2022-07-25 09:00:00,099 - INFO - [TRAIN] epoch=98/160, iter=11330/18560, loss=3.367446, lr=0.007261 | ETA 02:31:16 2022-07-25 09:00:12,548 - INFO - [TRAIN] epoch=98/160, iter=11340/18560, loss=3.304839, lr=0.007248 | ETA 02:29:02 2022-07-25 09:00:24,889 - INFO - [TRAIN] epoch=98/160, iter=11350/18560, loss=3.174959, lr=0.007235 | ETA 02:24:51 2022-07-25 09:00:37,185 - INFO - [TRAIN] epoch=98/160, iter=11360/18560, loss=3.112798, lr=0.007223 | ETA 02:25:51 2022-07-25 09:01:20,090 - INFO - [TRAIN] epoch=99/160, iter=11370/18560, loss=3.100169, lr=0.007210 | ETA 17:50:00 2022-07-25 09:01:32,614 - INFO - [TRAIN] epoch=99/160, iter=11380/18560, loss=3.258999, lr=0.007198 | ETA 02:30:40 2022-07-25 09:01:45,060 - INFO - [TRAIN] epoch=99/160, iter=11390/18560, loss=3.122404, lr=0.007185 | ETA 02:28:08 2022-07-25 09:01:57,447 - INFO - [TRAIN] epoch=99/160, iter=11400/18560, loss=3.180544, lr=0.007172 | ETA 02:27:12 2022-07-25 09:02:10,166 - INFO - [TRAIN] epoch=99/160, iter=11410/18560, loss=3.245804, lr=0.007159 | ETA 02:31:36 2022-07-25 09:02:22,528 - INFO - [TRAIN] epoch=99/160, iter=11420/18560, loss=3.346723, lr=0.007147 | ETA 02:27:20 2022-07-25 09:02:34,753 - INFO - [TRAIN] epoch=99/160, iter=11430/18560, loss=3.145696, lr=0.007134 | ETA 02:24:18 2022-07-25 09:02:47,009 - INFO - [TRAIN] epoch=99/160, iter=11440/18560, loss=3.126498, lr=0.007121 | ETA 02:25:02 2022-07-25 09:02:59,454 - INFO - [TRAIN] epoch=99/160, iter=11450/18560, loss=3.361419, lr=0.007108 | ETA 02:28:06 2022-07-25 09:03:11,949 - INFO - [TRAIN] epoch=99/160, iter=11460/18560, loss=3.212723, lr=0.007096 | ETA 02:27:35 2022-07-25 09:03:24,152 - INFO - [TRAIN] epoch=99/160, iter=11470/18560, loss=3.139335, lr=0.007083 | ETA 02:22:03 2022-07-25 09:03:36,386 - INFO - [TRAIN] epoch=99/160, iter=11480/18560, loss=2.974145, lr=0.007070 | ETA 02:24:23 2022-07-25 09:04:16,029 - INFO - [TRAIN] epoch=100/160, iter=11490/18560, loss=3.215379, lr=0.007057 | ETA 03:16:57 
2022-07-25 09:04:28,504 - INFO - [TRAIN] epoch=100/160, iter=11500/18560, loss=3.076925, lr=0.007044 | ETA 02:25:35 2022-07-25 09:04:41,018 - INFO - [TRAIN] epoch=100/160, iter=11510/18560, loss=3.009512, lr=0.007031 | ETA 02:25:52 2022-07-25 09:04:53,444 - INFO - [TRAIN] epoch=100/160, iter=11520/18560, loss=3.155597, lr=0.007018 | ETA 02:24:37 2022-07-25 09:05:05,831 - INFO - [TRAIN] epoch=100/160, iter=11530/18560, loss=3.366283, lr=0.007006 | ETA 02:25:42 2022-07-25 09:05:18,261 - INFO - [TRAIN] epoch=100/160, iter=11540/18560, loss=3.271309, lr=0.006993 | ETA 02:25:19 2022-07-25 09:05:30,753 - INFO - [TRAIN] epoch=100/160, iter=11550/18560, loss=2.908116, lr=0.006980 | ETA 02:26:43 2022-07-25 09:05:43,252 - INFO - [TRAIN] epoch=100/160, iter=11560/18560, loss=3.114237, lr=0.006967 | ETA 02:23:38 2022-07-25 09:05:55,749 - INFO - [TRAIN] epoch=100/160, iter=11570/18560, loss=3.498159, lr=0.006954 | ETA 02:26:30 2022-07-25 09:06:08,137 - INFO - [TRAIN] epoch=100/160, iter=11580/18560, loss=3.202354, lr=0.006941 | ETA 02:24:05 2022-07-25 09:06:20,292 - INFO - [TRAIN] epoch=100/160, iter=11590/18560, loss=3.252499, lr=0.006928 | ETA 02:19:08 2022-07-25 09:06:32,537 - INFO - [TRAIN] epoch=100/160, iter=11600/18560, loss=2.922505, lr=0.006915 | ETA 02:21:28 2022-07-25 09:06:32,558 - INFO - Pop model from output_nospa/epoch_75 2022-07-25 09:06:32,727 - INFO - Push model to checkpoint output_nospa/epoch_100 2022-07-25 09:07:12,374 - INFO - [TRAIN] epoch=101/160, iter=11610/18560, loss=3.046828, lr=0.006902 | ETA 02:28:56 2022-07-25 09:07:24,740 - INFO - [TRAIN] epoch=101/160, iter=11620/18560, loss=3.363029, lr=0.006889 | ETA 02:23:10 2022-07-25 09:07:37,320 - INFO - [TRAIN] epoch=101/160, iter=11630/18560, loss=3.073394, lr=0.006876 | ETA 02:25:46 2022-07-25 09:07:49,761 - INFO - [TRAIN] epoch=101/160, iter=11640/18560, loss=3.400119, lr=0.006863 | ETA 02:23:38 2022-07-25 09:08:02,213 - INFO - [TRAIN] epoch=101/160, iter=11650/18560, loss=3.147130, lr=0.006849 | ETA 
02:23:09 2022-07-25 09:08:14,772 - INFO - [TRAIN] epoch=101/160, iter=11660/18560, loss=2.962882, lr=0.006836 | ETA 02:23:52 2022-07-25 09:08:27,167 - INFO - [TRAIN] epoch=101/160, iter=11670/18560, loss=3.027799, lr=0.006823 | ETA 02:24:33 2022-07-25 09:08:39,641 - INFO - [TRAIN] epoch=101/160, iter=11680/18560, loss=3.163370, lr=0.006810 | ETA 02:23:57 2022-07-25 09:08:52,011 - INFO - [TRAIN] epoch=101/160, iter=11690/18560, loss=3.055327, lr=0.006797 | ETA 02:21:57 2022-07-25 09:09:04,166 - INFO - [TRAIN] epoch=101/160, iter=11700/18560, loss=3.122769, lr=0.006784 | ETA 02:20:24 2022-07-25 09:09:16,437 - INFO - [TRAIN] epoch=101/160, iter=11710/18560, loss=3.326756, lr=0.006771 | ETA 02:20:58 2022-07-25 09:09:56,198 - INFO - [TRAIN] epoch=102/160, iter=11720/18560, loss=3.003887, lr=0.006757 | ETA 05:35:46 2022-07-25 09:10:08,587 - INFO - [TRAIN] epoch=102/160, iter=11730/18560, loss=2.864175, lr=0.006744 | ETA 02:20:01 2022-07-25 09:10:21,040 - INFO - [TRAIN] epoch=102/160, iter=11740/18560, loss=2.856811, lr=0.006731 | ETA 02:21:54 2022-07-25 09:10:33,261 - INFO - [TRAIN] epoch=102/160, iter=11750/18560, loss=3.039739, lr=0.006718 | ETA 02:17:20 2022-07-25 09:10:45,740 - INFO - [TRAIN] epoch=102/160, iter=11760/18560, loss=3.211318, lr=0.006704 | ETA 02:22:58 2022-07-25 09:10:58,281 - INFO - [TRAIN] epoch=102/160, iter=11770/18560, loss=3.017108, lr=0.006691 | ETA 02:22:32 2022-07-25 09:11:10,814 - INFO - [TRAIN] epoch=102/160, iter=11780/18560, loss=3.027817, lr=0.006678 | ETA 02:20:59 2022-07-25 09:11:23,161 - INFO - [TRAIN] epoch=102/160, iter=11790/18560, loss=3.280376, lr=0.006665 | ETA 02:16:49 2022-07-25 09:11:35,315 - INFO - [TRAIN] epoch=102/160, iter=11800/18560, loss=3.206937, lr=0.006651 | ETA 02:16:44 2022-07-25 09:11:47,605 - INFO - [TRAIN] epoch=102/160, iter=11810/18560, loss=3.048560, lr=0.006638 | ETA 02:18:41 2022-07-25 09:11:59,754 - INFO - [TRAIN] epoch=102/160, iter=11820/18560, loss=3.073536, lr=0.006625 | ETA 02:16:58 2022-07-25 
09:12:11,926 - INFO - [TRAIN] epoch=102/160, iter=11830/18560, loss=3.211591, lr=0.006611 | ETA 02:15:40 2022-07-25 09:12:52,157 - INFO - [TRAIN] epoch=103/160, iter=11840/18560, loss=3.123532, lr=0.006598 | ETA 02:31:50 2022-07-25 09:13:04,649 - INFO - [TRAIN] epoch=103/160, iter=11850/18560, loss=2.835950, lr=0.006585 | ETA 02:18:29 2022-07-25 09:13:16,938 - INFO - [TRAIN] epoch=103/160, iter=11860/18560, loss=3.120294, lr=0.006571 | ETA 02:21:44 2022-07-25 09:13:29,322 - INFO - [TRAIN] epoch=103/160, iter=11870/18560, loss=3.006298, lr=0.006558 | ETA 02:17:36 2022-07-25 09:13:41,717 - INFO - [TRAIN] epoch=103/160, iter=11880/18560, loss=3.093966, lr=0.006544 | ETA 02:16:29 2022-07-25 09:13:54,105 - INFO - [TRAIN] epoch=103/160, iter=11890/18560, loss=2.660739, lr=0.006531 | ETA 02:15:51 2022-07-25 09:14:06,310 - INFO - [TRAIN] epoch=103/160, iter=11900/18560, loss=3.157116, lr=0.006517 | ETA 02:17:00 2022-07-25 09:14:18,563 - INFO - [TRAIN] epoch=103/160, iter=11910/18560, loss=2.988012, lr=0.006504 | ETA 02:16:15 2022-07-25 09:14:30,960 - INFO - [TRAIN] epoch=103/160, iter=11920/18560, loss=2.940656, lr=0.006491 | ETA 02:16:53 2022-07-25 09:14:43,325 - INFO - [TRAIN] epoch=103/160, iter=11930/18560, loss=3.257174, lr=0.006477 | ETA 02:16:03 2022-07-25 09:14:55,835 - INFO - [TRAIN] epoch=103/160, iter=11940/18560, loss=3.192298, lr=0.006464 | ETA 02:19:06 2022-07-25 09:15:35,709 - INFO - [TRAIN] epoch=104/160, iter=11950/18560, loss=2.881410, lr=0.006450 | ETA 15:03:26 2022-07-25 09:15:47,965 - INFO - [TRAIN] epoch=104/160, iter=11960/18560, loss=3.134841, lr=0.006437 | ETA 02:18:12 2022-07-25 09:16:00,185 - INFO - [TRAIN] epoch=104/160, iter=11970/18560, loss=3.108372, lr=0.006423 | ETA 02:13:01 2022-07-25 09:16:12,699 - INFO - [TRAIN] epoch=104/160, iter=11980/18560, loss=2.853607, lr=0.006410 | ETA 02:15:37 2022-07-25 09:16:24,961 - INFO - [TRAIN] epoch=104/160, iter=11990/18560, loss=3.220882, lr=0.006396 | ETA 02:13:05 2022-07-25 09:16:37,152 - INFO - 
[TRAIN] epoch=104/160, iter=12000/18560, loss=3.095835, lr=0.006382 | ETA 02:12:02 2022-07-25 09:16:49,399 - INFO - [TRAIN] epoch=104/160, iter=12010/18560, loss=2.753338, lr=0.006369 | ETA 02:13:34 2022-07-25 09:17:01,593 - INFO - [TRAIN] epoch=104/160, iter=12020/18560, loss=2.975628, lr=0.006355 | ETA 02:11:26 2022-07-25 09:17:13,828 - INFO - [TRAIN] epoch=104/160, iter=12030/18560, loss=3.219790, lr=0.006342 | ETA 02:14:03 2022-07-25 09:17:26,097 - INFO - [TRAIN] epoch=104/160, iter=12040/18560, loss=2.993219, lr=0.006328 | ETA 02:13:16 2022-07-25 09:17:38,211 - INFO - [TRAIN] epoch=104/160, iter=12050/18560, loss=3.248033, lr=0.006315 | ETA 02:11:56 2022-07-25 09:17:50,440 - INFO - [TRAIN] epoch=104/160, iter=12060/18560, loss=2.927231, lr=0.006301 | ETA 02:12:00 2022-07-25 09:18:31,278 - INFO - [TRAIN] epoch=105/160, iter=12070/18560, loss=3.069689, lr=0.006287 | ETA 03:03:59 2022-07-25 09:18:43,795 - INFO - [TRAIN] epoch=105/160, iter=12080/18560, loss=2.912500, lr=0.006274 | ETA 02:11:53 2022-07-25 09:18:56,153 - INFO - [TRAIN] epoch=105/160, iter=12090/18560, loss=3.016253, lr=0.006260 | ETA 02:12:45 2022-07-25 09:19:08,449 - INFO - [TRAIN] epoch=105/160, iter=12100/18560, loss=3.074001, lr=0.006246 | ETA 02:12:04 2022-07-25 09:19:20,617 - INFO - [TRAIN] epoch=105/160, iter=12110/18560, loss=2.964130, lr=0.006233 | ETA 02:10:01 2022-07-25 09:19:32,749 - INFO - [TRAIN] epoch=105/160, iter=12120/18560, loss=2.788512, lr=0.006219 | ETA 02:09:38 2022-07-25 09:19:45,129 - INFO - [TRAIN] epoch=105/160, iter=12130/18560, loss=2.914119, lr=0.006205 | ETA 02:13:23 2022-07-25 09:19:57,509 - INFO - [TRAIN] epoch=105/160, iter=12140/18560, loss=3.281083, lr=0.006192 | ETA 02:11:06 2022-07-25 09:20:10,054 - INFO - [TRAIN] epoch=105/160, iter=12150/18560, loss=3.117263, lr=0.006178 | ETA 02:14:15 2022-07-25 09:20:22,533 - INFO - [TRAIN] epoch=105/160, iter=12160/18560, loss=3.066859, lr=0.006164 | ETA 02:15:40 2022-07-25 09:20:34,998 - INFO - [TRAIN] epoch=105/160, 
iter=12170/18560, loss=2.735717, lr=0.006151 | ETA 02:13:00 2022-07-25 09:20:47,525 - INFO - [TRAIN] epoch=105/160, iter=12180/18560, loss=3.213688, lr=0.006137 | ETA 02:12:29 2022-07-25 09:20:47,547 - INFO - Pop model from output_nospa/epoch_80 2022-07-25 09:20:47,769 - INFO - Push model to checkpoint output_nospa/epoch_105 2022-07-25 09:21:30,046 - INFO - [TRAIN] epoch=106/160, iter=12190/18560, loss=3.117693, lr=0.006123 | ETA 02:13:43 2022-07-25 09:21:42,284 - INFO - [TRAIN] epoch=106/160, iter=12200/18560, loss=3.061020, lr=0.006109 | ETA 02:10:45 2022-07-25 09:21:54,563 - INFO - [TRAIN] epoch=106/160, iter=12210/18560, loss=2.698622, lr=0.006096 | ETA 02:08:23 2022-07-25 09:22:06,724 - INFO - [TRAIN] epoch=106/160, iter=12220/18560, loss=3.469048, lr=0.006082 | ETA 02:08:31 2022-07-25 09:22:19,174 - INFO - [TRAIN] epoch=106/160, iter=12230/18560, loss=2.936943, lr=0.006068 | ETA 02:10:08 2022-07-25 09:22:31,349 - INFO - [TRAIN] epoch=106/160, iter=12240/18560, loss=2.782640, lr=0.006054 | ETA 02:08:26 2022-07-25 09:22:43,550 - INFO - [TRAIN] epoch=106/160, iter=12250/18560, loss=3.021471, lr=0.006040 | ETA 02:10:57 2022-07-25 09:22:55,921 - INFO - [TRAIN] epoch=106/160, iter=12260/18560, loss=3.064545, lr=0.006027 | ETA 02:10:53 2022-07-25 09:23:08,209 - INFO - [TRAIN] epoch=106/160, iter=12270/18560, loss=3.222048, lr=0.006013 | ETA 02:08:37 2022-07-25 09:23:20,401 - INFO - [TRAIN] epoch=106/160, iter=12280/18560, loss=3.190164, lr=0.005999 | ETA 02:08:12 2022-07-25 09:23:32,901 - INFO - [TRAIN] epoch=106/160, iter=12290/18560, loss=3.022629, lr=0.005985 | ETA 02:09:32 2022-07-25 09:24:14,403 - INFO - [TRAIN] epoch=107/160, iter=12300/18560, loss=3.011619, lr=0.005971 | ETA 05:20:42 2022-07-25 09:24:26,893 - INFO - [TRAIN] epoch=107/160, iter=12310/18560, loss=2.928687, lr=0.005958 | ETA 02:05:59 2022-07-25 09:24:39,103 - INFO - [TRAIN] epoch=107/160, iter=12320/18560, loss=2.763550, lr=0.005944 | ETA 02:05:19 2022-07-25 09:24:51,236 - INFO - [TRAIN] 
epoch=107/160, iter=12330/18560, loss=2.851703, lr=0.005930 | ETA 02:05:08 2022-07-25 09:25:03,482 - INFO - [TRAIN] epoch=107/160, iter=12340/18560, loss=3.097929, lr=0.005916 | ETA 02:06:30 2022-07-25 09:25:15,906 - INFO - [TRAIN] epoch=107/160, iter=12350/18560, loss=2.696522, lr=0.005902 | ETA 02:08:49 2022-07-25 09:25:28,196 - INFO - [TRAIN] epoch=107/160, iter=12360/18560, loss=2.797340, lr=0.005888 | ETA 02:04:50 2022-07-25 09:25:40,314 - INFO - [TRAIN] epoch=107/160, iter=12370/18560, loss=2.988560, lr=0.005874 | ETA 02:06:32 2022-07-25 09:25:52,495 - INFO - [TRAIN] epoch=107/160, iter=12380/18560, loss=3.190410, lr=0.005860 | ETA 02:05:26 2022-07-25 09:26:04,798 - INFO - [TRAIN] epoch=107/160, iter=12390/18560, loss=2.876551, lr=0.005847 | ETA 02:06:01 2022-07-25 09:26:16,866 - INFO - [TRAIN] epoch=107/160, iter=12400/18560, loss=2.849775, lr=0.005833 | ETA 02:05:11 2022-07-25 09:26:29,016 - INFO - [TRAIN] epoch=107/160, iter=12410/18560, loss=2.961748, lr=0.005819 | ETA 02:03:21 2022-07-25 09:27:11,932 - INFO - [TRAIN] epoch=108/160, iter=12420/18560, loss=3.193575, lr=0.005805 | ETA 02:19:20 2022-07-25 09:27:24,390 - INFO - [TRAIN] epoch=108/160, iter=12430/18560, loss=3.038968, lr=0.005791 | ETA 02:07:45 2022-07-25 09:27:36,555 - INFO - [TRAIN] epoch=108/160, iter=12440/18560, loss=2.662687, lr=0.005777 | ETA 02:05:06 2022-07-25 09:27:48,877 - INFO - [TRAIN] epoch=108/160, iter=12450/18560, loss=3.030926, lr=0.005763 | ETA 02:04:33 2022-07-25 09:28:01,045 - INFO - [TRAIN] epoch=108/160, iter=12460/18560, loss=2.827563, lr=0.005749 | ETA 02:04:43 2022-07-25 09:28:13,462 - INFO - [TRAIN] epoch=108/160, iter=12470/18560, loss=2.953290, lr=0.005735 | ETA 02:06:04 2022-07-25 09:28:26,000 - INFO - [TRAIN] epoch=108/160, iter=12480/18560, loss=2.776870, lr=0.005721 | ETA 02:08:35 2022-07-25 09:28:38,525 - INFO - [TRAIN] epoch=108/160, iter=12490/18560, loss=3.018478, lr=0.005707 | ETA 02:06:49 2022-07-25 09:28:50,959 - INFO - [TRAIN] epoch=108/160, 
iter=12500/18560, loss=3.258904, lr=0.005693 | ETA 02:05:31 2022-07-25 09:29:03,444 - INFO - [TRAIN] epoch=108/160, iter=12510/18560, loss=2.855791, lr=0.005679 | ETA 02:05:23 2022-07-25 09:29:15,772 - INFO - [TRAIN] epoch=108/160, iter=12520/18560, loss=3.191514, lr=0.005665 | ETA 02:05:12 2022-07-25 09:29:56,837 - INFO - [TRAIN] epoch=109/160, iter=12530/18560, loss=3.126360, lr=0.005651 | ETA 14:10:32 2022-07-25 09:30:09,478 - INFO - [TRAIN] epoch=109/160, iter=12540/18560, loss=3.066236, lr=0.005637 | ETA 02:08:15 2022-07-25 09:30:21,815 - INFO - [TRAIN] epoch=109/160, iter=12550/18560, loss=2.903769, lr=0.005623 | ETA 02:01:22 2022-07-25 09:30:34,051 - INFO - [TRAIN] epoch=109/160, iter=12560/18560, loss=3.054416, lr=0.005609 | ETA 02:02:52 2022-07-25 09:30:46,377 - INFO - [TRAIN] epoch=109/160, iter=12570/18560, loss=3.022200, lr=0.005595 | ETA 02:02:04 2022-07-25 09:30:58,604 - INFO - [TRAIN] epoch=109/160, iter=12580/18560, loss=3.145201, lr=0.005581 | ETA 02:02:36 2022-07-25 09:31:10,817 - INFO - [TRAIN] epoch=109/160, iter=12590/18560, loss=3.161731, lr=0.005567 | ETA 02:02:21 2022-07-25 09:31:23,023 - INFO - [TRAIN] epoch=109/160, iter=12600/18560, loss=2.982542, lr=0.005553 | ETA 01:59:43 2022-07-25 09:31:35,266 - INFO - [TRAIN] epoch=109/160, iter=12610/18560, loss=3.001334, lr=0.005539 | ETA 02:01:30 2022-07-25 09:31:47,353 - INFO - [TRAIN] epoch=109/160, iter=12620/18560, loss=2.937108, lr=0.005525 | ETA 01:58:23 2022-07-25 09:31:59,594 - INFO - [TRAIN] epoch=109/160, iter=12630/18560, loss=2.926286, lr=0.005511 | ETA 02:00:17 2022-07-25 09:32:11,664 - INFO - [TRAIN] epoch=109/160, iter=12640/18560, loss=2.975343, lr=0.005497 | ETA 01:57:15 2022-07-25 09:32:53,217 - INFO - [TRAIN] epoch=110/160, iter=12650/18560, loss=3.171241, lr=0.005483 | ETA 02:50:04 2022-07-25 09:33:05,973 - INFO - [TRAIN] epoch=110/160, iter=12660/18560, loss=2.670820, lr=0.005469 | ETA 02:06:36 2022-07-25 09:33:18,519 - INFO - [TRAIN] epoch=110/160, iter=12670/18560, 
loss=2.942844, lr=0.005455 | ETA 02:01:30 2022-07-25 09:33:30,781 - INFO - [TRAIN] epoch=110/160, iter=12680/18560, loss=2.959691, lr=0.005441 | ETA 01:59:42 2022-07-25 09:33:43,034 - INFO - [TRAIN] epoch=110/160, iter=12690/18560, loss=2.960634, lr=0.005427 | ETA 02:00:13 2022-07-25 09:33:55,233 - INFO - [TRAIN] epoch=110/160, iter=12700/18560, loss=2.997308, lr=0.005413 | ETA 01:59:36 2022-07-25 09:34:07,472 - INFO - [TRAIN] epoch=110/160, iter=12710/18560, loss=2.954322, lr=0.005399 | ETA 01:58:32 2022-07-25 09:34:19,626 - INFO - [TRAIN] epoch=110/160, iter=12720/18560, loss=3.025868, lr=0.005385 | ETA 01:58:02 2022-07-25 09:34:31,908 - INFO - [TRAIN] epoch=110/160, iter=12730/18560, loss=3.061494, lr=0.005371 | ETA 01:58:21 2022-07-25 09:34:44,124 - INFO - [TRAIN] epoch=110/160, iter=12740/18560, loss=2.938634, lr=0.005357 | ETA 01:57:26 2022-07-25 09:34:56,332 - INFO - [TRAIN] epoch=110/160, iter=12750/18560, loss=2.838875, lr=0.005343 | ETA 01:56:48 2022-07-25 09:35:08,549 - INFO - [TRAIN] epoch=110/160, iter=12760/18560, loss=2.728631, lr=0.005328 | ETA 01:57:45 2022-07-25 09:35:08,568 - INFO - Pop model from output_nospa/epoch_85 2022-07-25 09:35:08,722 - INFO - Push model to checkpoint output_nospa/epoch_110 2022-07-25 09:35:51,194 - INFO - [TRAIN] epoch=111/160, iter=12770/18560, loss=3.058146, lr=0.005314 | ETA 02:04:44 2022-07-25 09:36:03,833 - INFO - [TRAIN] epoch=111/160, iter=12780/18560, loss=2.861742, lr=0.005300 | ETA 02:01:48 2022-07-25 09:36:16,312 - INFO - [TRAIN] epoch=111/160, iter=12790/18560, loss=2.926597, lr=0.005286 | ETA 01:57:15 2022-07-25 09:36:28,622 - INFO - [TRAIN] epoch=111/160, iter=12800/18560, loss=3.272033, lr=0.005272 | ETA 02:00:20 2022-07-25 09:36:41,027 - INFO - [TRAIN] epoch=111/160, iter=12810/18560, loss=3.018439, lr=0.005258 | ETA 01:59:19 2022-07-25 09:36:53,564 - INFO - [TRAIN] epoch=111/160, iter=12820/18560, loss=3.121690, lr=0.005244 | ETA 01:57:48 2022-07-25 09:37:05,965 - INFO - [TRAIN] epoch=111/160, 
iter=12830/18560, loss=2.804519, lr=0.005230 | ETA 01:58:44 2022-07-25 09:37:18,071 - INFO - [TRAIN] epoch=111/160, iter=12840/18560, loss=3.016219, lr=0.005216 | ETA 01:54:49 2022-07-25 09:37:30,261 - INFO - [TRAIN] epoch=111/160, iter=12850/18560, loss=2.835923, lr=0.005202 | ETA 01:54:32 2022-07-25 09:37:42,492 - INFO - [TRAIN] epoch=111/160, iter=12860/18560, loss=2.949499, lr=0.005188 | ETA 01:58:21 2022-07-25 09:37:54,687 - INFO - [TRAIN] epoch=111/160, iter=12870/18560, loss=3.007155, lr=0.005174 | ETA 01:55:55 2022-07-25 09:38:37,154 - INFO - [TRAIN] epoch=112/160, iter=12880/18560, loss=3.026572, lr=0.005159 | ETA 04:57:12 2022-07-25 09:38:49,579 - INFO - [TRAIN] epoch=112/160, iter=12890/18560, loss=2.867600, lr=0.005145 | ETA 01:55:55 2022-07-25 09:39:01,905 - INFO - [TRAIN] epoch=112/160, iter=12900/18560, loss=2.964299, lr=0.005131 | ETA 01:58:04 2022-07-25 09:39:14,341 - INFO - [TRAIN] epoch=112/160, iter=12910/18560, loss=2.858211, lr=0.005117 | ETA 01:57:33 2022-07-25 09:39:26,586 - INFO - [TRAIN] epoch=112/160, iter=12920/18560, loss=2.879121, lr=0.005103 | ETA 01:55:27 2022-07-25 09:39:38,830 - INFO - [TRAIN] epoch=112/160, iter=12930/18560, loss=2.892757, lr=0.005089 | ETA 01:53:50 2022-07-25 09:39:51,135 - INFO - [TRAIN] epoch=112/160, iter=12940/18560, loss=3.358923, lr=0.005075 | ETA 01:57:04 2022-07-25 09:40:03,539 - INFO - [TRAIN] epoch=112/160, iter=12950/18560, loss=2.913265, lr=0.005061 | ETA 01:55:41 2022-07-25 09:40:16,001 - INFO - [TRAIN] epoch=112/160, iter=12960/18560, loss=3.051701, lr=0.005047 | ETA 01:54:58 2022-07-25 09:40:28,239 - INFO - [TRAIN] epoch=112/160, iter=12970/18560, loss=3.023029, lr=0.005032 | ETA 01:54:30 2022-07-25 09:40:40,422 - INFO - [TRAIN] epoch=112/160, iter=12980/18560, loss=2.744194, lr=0.005018 | ETA 01:53:26 2022-07-25 09:40:52,825 - INFO - [TRAIN] epoch=112/160, iter=12990/18560, loss=2.928500, lr=0.005004 | ETA 01:53:38 2022-07-25 09:41:33,892 - INFO - [TRAIN] epoch=113/160, iter=13000/18560, 
loss=2.998898, lr=0.004990 | ETA 02:06:00 2022-07-25 09:41:46,365 - INFO - [TRAIN] epoch=113/160, iter=13010/18560, loss=2.802120, lr=0.004976 | ETA 01:55:04 2022-07-25 09:41:58,733 - INFO - [TRAIN] epoch=113/160, iter=13020/18560, loss=2.829621, lr=0.004962 | ETA 01:55:46 2022-07-25 09:42:11,070 - INFO - [TRAIN] epoch=113/160, iter=13030/18560, loss=2.756041, lr=0.004948 | ETA 01:52:48 2022-07-25 09:42:23,363 - INFO - [TRAIN] epoch=113/160, iter=13040/18560, loss=2.821954, lr=0.004934 | ETA 01:53:30 2022-07-25 09:42:35,569 - INFO - [TRAIN] epoch=113/160, iter=13050/18560, loss=2.636221, lr=0.004920 | ETA 01:51:19 2022-07-25 09:42:47,943 - INFO - [TRAIN] epoch=113/160, iter=13060/18560, loss=2.954033, lr=0.004906 | ETA 01:52:58 2022-07-25 09:43:00,324 - INFO - [TRAIN] epoch=113/160, iter=13070/18560, loss=3.069132, lr=0.004891 | ETA 01:53:31 2022-07-25 09:43:12,807 - INFO - [TRAIN] epoch=113/160, iter=13080/18560, loss=3.144830, lr=0.004877 | ETA 01:54:32 2022-07-25 09:43:25,236 - INFO - [TRAIN] epoch=113/160, iter=13090/18560, loss=2.918121, lr=0.004863 | ETA 01:52:26 2022-07-25 09:43:37,713 - INFO - [TRAIN] epoch=113/160, iter=13100/18560, loss=2.711970, lr=0.004849 | ETA 01:53:21 2022-07-25 09:44:18,460 - INFO - [TRAIN] epoch=114/160, iter=13110/18560, loss=2.845116, lr=0.004835 | ETA 12:35:30 2022-07-25 09:44:31,074 - INFO - [TRAIN] epoch=114/160, iter=13120/18560, loss=2.777922, lr=0.004821 | ETA 01:54:42 2022-07-25 09:44:43,519 - INFO - [TRAIN] epoch=114/160, iter=13130/18560, loss=2.789223, lr=0.004807 | ETA 01:51:31 2022-07-25 09:44:55,773 - INFO - [TRAIN] epoch=114/160, iter=13140/18560, loss=2.912534, lr=0.004793 | ETA 01:50:41 2022-07-25 09:45:08,061 - INFO - [TRAIN] epoch=114/160, iter=13150/18560, loss=2.871362, lr=0.004779 | ETA 01:52:27 2022-07-25 09:45:20,165 - INFO - [TRAIN] epoch=114/160, iter=13160/18560, loss=2.937797, lr=0.004765 | ETA 01:49:49 2022-07-25 09:45:32,361 - INFO - [TRAIN] epoch=114/160, iter=13170/18560, loss=2.795305, lr=0.004750 
| ETA 01:49:45 2022-07-25 09:45:44,615 - INFO - [TRAIN] epoch=114/160, iter=13180/18560, loss=2.846329, lr=0.004736 | ETA 01:50:18 2022-07-25 09:45:57,009 - INFO - [TRAIN] epoch=114/160, iter=13190/18560, loss=3.098463, lr=0.004722 | ETA 01:50:28 2022-07-25 09:46:09,420 - INFO - [TRAIN] epoch=114/160, iter=13200/18560, loss=2.817074, lr=0.004708 | ETA 01:51:13 2022-07-25 09:46:21,925 - INFO - [TRAIN] epoch=114/160, iter=13210/18560, loss=3.113898, lr=0.004694 | ETA 01:51:31 2022-07-25 09:46:34,419 - INFO - [TRAIN] epoch=114/160, iter=13220/18560, loss=2.752725, lr=0.004680 | ETA 01:51:14 2022-07-25 09:47:15,279 - INFO - [TRAIN] epoch=115/160, iter=13230/18560, loss=3.423265, lr=0.004666 | ETA 02:31:04 2022-07-25 09:47:27,835 - INFO - [TRAIN] epoch=115/160, iter=13240/18560, loss=2.967497, lr=0.004652 | ETA 01:51:56 2022-07-25 09:47:40,401 - INFO - [TRAIN] epoch=115/160, iter=13250/18560, loss=2.787879, lr=0.004638 | ETA 01:51:25 2022-07-25 09:47:53,010 - INFO - [TRAIN] epoch=115/160, iter=13260/18560, loss=3.012979, lr=0.004624 | ETA 01:48:32 2022-07-25 09:48:05,234 - INFO - [TRAIN] epoch=115/160, iter=13270/18560, loss=2.833176, lr=0.004610 | ETA 01:46:16 2022-07-25 09:48:17,350 - INFO - [TRAIN] epoch=115/160, iter=13280/18560, loss=2.768294, lr=0.004596 | ETA 01:46:22 2022-07-25 09:48:29,519 - INFO - [TRAIN] epoch=115/160, iter=13290/18560, loss=2.732734, lr=0.004582 | ETA 01:46:04 2022-07-25 09:48:41,881 - INFO - [TRAIN] epoch=115/160, iter=13300/18560, loss=3.052545, lr=0.004568 | ETA 01:50:00 2022-07-25 09:48:54,392 - INFO - [TRAIN] epoch=115/160, iter=13310/18560, loss=3.076243, lr=0.004554 | ETA 01:49:51 2022-07-25 09:49:06,682 - INFO - [TRAIN] epoch=115/160, iter=13320/18560, loss=2.771562, lr=0.004539 | ETA 01:45:58 2022-07-25 09:49:19,112 - INFO - [TRAIN] epoch=115/160, iter=13330/18560, loss=3.028786, lr=0.004525 | ETA 01:51:55 2022-07-25 09:49:31,316 - INFO - [TRAIN] epoch=115/160, iter=13340/18560, loss=3.000511, lr=0.004511 | ETA 01:46:10 2022-07-25 
09:49:31,335 - INFO - Pop model from output_nospa/epoch_90 2022-07-25 09:49:31,516 - INFO - Push model to checkpoint output_nospa/epoch_115 2022-07-25 09:50:14,102 - INFO - [TRAIN] epoch=116/160, iter=13350/18560, loss=3.228206, lr=0.004497 | ETA 02:02:08 2022-07-25 09:50:27,780 - INFO - [TRAIN] epoch=116/160, iter=13360/18560, loss=2.591483, lr=0.004483 | ETA 02:01:26 2022-07-25 09:50:41,533 - INFO - [TRAIN] epoch=116/160, iter=13370/18560, loss=2.765433, lr=0.004469 | ETA 01:58:17 2022-07-25 09:50:55,920 - INFO - [TRAIN] epoch=116/160, iter=13380/18560, loss=2.708510, lr=0.004455 | ETA 02:12:00 2022-07-25 09:51:09,821 - INFO - [TRAIN] epoch=116/160, iter=13390/18560, loss=2.800878, lr=0.004441 | ETA 01:58:52 2022-07-25 09:51:23,785 - INFO - [TRAIN] epoch=116/160, iter=13400/18560, loss=2.617322, lr=0.004427 | ETA 02:00:32 2022-07-25 09:51:37,741 - INFO - [TRAIN] epoch=116/160, iter=13410/18560, loss=2.806715, lr=0.004413 | ETA 02:03:19 2022-07-25 09:51:51,792 - INFO - [TRAIN] epoch=116/160, iter=13420/18560, loss=3.055010, lr=0.004399 | ETA 02:02:39 2022-07-25 09:52:05,629 - INFO - [TRAIN] epoch=116/160, iter=13430/18560, loss=2.878325, lr=0.004385 | ETA 01:57:50 2022-07-25 09:52:19,430 - INFO - [TRAIN] epoch=116/160, iter=13440/18560, loss=2.704388, lr=0.004371 | ETA 01:59:12 2022-07-25 09:52:33,412 - INFO - [TRAIN] epoch=116/160, iter=13450/18560, loss=2.838555, lr=0.004357 | ETA 01:57:37 2022-07-25 09:53:23,516 - INFO - [TRAIN] epoch=117/160, iter=13460/18560, loss=2.884025, lr=0.004343 | ETA 05:10:37 2022-07-25 09:53:37,791 - INFO - [TRAIN] epoch=117/160, iter=13470/18560, loss=2.841584, lr=0.004329 | ETA 01:58:27 2022-07-25 09:53:51,902 - INFO - [TRAIN] epoch=117/160, iter=13480/18560, loss=2.625488, lr=0.004315 | ETA 01:58:28 2022-07-25 09:54:05,916 - INFO - [TRAIN] epoch=117/160, iter=13490/18560, loss=2.651875, lr=0.004301 | ETA 01:55:22 2022-07-25 09:54:19,733 - INFO - [TRAIN] epoch=117/160, iter=13500/18560, loss=2.900291, lr=0.004287 | ETA 01:56:45 
2022-07-25 09:54:33,621 - INFO - [TRAIN] epoch=117/160, iter=13510/18560, loss=2.614683, lr=0.004273 | ETA 01:58:23 2022-07-25 09:54:47,296 - INFO - [TRAIN] epoch=117/160, iter=13520/18560, loss=2.696587, lr=0.004259 | ETA 01:54:58 2022-07-25 09:55:01,267 - INFO - [TRAIN] epoch=117/160, iter=13530/18560, loss=3.048158, lr=0.004245 | ETA 01:57:55 2022-07-25 09:55:15,285 - INFO - [TRAIN] epoch=117/160, iter=13540/18560, loss=3.103744, lr=0.004232 | ETA 01:58:10 2022-07-25 09:55:29,169 - INFO - [TRAIN] epoch=117/160, iter=13550/18560, loss=2.753805, lr=0.004218 | ETA 01:54:10 2022-07-25 09:55:42,976 - INFO - [TRAIN] epoch=117/160, iter=13560/18560, loss=2.764100, lr=0.004204 | ETA 01:54:29 2022-07-25 09:55:56,670 - INFO - [TRAIN] epoch=117/160, iter=13570/18560, loss=2.833183, lr=0.004190 | ETA 01:54:20 2022-07-25 09:56:45,126 - INFO - [TRAIN] epoch=118/160, iter=13580/18560, loss=3.083548, lr=0.004176 | ETA 02:08:21 2022-07-25 09:56:58,970 - INFO - [TRAIN] epoch=118/160, iter=13590/18560, loss=2.734642, lr=0.004162 | ETA 01:53:34 2022-07-25 09:57:12,726 - INFO - [TRAIN] epoch=118/160, iter=13600/18560, loss=2.721396, lr=0.004148 | ETA 01:52:57 2022-07-25 09:57:26,619 - INFO - [TRAIN] epoch=118/160, iter=13610/18560, loss=2.684975, lr=0.004134 | ETA 01:54:58 2022-07-25 09:57:40,450 - INFO - [TRAIN] epoch=118/160, iter=13620/18560, loss=2.974143, lr=0.004120 | ETA 01:55:10 2022-07-25 09:57:54,363 - INFO - [TRAIN] epoch=118/160, iter=13630/18560, loss=2.936585, lr=0.004106 | ETA 01:54:02 2022-07-25 09:58:08,314 - INFO - [TRAIN] epoch=118/160, iter=13640/18560, loss=2.906258, lr=0.004092 | ETA 01:54:06 2022-07-25 09:58:22,134 - INFO - [TRAIN] epoch=118/160, iter=13650/18560, loss=3.066869, lr=0.004079 | ETA 01:53:41 2022-07-25 09:58:35,996 - INFO - [TRAIN] epoch=118/160, iter=13660/18560, loss=2.937614, lr=0.004065 | ETA 01:54:04 2022-07-25 09:58:49,908 - INFO - [TRAIN] epoch=118/160, iter=13670/18560, loss=2.779646, lr=0.004051 | ETA 01:53:43 2022-07-25 09:59:03,631 - 
INFO - [TRAIN] epoch=118/160, iter=13680/18560, loss=2.940316, lr=0.004037 | ETA 01:50:55 2022-07-25 09:59:50,390 - INFO - [TRAIN] epoch=119/160, iter=13690/18560, loss=2.777801, lr=0.004023 | ETA 13:05:05 2022-07-25 10:00:04,511 - INFO - [TRAIN] epoch=119/160, iter=13700/18560, loss=2.712643, lr=0.004009 | ETA 01:56:57 2022-07-25 10:00:18,933 - INFO - [TRAIN] epoch=119/160, iter=13710/18560, loss=2.489337, lr=0.003996 | ETA 01:57:06 2022-07-25 10:00:32,960 - INFO - [TRAIN] epoch=119/160, iter=13720/18560, loss=2.893882, lr=0.003982 | ETA 01:51:46 2022-07-25 10:00:46,946 - INFO - [TRAIN] epoch=119/160, iter=13730/18560, loss=2.893421, lr=0.003968 | ETA 01:53:16 2022-07-25 10:01:00,732 - INFO - [TRAIN] epoch=119/160, iter=13740/18560, loss=2.778087, lr=0.003954 | ETA 01:51:23 2022-07-25 10:01:14,601 - INFO - [TRAIN] epoch=119/160, iter=13750/18560, loss=2.704439, lr=0.003940 | ETA 01:51:06 2022-07-25 10:01:28,564 - INFO - [TRAIN] epoch=119/160, iter=13760/18560, loss=2.867511, lr=0.003927 | ETA 01:51:40 2022-07-25 10:01:42,473 - INFO - [TRAIN] epoch=119/160, iter=13770/18560, loss=2.976455, lr=0.003913 | ETA 01:51:42 2022-07-25 10:01:56,522 - INFO - [TRAIN] epoch=119/160, iter=13780/18560, loss=2.775675, lr=0.003899 | ETA 01:50:44 2022-07-25 10:02:10,276 - INFO - [TRAIN] epoch=119/160, iter=13790/18560, loss=2.691551, lr=0.003885 | ETA 01:49:54 2022-07-25 10:02:24,098 - INFO - [TRAIN] epoch=119/160, iter=13800/18560, loss=2.661012, lr=0.003872 | ETA 01:50:05 2022-07-25 10:03:11,986 - INFO - [TRAIN] epoch=120/160, iter=13810/18560, loss=2.769362, lr=0.003858 | ETA 02:33:43 2022-07-25 10:03:26,061 - INFO - [TRAIN] epoch=120/160, iter=13820/18560, loss=2.681166, lr=0.003844 | ETA 01:50:48 2022-07-25 10:03:40,146 - INFO - [TRAIN] epoch=120/160, iter=13830/18560, loss=2.930105, lr=0.003830 | ETA 01:51:51 2022-07-25 10:03:54,073 - INFO - [TRAIN] epoch=120/160, iter=13840/18560, loss=2.733546, lr=0.003817 | ETA 01:50:59 2022-07-25 10:04:08,109 - INFO - [TRAIN] 
epoch=120/160, iter=13850/18560, loss=3.017451, lr=0.003803 | ETA 01:47:40 2022-07-25 10:04:21,886 - INFO - [TRAIN] epoch=120/160, iter=13860/18560, loss=2.806630, lr=0.003789 | ETA 01:46:44 2022-07-25 10:04:35,753 - INFO - [TRAIN] epoch=120/160, iter=13870/18560, loss=2.722983, lr=0.003776 | ETA 01:47:47 2022-07-25 10:04:49,734 - INFO - [TRAIN] epoch=120/160, iter=13880/18560, loss=2.843115, lr=0.003762 | ETA 01:48:54 2022-07-25 10:05:03,802 - INFO - [TRAIN] epoch=120/160, iter=13890/18560, loss=3.033002, lr=0.003748 | ETA 01:51:44 2022-07-25 10:05:17,677 - INFO - [TRAIN] epoch=120/160, iter=13900/18560, loss=2.810964, lr=0.003735 | ETA 01:48:41 2022-07-25 10:05:31,404 - INFO - [TRAIN] epoch=120/160, iter=13910/18560, loss=2.746339, lr=0.003721 | ETA 01:47:01 2022-07-25 10:05:45,125 - INFO - [TRAIN] epoch=120/160, iter=13920/18560, loss=2.981041, lr=0.003707 | ETA 01:45:39 2022-07-25 10:05:45,321 - INFO - Pop model from output_nospa/epoch_95 2022-07-25 10:05:45,486 - INFO - Push model to checkpoint output_nospa/epoch_120 2022-07-25 10:06:36,122 - INFO - [TRAIN] epoch=121/160, iter=13930/18560, loss=2.810544, lr=0.003694 | ETA 01:50:41 2022-07-25 10:06:50,123 - INFO - [TRAIN] epoch=121/160, iter=13940/18560, loss=2.599391, lr=0.003680 | ETA 01:46:39 2022-07-25 10:07:04,088 - INFO - [TRAIN] epoch=121/160, iter=13950/18560, loss=2.607915, lr=0.003666 | ETA 01:45:54 2022-07-25 10:07:17,817 - INFO - [TRAIN] epoch=121/160, iter=13960/18560, loss=2.932354, lr=0.003653 | ETA 01:46:36 2022-07-25 10:07:31,572 - INFO - [TRAIN] epoch=121/160, iter=13970/18560, loss=2.694795, lr=0.003639 | ETA 01:43:43 2022-07-25 10:07:45,075 - INFO - [TRAIN] epoch=121/160, iter=13980/18560, loss=2.509564, lr=0.003626 | ETA 01:42:59 2022-07-25 10:07:58,868 - INFO - [TRAIN] epoch=121/160, iter=13990/18560, loss=2.629709, lr=0.003612 | ETA 01:46:02 2022-07-25 10:08:12,544 - INFO - [TRAIN] epoch=121/160, iter=14000/18560, loss=2.817722, lr=0.003599 | ETA 01:45:33 2022-07-25 10:08:26,424 - INFO - 
[TRAIN] epoch=121/160, iter=14010/18560, loss=2.795926, lr=0.003585 | ETA 01:47:06 2022-07-25 10:08:40,267 - INFO - [TRAIN] epoch=121/160, iter=14020/18560, loss=2.835901, lr=0.003572 | ETA 01:43:40 2022-07-25 10:08:54,205 - INFO - [TRAIN] epoch=121/160, iter=14030/18560, loss=2.785043, lr=0.003558 | ETA 01:47:45 2022-07-25 10:09:42,730 - INFO - [TRAIN] epoch=122/160, iter=14040/18560, loss=2.871156, lr=0.003545 | ETA 04:19:18 2022-07-25 10:09:55,348 - INFO - [TRAIN] epoch=122/160, iter=14050/18560, loss=2.682373, lr=0.003531 | ETA 01:35:41 2022-07-25 10:10:07,659 - INFO - [TRAIN] epoch=122/160, iter=14060/18560, loss=2.725378, lr=0.003518 | ETA 01:32:33 2022-07-25 10:10:20,308 - INFO - [TRAIN] epoch=122/160, iter=14070/18560, loss=2.628823, lr=0.003504 | ETA 01:36:29 2022-07-25 10:10:32,800 - INFO - [TRAIN] epoch=122/160, iter=14080/18560, loss=2.983528, lr=0.003491 | ETA 01:32:07 2022-07-25 10:10:45,114 - INFO - [TRAIN] epoch=122/160, iter=14090/18560, loss=2.669528, lr=0.003477 | ETA 01:31:04 2022-07-25 10:10:57,408 - INFO - [TRAIN] epoch=122/160, iter=14100/18560, loss=2.685547, lr=0.003464 | ETA 01:32:13 2022-07-25 10:11:09,880 - INFO - [TRAIN] epoch=122/160, iter=14110/18560, loss=2.899215, lr=0.003450 | ETA 01:32:04 2022-07-25 10:11:22,319 - INFO - [TRAIN] epoch=122/160, iter=14120/18560, loss=2.784200, lr=0.003437 | ETA 01:32:11 2022-07-25 10:11:34,738 - INFO - [TRAIN] epoch=122/160, iter=14130/18560, loss=2.726007, lr=0.003424 | ETA 01:30:45 2022-07-25 10:11:47,219 - INFO - [TRAIN] epoch=122/160, iter=14140/18560, loss=2.791980, lr=0.003410 | ETA 01:31:26 2022-07-25 10:11:59,720 - INFO - [TRAIN] epoch=122/160, iter=14150/18560, loss=2.990901, lr=0.003397 | ETA 01:32:05 2022-07-25 10:12:42,145 - INFO - [TRAIN] epoch=123/160, iter=14160/18560, loss=2.840964, lr=0.003384 | ETA 01:38:39 2022-07-25 10:12:54,609 - INFO - [TRAIN] epoch=123/160, iter=14170/18560, loss=2.701292, lr=0.003370 | ETA 01:29:33 2022-07-25 10:13:06,905 - INFO - [TRAIN] epoch=123/160, 
iter=14180/18560, loss=2.608938, lr=0.003357 | ETA 01:27:50 2022-07-25 10:13:19,223 - INFO - [TRAIN] epoch=123/160, iter=14190/18560, loss=2.910695, lr=0.003344 | ETA 01:29:00 2022-07-25 10:13:31,529 - INFO - [TRAIN] epoch=123/160, iter=14200/18560, loss=2.743278, lr=0.003330 | ETA 01:31:13 2022-07-25 10:13:43,892 - INFO - [TRAIN] epoch=123/160, iter=14210/18560, loss=2.564655, lr=0.003317 | ETA 01:28:13 2022-07-25 10:13:56,261 - INFO - [TRAIN] epoch=123/160, iter=14220/18560, loss=2.629269, lr=0.003304 | ETA 01:29:31 2022-07-25 10:14:08,570 - INFO - [TRAIN] epoch=123/160, iter=14230/18560, loss=2.787069, lr=0.003290 | ETA 01:29:54 2022-07-25 10:14:20,918 - INFO - [TRAIN] epoch=123/160, iter=14240/18560, loss=2.806916, lr=0.003277 | ETA 01:27:37 2022-07-25 10:14:33,177 - INFO - [TRAIN] epoch=123/160, iter=14250/18560, loss=2.768360, lr=0.003264 | ETA 01:28:23 2022-07-25 10:14:45,452 - INFO - [TRAIN] epoch=123/160, iter=14260/18560, loss=2.812455, lr=0.003251 | ETA 01:27:40 2022-07-25 10:15:25,867 - INFO - [TRAIN] epoch=124/160, iter=14270/18560, loss=2.567048, lr=0.003237 | ETA 09:53:18 2022-07-25 10:15:38,443 - INFO - [TRAIN] epoch=124/160, iter=14280/18560, loss=2.740814, lr=0.003224 | ETA 01:30:13 2022-07-25 10:15:50,938 - INFO - [TRAIN] epoch=124/160, iter=14290/18560, loss=2.779139, lr=0.003211 | ETA 01:27:46 2022-07-25 10:16:03,200 - INFO - [TRAIN] epoch=124/160, iter=14300/18560, loss=2.624823, lr=0.003198 | ETA 01:26:15 2022-07-25 10:16:15,419 - INFO - [TRAIN] epoch=124/160, iter=14310/18560, loss=2.688299, lr=0.003185 | ETA 01:27:20 2022-07-25 10:16:27,781 - INFO - [TRAIN] epoch=124/160, iter=14320/18560, loss=2.802276, lr=0.003172 | ETA 01:26:21 2022-07-25 10:16:40,165 - INFO - [TRAIN] epoch=124/160, iter=14330/18560, loss=2.731264, lr=0.003159 | ETA 01:28:14 2022-07-25 10:16:52,676 - INFO - [TRAIN] epoch=124/160, iter=14340/18560, loss=2.748791, lr=0.003145 | ETA 01:28:32 2022-07-25 10:17:05,111 - INFO - [TRAIN] epoch=124/160, iter=14350/18560, 
loss=2.832307, lr=0.003132 | ETA 01:26:49 2022-07-25 10:17:17,386 - INFO - [TRAIN] epoch=124/160, iter=14360/18560, loss=2.740029, lr=0.003119 | ETA 01:25:31 2022-07-25 10:17:29,607 - INFO - [TRAIN] epoch=124/160, iter=14370/18560, loss=2.773217, lr=0.003106 | ETA 01:24:34 2022-07-25 10:17:41,854 - INFO - [TRAIN] epoch=124/160, iter=14380/18560, loss=2.637900, lr=0.003093 | ETA 01:26:14 2022-07-25 10:18:22,207 - INFO - [TRAIN] epoch=125/160, iter=14390/18560, loss=2.712875, lr=0.003080 | ETA 01:55:38 2022-07-25 10:18:34,655 - INFO - [TRAIN] epoch=125/160, iter=14400/18560, loss=2.577811, lr=0.003067 | ETA 01:26:14 2022-07-25 10:18:46,928 - INFO - [TRAIN] epoch=125/160, iter=14410/18560, loss=2.331335, lr=0.003054 | ETA 01:23:42 2022-07-25 10:18:59,179 - INFO - [TRAIN] epoch=125/160, iter=14420/18560, loss=2.713170, lr=0.003041 | ETA 01:23:59 2022-07-25 10:19:11,417 - INFO - [TRAIN] epoch=125/160, iter=14430/18560, loss=2.688934, lr=0.003028 | ETA 01:24:43 2022-07-25 10:19:23,615 - INFO - [TRAIN] epoch=125/160, iter=14440/18560, loss=2.494971, lr=0.003015 | ETA 01:24:14 2022-07-25 10:19:35,999 - INFO - [TRAIN] epoch=125/160, iter=14450/18560, loss=2.636141, lr=0.003002 | ETA 01:25:51 2022-07-25 10:19:48,220 - INFO - [TRAIN] epoch=125/160, iter=14460/18560, loss=3.084037, lr=0.002989 | ETA 01:24:30 2022-07-25 10:20:00,407 - INFO - [TRAIN] epoch=125/160, iter=14470/18560, loss=2.738261, lr=0.002976 | ETA 01:23:46 2022-07-25 10:20:12,596 - INFO - [TRAIN] epoch=125/160, iter=14480/18560, loss=2.642657, lr=0.002964 | ETA 01:22:16 2022-07-25 10:20:24,868 - INFO - [TRAIN] epoch=125/160, iter=14490/18560, loss=2.625626, lr=0.002951 | ETA 01:22:21 2022-07-25 10:20:37,175 - INFO - [TRAIN] epoch=125/160, iter=14500/18560, loss=2.611662, lr=0.002938 | ETA 01:23:01 2022-07-25 10:20:37,196 - INFO - Pop model from output_nospa/epoch_100 2022-07-25 10:20:37,376 - INFO - Push model to checkpoint output_nospa/epoch_125 2022-07-25 10:21:20,118 - INFO - [TRAIN] epoch=126/160, 
iter=14510/18560, loss=2.581295, lr=0.002925 | ETA 01:28:04 2022-07-25 10:21:32,445 - INFO - [TRAIN] epoch=126/160, iter=14520/18560, loss=2.442993, lr=0.002912 | ETA 01:23:38 2022-07-25 10:21:44,818 - INFO - [TRAIN] epoch=126/160, iter=14530/18560, loss=2.542392, lr=0.002899 | ETA 01:23:38 2022-07-25 10:21:57,328 - INFO - [TRAIN] epoch=126/160, iter=14540/18560, loss=2.763026, lr=0.002887 | ETA 01:24:27 2022-07-25 10:22:09,666 - INFO - [TRAIN] epoch=126/160, iter=14550/18560, loss=2.688950, lr=0.002874 | ETA 01:22:19 2022-07-25 10:22:22,301 - INFO - [TRAIN] epoch=126/160, iter=14560/18560, loss=2.757383, lr=0.002861 | ETA 01:23:26 2022-07-25 10:22:34,810 - INFO - [TRAIN] epoch=126/160, iter=14570/18560, loss=2.533727, lr=0.002848 | ETA 01:22:29 2022-07-25 10:22:47,224 - INFO - [TRAIN] epoch=126/160, iter=14580/18560, loss=2.862745, lr=0.002836 | ETA 01:20:48 2022-07-25 10:22:59,560 - INFO - [TRAIN] epoch=126/160, iter=14590/18560, loss=2.793446, lr=0.002823 | ETA 01:21:14 2022-07-25 10:23:11,740 - INFO - [TRAIN] epoch=126/160, iter=14600/18560, loss=2.789876, lr=0.002810 | ETA 01:20:48 2022-07-25 10:23:23,862 - INFO - [TRAIN] epoch=126/160, iter=14610/18560, loss=2.663654, lr=0.002797 | ETA 01:19:20 2022-07-25 10:24:06,186 - INFO - [TRAIN] epoch=127/160, iter=14620/18560, loss=2.597182, lr=0.002785 | ETA 03:24:02 2022-07-25 10:24:18,885 - INFO - [TRAIN] epoch=127/160, iter=14630/18560, loss=2.557169, lr=0.002772 | ETA 01:22:28 2022-07-25 10:24:31,467 - INFO - [TRAIN] epoch=127/160, iter=14640/18560, loss=2.597732, lr=0.002760 | ETA 01:22:27 2022-07-25 10:24:43,841 - INFO - [TRAIN] epoch=127/160, iter=14650/18560, loss=2.570012, lr=0.002747 | ETA 01:19:54 2022-07-25 10:24:56,280 - INFO - [TRAIN] epoch=127/160, iter=14660/18560, loss=2.672926, lr=0.002734 | ETA 01:20:51 2022-07-25 10:25:08,753 - INFO - [TRAIN] epoch=127/160, iter=14670/18560, loss=2.557828, lr=0.002722 | ETA 01:21:07 2022-07-25 10:25:21,287 - INFO - [TRAIN] epoch=127/160, iter=14680/18560, 
loss=2.516216, lr=0.002709 | ETA 01:19:57 2022-07-25 10:25:33,577 - INFO - [TRAIN] epoch=127/160, iter=14690/18560, loss=2.796390, lr=0.002697 | ETA 01:18:18 2022-07-25 10:25:45,780 - INFO - [TRAIN] epoch=127/160, iter=14700/18560, loss=2.896330, lr=0.002684 | ETA 01:18:39 2022-07-25 10:25:58,024 - INFO - [TRAIN] epoch=127/160, iter=14710/18560, loss=2.795279, lr=0.002672 | ETA 01:18:18 2022-07-25 10:26:10,274 - INFO - [TRAIN] epoch=127/160, iter=14720/18560, loss=2.772396, lr=0.002659 | ETA 01:18:14 2022-07-25 10:26:22,532 - INFO - [TRAIN] epoch=127/160, iter=14730/18560, loss=2.651410, lr=0.002647 | ETA 01:18:54 2022-07-25 10:27:04,155 - INFO - [TRAIN] epoch=128/160, iter=14740/18560, loss=2.671000, lr=0.002634 | ETA 01:28:25 2022-07-25 10:27:16,544 - INFO - [TRAIN] epoch=128/160, iter=14750/18560, loss=2.369185, lr=0.002622 | ETA 01:17:31 2022-07-25 10:27:28,831 - INFO - [TRAIN] epoch=128/160, iter=14760/18560, loss=2.699121, lr=0.002610 | ETA 01:17:37 2022-07-25 10:27:41,011 - INFO - [TRAIN] epoch=128/160, iter=14770/18560, loss=2.904417, lr=0.002597 | ETA 01:16:52 2022-07-25 10:27:53,274 - INFO - [TRAIN] epoch=128/160, iter=14780/18560, loss=2.872954, lr=0.002585 | ETA 01:17:51 2022-07-25 10:28:05,465 - INFO - [TRAIN] epoch=128/160, iter=14790/18560, loss=2.697807, lr=0.002572 | ETA 01:16:33 2022-07-25 10:28:17,624 - INFO - [TRAIN] epoch=128/160, iter=14800/18560, loss=2.871150, lr=0.002560 | ETA 01:15:52 2022-07-25 10:28:29,921 - INFO - [TRAIN] epoch=128/160, iter=14810/18560, loss=2.881399, lr=0.002548 | ETA 01:16:15 2022-07-25 10:28:42,173 - INFO - [TRAIN] epoch=128/160, iter=14820/18560, loss=2.514840, lr=0.002536 | ETA 01:16:54 2022-07-25 10:28:54,629 - INFO - [TRAIN] epoch=128/160, iter=14830/18560, loss=2.759672, lr=0.002523 | ETA 01:17:10 2022-07-25 10:29:06,891 - INFO - [TRAIN] epoch=128/160, iter=14840/18560, loss=2.766256, lr=0.002511 | ETA 01:15:16 2022-07-25 10:29:48,011 - INFO - [TRAIN] epoch=129/160, iter=14850/18560, loss=2.727687, lr=0.002499 
| ETA 08:42:46 2022-07-25 10:30:00,714 - INFO - [TRAIN] epoch=129/160, iter=14860/18560, loss=2.501826, lr=0.002487 | ETA 01:19:39 2022-07-25 10:30:13,142 - INFO - [TRAIN] epoch=129/160, iter=14870/18560, loss=2.677164, lr=0.002474 | ETA 01:15:37 2022-07-25 10:30:25,347 - INFO - [TRAIN] epoch=129/160, iter=14880/18560, loss=2.561675, lr=0.002462 | ETA 01:15:03 2022-07-25 10:30:37,523 - INFO - [TRAIN] epoch=129/160, iter=14890/18560, loss=2.731555, lr=0.002450 | ETA 01:15:13 2022-07-25 10:30:49,717 - INFO - [TRAIN] epoch=129/160, iter=14900/18560, loss=2.519836, lr=0.002438 | ETA 01:13:24 2022-07-25 10:31:02,107 - INFO - [TRAIN] epoch=129/160, iter=14910/18560, loss=2.551704, lr=0.002426 | ETA 01:14:43 2022-07-25 10:31:14,384 - INFO - [TRAIN] epoch=129/160, iter=14920/18560, loss=2.709578, lr=0.002414 | ETA 01:14:47 2022-07-25 10:31:26,772 - INFO - [TRAIN] epoch=129/160, iter=14930/18560, loss=3.038277, lr=0.002402 | ETA 01:15:12 2022-07-25 10:31:39,161 - INFO - [TRAIN] epoch=129/160, iter=14940/18560, loss=2.627360, lr=0.002390 | ETA 01:14:01 2022-07-25 10:31:51,542 - INFO - [TRAIN] epoch=129/160, iter=14950/18560, loss=2.766612, lr=0.002378 | ETA 01:14:05 2022-07-25 10:32:03,864 - INFO - [TRAIN] epoch=129/160, iter=14960/18560, loss=2.942373, lr=0.002366 | ETA 01:15:12 2022-07-25 10:32:46,059 - INFO - [TRAIN] epoch=130/160, iter=14970/18560, loss=2.717859, lr=0.002354 | ETA 01:44:21 2022-07-25 10:32:58,793 - INFO - [TRAIN] epoch=130/160, iter=14980/18560, loss=2.699688, lr=0.002342 | ETA 01:16:01 2022-07-25 10:33:11,185 - INFO - [TRAIN] epoch=130/160, iter=14990/18560, loss=2.628145, lr=0.002330 | ETA 01:12:24 2022-07-25 10:33:23,693 - INFO - [TRAIN] epoch=130/160, iter=15000/18560, loss=2.493547, lr=0.002318 | ETA 01:13:22 2022-07-25 10:33:36,275 - INFO - [TRAIN] epoch=130/160, iter=15010/18560, loss=2.865689, lr=0.002306 | ETA 01:14:07 2022-07-25 10:33:48,574 - INFO - [TRAIN] epoch=130/160, iter=15020/18560, loss=2.474649, lr=0.002294 | ETA 01:12:30 2022-07-25 
10:34:00,956 - INFO - [TRAIN] epoch=130/160, iter=15030/18560, loss=2.476717, lr=0.002282 | ETA 01:12:40 2022-07-25 10:34:13,372 - INFO - [TRAIN] epoch=130/160, iter=15040/18560, loss=2.678214, lr=0.002270 | ETA 01:13:06 2022-07-25 10:34:25,838 - INFO - [TRAIN] epoch=130/160, iter=15050/18560, loss=2.552795, lr=0.002259 | ETA 01:13:17 2022-07-25 10:34:38,215 - INFO - [TRAIN] epoch=130/160, iter=15060/18560, loss=2.349186, lr=0.002247 | ETA 01:11:18 2022-07-25 10:34:50,490 - INFO - [TRAIN] epoch=130/160, iter=15070/18560, loss=2.591158, lr=0.002235 | ETA 01:11:17 2022-07-25 10:35:02,731 - INFO - [TRAIN] epoch=130/160, iter=15080/18560, loss=2.496292, lr=0.002223 | ETA 01:10:36 2022-07-25 10:35:02,752 - INFO - Pop model from output_nospa/epoch_105 2022-07-25 10:35:02,960 - INFO - Push model to checkpoint output_nospa/epoch_130 2022-07-25 10:35:45,235 - INFO - [TRAIN] epoch=131/160, iter=15090/18560, loss=2.741824, lr=0.002212 | ETA 01:14:02 2022-07-25 10:35:57,572 - INFO - [TRAIN] epoch=131/160, iter=15100/18560, loss=2.517965, lr=0.002200 | ETA 01:10:36 2022-07-25 10:36:09,903 - INFO - [TRAIN] epoch=131/160, iter=15110/18560, loss=2.414443, lr=0.002188 | ETA 01:10:40 2022-07-25 10:36:22,053 - INFO - [TRAIN] epoch=131/160, iter=15120/18560, loss=2.705996, lr=0.002177 | ETA 01:08:59 2022-07-25 10:36:34,263 - INFO - [TRAIN] epoch=131/160, iter=15130/18560, loss=2.659755, lr=0.002165 | ETA 01:11:03 2022-07-25 10:36:46,474 - INFO - [TRAIN] epoch=131/160, iter=15140/18560, loss=2.633502, lr=0.002153 | ETA 01:09:26 2022-07-25 10:36:58,781 - INFO - [TRAIN] epoch=131/160, iter=15150/18560, loss=2.687874, lr=0.002142 | ETA 01:09:56 2022-07-25 10:37:11,044 - INFO - [TRAIN] epoch=131/160, iter=15160/18560, loss=2.652299, lr=0.002130 | ETA 01:10:23 2022-07-25 10:37:23,404 - INFO - [TRAIN] epoch=131/160, iter=15170/18560, loss=2.456134, lr=0.002119 | ETA 01:10:03 2022-07-25 10:37:35,505 - INFO - [TRAIN] epoch=131/160, iter=15180/18560, loss=2.369290, lr=0.002107 | ETA 01:08:47 
2022-07-25 10:37:47,816 - INFO - [TRAIN] epoch=131/160, iter=15190/18560, loss=2.483933, lr=0.002096 | ETA 01:07:57 2022-07-25 10:38:27,543 - INFO - [TRAIN] epoch=132/160, iter=15200/18560, loss=2.703731, lr=0.002084 | ETA 02:45:09 2022-07-25 10:38:39,968 - INFO - [TRAIN] epoch=132/160, iter=15210/18560, loss=2.666441, lr=0.002073 | ETA 01:08:14 2022-07-25 10:38:52,416 - INFO - [TRAIN] epoch=132/160, iter=15220/18560, loss=2.578341, lr=0.002061 | ETA 01:09:32 2022-07-25 10:39:04,897 - INFO - [TRAIN] epoch=132/160, iter=15230/18560, loss=2.480728, lr=0.002050 | ETA 01:09:11 2022-07-25 10:39:17,355 - INFO - [TRAIN] epoch=132/160, iter=15240/18560, loss=2.539796, lr=0.002039 | ETA 01:09:13 2022-07-25 10:39:29,810 - INFO - [TRAIN] epoch=132/160, iter=15250/18560, loss=2.450320, lr=0.002027 | ETA 01:09:03 2022-07-25 10:39:42,378 - INFO - [TRAIN] epoch=132/160, iter=15260/18560, loss=2.671277, lr=0.002016 | ETA 01:08:59 2022-07-25 10:39:54,873 - INFO - [TRAIN] epoch=132/160, iter=15270/18560, loss=2.739708, lr=0.002005 | ETA 01:08:49 2022-07-25 10:40:07,309 - INFO - [TRAIN] epoch=132/160, iter=15280/18560, loss=2.751974, lr=0.001993 | ETA 01:08:27 2022-07-25 10:40:19,671 - INFO - [TRAIN] epoch=132/160, iter=15290/18560, loss=2.566281, lr=0.001982 | ETA 01:06:58 2022-07-25 10:40:31,885 - INFO - [TRAIN] epoch=132/160, iter=15300/18560, loss=2.446341, lr=0.001971 | ETA 01:06:13 2022-07-25 10:40:44,138 - INFO - [TRAIN] epoch=132/160, iter=15310/18560, loss=2.761194, lr=0.001960 | ETA 01:06:02 2022-07-25 10:41:26,135 - INFO - [TRAIN] epoch=133/160, iter=15320/18560, loss=2.532883, lr=0.001948 | ETA 01:13:27 2022-07-25 10:41:38,741 - INFO - [TRAIN] epoch=133/160, iter=15330/18560, loss=2.546699, lr=0.001937 | ETA 01:07:14 2022-07-25 10:41:51,088 - INFO - [TRAIN] epoch=133/160, iter=15340/18560, loss=2.574656, lr=0.001926 | ETA 01:05:39 2022-07-25 10:42:03,323 - INFO - [TRAIN] epoch=133/160, iter=15350/18560, loss=2.745646, lr=0.001915 | ETA 01:05:38 2022-07-25 10:42:15,826 - 
INFO - [TRAIN] epoch=133/160, iter=15360/18560, loss=2.561192, lr=0.001904 | ETA 01:06:57 2022-07-25 10:42:28,283 - INFO - [TRAIN] epoch=133/160, iter=15370/18560, loss=2.384324, lr=0.001893 | ETA 01:06:31 2022-07-25 10:42:40,728 - INFO - [TRAIN] epoch=133/160, iter=15380/18560, loss=2.355945, lr=0.001882 | ETA 01:06:00 2022-07-25 10:42:53,322 - INFO - [TRAIN] epoch=133/160, iter=15390/18560, loss=2.755784, lr=0.001871 | ETA 01:06:34 2022-07-25 10:43:05,802 - INFO - [TRAIN] epoch=133/160, iter=15400/18560, loss=2.587896, lr=0.001860 | ETA 01:05:40 2022-07-25 10:43:18,242 - INFO - [TRAIN] epoch=133/160, iter=15410/18560, loss=2.566407, lr=0.001849 | ETA 01:05:03 2022-07-25 10:43:30,711 - INFO - [TRAIN] epoch=133/160, iter=15420/18560, loss=2.439548, lr=0.001838 | ETA 01:05:04 2022-07-25 10:44:10,574 - INFO - [TRAIN] epoch=134/160, iter=15430/18560, loss=2.598163, lr=0.001827 | ETA 07:04:05 2022-07-25 10:44:23,043 - INFO - [TRAIN] epoch=134/160, iter=15440/18560, loss=2.369205, lr=0.001816 | ETA 01:05:11 2022-07-25 10:44:35,409 - INFO - [TRAIN] epoch=134/160, iter=15450/18560, loss=2.537684, lr=0.001805 | ETA 01:04:56 2022-07-25 10:44:47,575 - INFO - [TRAIN] epoch=134/160, iter=15460/18560, loss=2.597566, lr=0.001794 | ETA 01:02:23 2022-07-25 10:44:59,833 - INFO - [TRAIN] epoch=134/160, iter=15470/18560, loss=2.530414, lr=0.001784 | ETA 01:03:17 2022-07-25 10:45:12,186 - INFO - [TRAIN] epoch=134/160, iter=15480/18560, loss=2.501907, lr=0.001773 | ETA 01:02:44 2022-07-25 10:45:24,413 - INFO - [TRAIN] epoch=134/160, iter=15490/18560, loss=2.537845, lr=0.001762 | ETA 01:02:21 2022-07-25 10:45:36,578 - INFO - [TRAIN] epoch=134/160, iter=15500/18560, loss=2.547713, lr=0.001751 | ETA 01:01:47 2022-07-25 10:45:48,911 - INFO - [TRAIN] epoch=134/160, iter=15510/18560, loss=2.658164, lr=0.001741 | ETA 01:02:54 2022-07-25 10:46:01,244 - INFO - [TRAIN] epoch=134/160, iter=15520/18560, loss=2.505899, lr=0.001730 | ETA 01:03:30 2022-07-25 10:46:13,682 - INFO - [TRAIN] 
epoch=134/160, iter=15530/18560, loss=2.550319, lr=0.001719 | ETA 01:02:40 2022-07-25 10:46:26,072 - INFO - [TRAIN] epoch=134/160, iter=15540/18560, loss=2.597833, lr=0.001709 | ETA 01:02:10 2022-07-25 10:47:07,312 - INFO - [TRAIN] epoch=135/160, iter=15550/18560, loss=2.681693, lr=0.001698 | ETA 01:25:25 2022-07-25 10:47:19,872 - INFO - [TRAIN] epoch=135/160, iter=15560/18560, loss=2.549350, lr=0.001687 | ETA 01:03:13 2022-07-25 10:47:32,520 - INFO - [TRAIN] epoch=135/160, iter=15570/18560, loss=2.510038, lr=0.001677 | ETA 01:04:13 2022-07-25 10:47:45,065 - INFO - [TRAIN] epoch=135/160, iter=15580/18560, loss=2.585239, lr=0.001666 | ETA 01:01:31 2022-07-25 10:47:57,371 - INFO - [TRAIN] epoch=135/160, iter=15590/18560, loss=2.535992, lr=0.001656 | ETA 01:00:18 2022-07-25 10:48:09,675 - INFO - [TRAIN] epoch=135/160, iter=15600/18560, loss=2.653273, lr=0.001645 | ETA 01:00:58 2022-07-25 10:48:22,173 - INFO - [TRAIN] epoch=135/160, iter=15610/18560, loss=2.340686, lr=0.001635 | ETA 01:01:16 2022-07-25 10:48:34,664 - INFO - [TRAIN] epoch=135/160, iter=15620/18560, loss=2.835471, lr=0.001625 | ETA 01:01:37 2022-07-25 10:48:46,950 - INFO - [TRAIN] epoch=135/160, iter=15630/18560, loss=2.646196, lr=0.001614 | ETA 01:00:00 2022-07-25 10:48:59,239 - INFO - [TRAIN] epoch=135/160, iter=15640/18560, loss=2.461953, lr=0.001604 | ETA 01:00:17 2022-07-25 10:49:11,515 - INFO - [TRAIN] epoch=135/160, iter=15650/18560, loss=2.674909, lr=0.001593 | ETA 00:59:42 2022-07-25 10:49:23,677 - INFO - [TRAIN] epoch=135/160, iter=15660/18560, loss=2.452496, lr=0.001583 | ETA 00:58:25 2022-07-25 10:49:23,696 - INFO - Pop model from output_nospa/epoch_110 2022-07-25 10:49:23,885 - INFO - Push model to checkpoint output_nospa/epoch_135 2022-07-25 10:50:04,094 - INFO - [TRAIN] epoch=136/160, iter=15670/18560, loss=2.528943, lr=0.001573 | ETA 01:02:18 2022-07-25 10:50:16,522 - INFO - [TRAIN] epoch=136/160, iter=15680/18560, loss=2.414417, lr=0.001563 | ETA 00:59:15 2022-07-25 10:50:28,953 - INFO - 
[TRAIN] epoch=136/160, iter=15690/18560, loss=2.368386, lr=0.001552 | ETA 01:00:02 2022-07-25 10:50:41,314 - INFO - [TRAIN] epoch=136/160, iter=15700/18560, loss=2.728493, lr=0.001542 | ETA 00:58:22 2022-07-25 10:50:53,613 - INFO - [TRAIN] epoch=136/160, iter=15710/18560, loss=2.337515, lr=0.001532 | ETA 00:58:08 2022-07-25 10:51:05,998 - INFO - [TRAIN] epoch=136/160, iter=15720/18560, loss=2.353338, lr=0.001522 | ETA 00:59:05 2022-07-25 10:51:18,190 - INFO - [TRAIN] epoch=136/160, iter=15730/18560, loss=2.712195, lr=0.001512 | ETA 00:57:42 2022-07-25 10:51:30,527 - INFO - [TRAIN] epoch=136/160, iter=15740/18560, loss=2.587533, lr=0.001502 | ETA 00:59:07 2022-07-25 10:51:42,900 - INFO - [TRAIN] epoch=136/160, iter=15750/18560, loss=2.449412, lr=0.001492 | ETA 00:57:28 2022-07-25 10:51:55,405 - INFO - [TRAIN] epoch=136/160, iter=15760/18560, loss=2.596527, lr=0.001482 | ETA 00:58:12 2022-07-25 10:52:07,643 - INFO - [TRAIN] epoch=136/160, iter=15770/18560, loss=2.529326, lr=0.001472 | ETA 00:56:21 2022-07-25 10:52:49,882 - INFO - [TRAIN] epoch=137/160, iter=15780/18560, loss=2.500118, lr=0.001462 | ETA 02:25:39 2022-07-25 10:53:02,462 - INFO - [TRAIN] epoch=137/160, iter=15790/18560, loss=2.532882, lr=0.001452 | ETA 00:57:37 2022-07-25 10:53:14,831 - INFO - [TRAIN] epoch=137/160, iter=15800/18560, loss=2.536001, lr=0.001442 | ETA 00:57:11 2022-07-25 10:53:27,227 - INFO - [TRAIN] epoch=137/160, iter=15810/18560, loss=2.566884, lr=0.001432 | ETA 00:57:25 2022-07-25 10:53:39,533 - INFO - [TRAIN] epoch=137/160, iter=15820/18560, loss=2.687933, lr=0.001422 | ETA 00:55:38 2022-07-25 10:53:51,705 - INFO - [TRAIN] epoch=137/160, iter=15830/18560, loss=2.251237, lr=0.001412 | ETA 00:55:03 2022-07-25 10:54:04,063 - INFO - [TRAIN] epoch=137/160, iter=15840/18560, loss=2.568031, lr=0.001402 | ETA 00:55:45 2022-07-25 10:54:16,499 - INFO - [TRAIN] epoch=137/160, iter=15850/18560, loss=2.570870, lr=0.001392 | ETA 00:56:18 2022-07-25 10:54:28,943 - INFO - [TRAIN] epoch=137/160, 
iter=15860/18560, loss=2.601518, lr=0.001383 | ETA 00:56:01 2022-07-25 10:54:41,439 - INFO - [TRAIN] epoch=137/160, iter=15870/18560, loss=2.406703, lr=0.001373 | ETA 00:55:59 2022-07-25 10:54:53,924 - INFO - [TRAIN] epoch=137/160, iter=15880/18560, loss=2.455603, lr=0.001363 | ETA 00:55:39 2022-07-25 10:55:06,312 - INFO - [TRAIN] epoch=137/160, iter=15890/18560, loss=2.696634, lr=0.001354 | ETA 00:55:18 2022-07-25 10:55:47,783 - INFO - [TRAIN] epoch=138/160, iter=15900/18560, loss=2.491275, lr=0.001344 | ETA 00:59:39 2022-07-25 10:56:00,180 - INFO - [TRAIN] epoch=138/160, iter=15910/18560, loss=2.451516, lr=0.001334 | ETA 00:54:42 2022-07-25 10:56:12,498 - INFO - [TRAIN] epoch=138/160, iter=15920/18560, loss=2.493538, lr=0.001325 | ETA 00:54:56 2022-07-25 10:56:25,036 - INFO - [TRAIN] epoch=138/160, iter=15930/18560, loss=2.574543, lr=0.001315 | ETA 00:54:53 2022-07-25 10:56:37,173 - INFO - [TRAIN] epoch=138/160, iter=15940/18560, loss=2.492197, lr=0.001306 | ETA 00:52:39 2022-07-25 10:56:49,439 - INFO - [TRAIN] epoch=138/160, iter=15950/18560, loss=2.453913, lr=0.001296 | ETA 00:52:49 2022-07-25 10:57:01,610 - INFO - [TRAIN] epoch=138/160, iter=15960/18560, loss=2.723858, lr=0.001287 | ETA 00:53:04 2022-07-25 10:57:13,739 - INFO - [TRAIN] epoch=138/160, iter=15970/18560, loss=2.614186, lr=0.001277 | ETA 00:51:28 2022-07-25 10:57:26,054 - INFO - [TRAIN] epoch=138/160, iter=15980/18560, loss=2.644028, lr=0.001268 | ETA 00:53:35 2022-07-25 10:57:38,227 - INFO - [TRAIN] epoch=138/160, iter=15990/18560, loss=2.523486, lr=0.001259 | ETA 00:51:13 2022-07-25 10:57:50,427 - INFO - [TRAIN] epoch=138/160, iter=16000/18560, loss=2.456889, lr=0.001249 | ETA 00:52:08 2022-07-25 10:58:31,531 - INFO - [TRAIN] epoch=139/160, iter=16010/18560, loss=2.439239, lr=0.001240 | ETA 05:58:43 2022-07-25 10:58:44,267 - INFO - [TRAIN] epoch=139/160, iter=16020/18560, loss=2.310796, lr=0.001231 | ETA 00:54:00 2022-07-25 10:58:56,772 - INFO - [TRAIN] epoch=139/160, iter=16030/18560, 
loss=2.341685, lr=0.001221 | ETA 00:52:24 2022-07-25 10:59:09,218 - INFO - [TRAIN] epoch=139/160, iter=16040/18560, loss=2.413633, lr=0.001212 | ETA 00:52:08 2022-07-25 10:59:21,456 - INFO - [TRAIN] epoch=139/160, iter=16050/18560, loss=2.628843, lr=0.001203 | ETA 00:50:39 2022-07-25 10:59:33,745 - INFO - [TRAIN] epoch=139/160, iter=16060/18560, loss=2.413899, lr=0.001194 | ETA 00:51:02 2022-07-25 10:59:46,001 - INFO - [TRAIN] epoch=139/160, iter=16070/18560, loss=2.489731, lr=0.001185 | ETA 00:51:13 2022-07-25 10:59:58,325 - INFO - [TRAIN] epoch=139/160, iter=16080/18560, loss=2.493328, lr=0.001176 | ETA 00:50:50 2022-07-25 11:00:10,530 - INFO - [TRAIN] epoch=139/160, iter=16090/18560, loss=2.532528, lr=0.001167 | ETA 00:49:57 2022-07-25 11:00:22,671 - INFO - [TRAIN] epoch=139/160, iter=16100/18560, loss=2.546118, lr=0.001158 | ETA 00:49:42 2022-07-25 11:00:34,917 - INFO - [TRAIN] epoch=139/160, iter=16110/18560, loss=2.445106, lr=0.001148 | ETA 00:49:57 2022-07-25 11:00:47,201 - INFO - [TRAIN] epoch=139/160, iter=16120/18560, loss=2.362148, lr=0.001140 | ETA 00:49:53 2022-07-25 11:01:27,565 - INFO - [TRAIN] epoch=140/160, iter=16130/18560, loss=2.494731, lr=0.001131 | ETA 01:09:00 2022-07-25 11:01:40,020 - INFO - [TRAIN] epoch=140/160, iter=16140/18560, loss=2.492344, lr=0.001122 | ETA 00:48:58 2022-07-25 11:01:52,484 - INFO - [TRAIN] epoch=140/160, iter=16150/18560, loss=2.213452, lr=0.001113 | ETA 00:50:07 2022-07-25 11:02:04,791 - INFO - [TRAIN] epoch=140/160, iter=16160/18560, loss=2.366913, lr=0.001104 | ETA 00:48:55 2022-07-25 11:02:17,125 - INFO - [TRAIN] epoch=140/160, iter=16170/18560, loss=2.610676, lr=0.001095 | ETA 00:48:37 2022-07-25 11:02:29,380 - INFO - [TRAIN] epoch=140/160, iter=16180/18560, loss=2.475876, lr=0.001086 | ETA 00:48:27 2022-07-25 11:02:41,659 - INFO - [TRAIN] epoch=140/160, iter=16190/18560, loss=2.613630, lr=0.001078 | ETA 00:48:40 2022-07-25 11:02:53,965 - INFO - [TRAIN] epoch=140/160, iter=16200/18560, loss=2.643485, lr=0.001069 
| ETA 00:49:07 2022-07-25 11:03:06,270 - INFO - [TRAIN] epoch=140/160, iter=16210/18560, loss=2.707431, lr=0.001060 | ETA 00:49:02 2022-07-25 11:03:18,490 - INFO - [TRAIN] epoch=140/160, iter=16220/18560, loss=2.511895, lr=0.001051 | ETA 00:47:09 2022-07-25 11:03:30,801 - INFO - [TRAIN] epoch=140/160, iter=16230/18560, loss=2.469665, lr=0.001043 | ETA 00:47:54 2022-07-25 11:03:42,914 - INFO - [TRAIN] epoch=140/160, iter=16240/18560, loss=2.497500, lr=0.001034 | ETA 00:47:03 2022-07-25 11:03:42,992 - INFO - Pop model from output_nospa/epoch_115 2022-07-25 11:03:43,209 - INFO - Push model to checkpoint output_nospa/epoch_140 2022-07-25 11:04:23,979 - INFO - [TRAIN] epoch=141/160, iter=16250/18560, loss=2.523623, lr=0.001026 | ETA 00:49:02 2022-07-25 11:04:36,369 - INFO - [TRAIN] epoch=141/160, iter=16260/18560, loss=2.383084, lr=0.001017 | ETA 00:47:25 2022-07-25 11:04:48,787 - INFO - [TRAIN] epoch=141/160, iter=16270/18560, loss=2.363344, lr=0.001009 | ETA 00:47:32 2022-07-25 11:05:01,269 - INFO - [TRAIN] epoch=141/160, iter=16280/18560, loss=2.477523, lr=0.001000 | ETA 00:47:37 2022-07-25 11:05:13,454 - INFO - [TRAIN] epoch=141/160, iter=16290/18560, loss=2.646637, lr=0.000992 | ETA 00:46:28 2022-07-25 11:05:25,882 - INFO - [TRAIN] epoch=141/160, iter=16300/18560, loss=2.392076, lr=0.000983 | ETA 00:47:25 2022-07-25 11:05:38,292 - INFO - [TRAIN] epoch=141/160, iter=16310/18560, loss=2.467214, lr=0.000975 | ETA 00:47:02 2022-07-25 11:05:50,864 - INFO - [TRAIN] epoch=141/160, iter=16320/18560, loss=2.648033, lr=0.000966 | ETA 00:45:46 2022-07-25 11:06:03,079 - INFO - [TRAIN] epoch=141/160, iter=16330/18560, loss=2.476900, lr=0.000958 | ETA 00:45:10 2022-07-25 11:06:15,396 - INFO - [TRAIN] epoch=141/160, iter=16340/18560, loss=2.429320, lr=0.000950 | ETA 00:45:11 2022-07-25 11:06:27,635 - INFO - [TRAIN] epoch=141/160, iter=16350/18560, loss=2.368317, lr=0.000942 | ETA 00:44:17 2022-07-25 11:07:08,187 - INFO - [TRAIN] epoch=142/160, iter=16360/18560, loss=2.466169, 
lr=0.000933 | ETA 01:52:32
2022-07-25 11:07:20,631 - INFO - [TRAIN] epoch=142/160, iter=16370/18560, loss=2.584626, lr=0.000925 | ETA 00:45:02
2022-07-25 11:07:33,040 - INFO - [TRAIN] epoch=142/160, iter=16380/18560, loss=2.516787, lr=0.000917 | ETA 00:43:52
2022-07-25 11:07:45,226 - INFO - [TRAIN] epoch=142/160, iter=16390/18560, loss=2.384484, lr=0.000909 | ETA 00:44:24
2022-07-25 11:07:57,752 - INFO - [TRAIN] epoch=142/160, iter=16400/18560, loss=2.626023, lr=0.000901 | ETA 00:44:21
2022-07-25 11:08:10,177 - INFO - [TRAIN] epoch=142/160, iter=16410/18560, loss=2.245121, lr=0.000893 | ETA 00:44:04
2022-07-25 11:08:22,516 - INFO - [TRAIN] epoch=142/160, iter=16420/18560, loss=2.599064, lr=0.000885 | ETA 00:43:33
2022-07-25 11:08:35,084 - INFO - [TRAIN] epoch=142/160, iter=16430/18560, loss=2.613887, lr=0.000877 | ETA 00:44:12
2022-07-25 11:08:47,502 - INFO - [TRAIN] epoch=142/160, iter=16440/18560, loss=2.578875, lr=0.000869 | ETA 00:43:59
2022-07-25 11:08:59,965 - INFO - [TRAIN] epoch=142/160, iter=16450/18560, loss=2.426327, lr=0.000861 | ETA 00:44:33
2022-07-25 11:09:12,433 - INFO - [TRAIN] epoch=142/160, iter=16460/18560, loss=2.382156, lr=0.000853 | ETA 00:43:22
2022-07-25 11:09:24,797 - INFO - [TRAIN] epoch=142/160, iter=16470/18560, loss=2.392348, lr=0.000845 | ETA 00:42:52
2022-07-25 11:10:06,296 - INFO - [TRAIN] epoch=143/160, iter=16480/18560, loss=2.458372, lr=0.000837 | ETA 00:46:21
2022-07-25 11:10:19,009 - INFO - [TRAIN] epoch=143/160, iter=16490/18560, loss=2.603558, lr=0.000829 | ETA 00:43:40
2022-07-25 11:10:31,364 - INFO - [TRAIN] epoch=143/160, iter=16500/18560, loss=2.597877, lr=0.000822 | ETA 00:42:38
2022-07-25 11:10:43,868 - INFO - [TRAIN] epoch=143/160, iter=16510/18560, loss=2.544525, lr=0.000814 | ETA 00:42:19
2022-07-25 11:10:56,204 - INFO - [TRAIN] epoch=143/160, iter=16520/18560, loss=2.344388, lr=0.000806 | ETA 00:42:08
2022-07-25 11:11:08,646 - INFO - [TRAIN] epoch=143/160, iter=16530/18560, loss=2.277564, lr=0.000799 | ETA 00:41:30
2022-07-25 11:11:20,905 - INFO - [TRAIN] epoch=143/160, iter=16540/18560, loss=2.736085, lr=0.000791 | ETA 00:41:09
2022-07-25 11:11:33,258 - INFO - [TRAIN] epoch=143/160, iter=16550/18560, loss=2.388910, lr=0.000783 | ETA 00:40:46
2022-07-25 11:11:45,444 - INFO - [TRAIN] epoch=143/160, iter=16560/18560, loss=2.679156, lr=0.000776 | ETA 00:40:12
2022-07-25 11:11:57,805 - INFO - [TRAIN] epoch=143/160, iter=16570/18560, loss=2.308598, lr=0.000768 | ETA 00:40:57
2022-07-25 11:12:10,161 - INFO - [TRAIN] epoch=143/160, iter=16580/18560, loss=2.384864, lr=0.000761 | ETA 00:41:05
2022-07-25 11:12:53,082 - INFO - [TRAIN] epoch=144/160, iter=16590/18560, loss=2.381611, lr=0.000753 | ETA 04:51:47
2022-07-25 11:13:05,557 - INFO - [TRAIN] epoch=144/160, iter=16600/18560, loss=2.605464, lr=0.000746 | ETA 00:40:23
2022-07-25 11:13:17,835 - INFO - [TRAIN] epoch=144/160, iter=16610/18560, loss=2.379615, lr=0.000739 | ETA 00:40:38
2022-07-25 11:13:30,167 - INFO - [TRAIN] epoch=144/160, iter=16620/18560, loss=2.465703, lr=0.000731 | ETA 00:39:49
2022-07-25 11:13:42,468 - INFO - [TRAIN] epoch=144/160, iter=16630/18560, loss=2.343037, lr=0.000724 | ETA 00:39:38
2022-07-25 11:13:54,846 - INFO - [TRAIN] epoch=144/160, iter=16640/18560, loss=2.341448, lr=0.000717 | ETA 00:39:24
2022-07-25 11:14:06,975 - INFO - [TRAIN] epoch=144/160, iter=16650/18560, loss=2.471192, lr=0.000709 | ETA 00:38:43
2022-07-25 11:14:19,324 - INFO - [TRAIN] epoch=144/160, iter=16660/18560, loss=2.324571, lr=0.000702 | ETA 00:39:32
2022-07-25 11:14:31,497 - INFO - [TRAIN] epoch=144/160, iter=16670/18560, loss=2.619433, lr=0.000695 | ETA 00:38:53
2022-07-25 11:14:43,705 - INFO - [TRAIN] epoch=144/160, iter=16680/18560, loss=2.332455, lr=0.000688 | ETA 00:38:11
2022-07-25 11:14:55,994 - INFO - [TRAIN] epoch=144/160, iter=16690/18560, loss=2.631424, lr=0.000681 | ETA 00:38:02
2022-07-25 11:15:08,225 - INFO - [TRAIN] epoch=144/160, iter=16700/18560, loss=2.516064, lr=0.000673 | ETA 00:37:41
2022-07-25 11:15:50,820 - INFO - [TRAIN] epoch=145/160, iter=16710/18560, loss=2.268431, lr=0.000666 | ETA 00:53:27
2022-07-25 11:16:03,353 - INFO - [TRAIN] epoch=145/160, iter=16720/18560, loss=2.473109, lr=0.000659 | ETA 00:38:56
2022-07-25 11:16:15,641 - INFO - [TRAIN] epoch=145/160, iter=16730/18560, loss=2.273922, lr=0.000652 | ETA 00:38:02
2022-07-25 11:16:27,915 - INFO - [TRAIN] epoch=145/160, iter=16740/18560, loss=2.584496, lr=0.000645 | ETA 00:36:56
2022-07-25 11:16:40,162 - INFO - [TRAIN] epoch=145/160, iter=16750/18560, loss=2.298377, lr=0.000639 | ETA 00:37:23
2022-07-25 11:16:52,329 - INFO - [TRAIN] epoch=145/160, iter=16760/18560, loss=2.288389, lr=0.000632 | ETA 00:36:04
2022-07-25 11:17:04,545 - INFO - [TRAIN] epoch=145/160, iter=16770/18560, loss=2.624151, lr=0.000625 | ETA 00:36:56
2022-07-25 11:17:16,692 - INFO - [TRAIN] epoch=145/160, iter=16780/18560, loss=2.582177, lr=0.000618 | ETA 00:36:13
2022-07-25 11:17:29,171 - INFO - [TRAIN] epoch=145/160, iter=16790/18560, loss=2.593188, lr=0.000611 | ETA 00:36:08
2022-07-25 11:17:41,304 - INFO - [TRAIN] epoch=145/160, iter=16800/18560, loss=2.581858, lr=0.000605 | ETA 00:35:52
2022-07-25 11:17:53,716 - INFO - [TRAIN] epoch=145/160, iter=16810/18560, loss=2.464734, lr=0.000598 | ETA 00:36:31
2022-07-25 11:18:06,212 - INFO - [TRAIN] epoch=145/160, iter=16820/18560, loss=2.478762, lr=0.000591 | ETA 00:36:21
2022-07-25 11:18:06,235 - INFO - Pop model from output_nospa/epoch_120
2022-07-25 11:18:06,462 - INFO - Push model to checkpoint output_nospa/epoch_145
2022-07-25 11:18:47,196 - INFO - [TRAIN] epoch=146/160, iter=16830/18560, loss=2.592535, lr=0.000585 | ETA 00:37:23
2022-07-25 11:18:59,472 - INFO - [TRAIN] epoch=146/160, iter=16840/18560, loss=2.493227, lr=0.000578 | ETA 00:35:05
2022-07-25 11:19:11,948 - INFO - [TRAIN] epoch=146/160, iter=16850/18560, loss=2.400460, lr=0.000571 | ETA 00:35:25
2022-07-25 11:19:24,289 - INFO - [TRAIN] epoch=146/160, iter=16860/18560, loss=2.489980, lr=0.000565 | ETA 00:34:54
2022-07-25 11:19:36,773 - INFO - [TRAIN] epoch=146/160, iter=16870/18560, loss=2.399581, lr=0.000558 | ETA 00:34:51
2022-07-25 11:19:49,188 - INFO - [TRAIN] epoch=146/160, iter=16880/18560, loss=2.357191, lr=0.000552 | ETA 00:34:54
2022-07-25 11:20:01,589 - INFO - [TRAIN] epoch=146/160, iter=16890/18560, loss=2.410273, lr=0.000545 | ETA 00:34:53
2022-07-25 11:20:14,172 - INFO - [TRAIN] epoch=146/160, iter=16900/18560, loss=2.549635, lr=0.000539 | ETA 00:34:56
2022-07-25 11:20:26,662 - INFO - [TRAIN] epoch=146/160, iter=16910/18560, loss=2.588864, lr=0.000533 | ETA 00:34:13
2022-07-25 11:20:39,064 - INFO - [TRAIN] epoch=146/160, iter=16920/18560, loss=2.643601, lr=0.000526 | ETA 00:33:42
2022-07-25 11:20:51,456 - INFO - [TRAIN] epoch=146/160, iter=16930/18560, loss=2.356428, lr=0.000520 | ETA 00:33:15
2022-07-25 11:21:33,819 - INFO - [TRAIN] epoch=147/160, iter=16940/18560, loss=2.581469, lr=0.000514 | ETA 01:24:21
2022-07-25 11:21:46,334 - INFO - [TRAIN] epoch=147/160, iter=16950/18560, loss=2.391101, lr=0.000508 | ETA 00:33:24
2022-07-25 11:21:58,662 - INFO - [TRAIN] epoch=147/160, iter=16960/18560, loss=2.418098, lr=0.000501 | ETA 00:32:43
2022-07-25 11:22:10,988 - INFO - [TRAIN] epoch=147/160, iter=16970/18560, loss=2.414713, lr=0.000495 | ETA 00:33:19
2022-07-25 11:22:23,503 - INFO - [TRAIN] epoch=147/160, iter=16980/18560, loss=2.326014, lr=0.000489 | ETA 00:32:40
2022-07-25 11:22:35,873 - INFO - [TRAIN] epoch=147/160, iter=16990/18560, loss=2.417202, lr=0.000483 | ETA 00:32:10
2022-07-25 11:22:48,301 - INFO - [TRAIN] epoch=147/160, iter=17000/18560, loss=2.339262, lr=0.000477 | ETA 00:32:21
2022-07-25 11:23:00,797 - INFO - [TRAIN] epoch=147/160, iter=17010/18560, loss=2.427868, lr=0.000471 | ETA 00:32:15
2022-07-25 11:23:13,149 - INFO - [TRAIN] epoch=147/160, iter=17020/18560, loss=2.687676, lr=0.000465 | ETA 00:31:16
2022-07-25 11:23:25,401 - INFO - [TRAIN] epoch=147/160, iter=17030/18560, loss=2.383548, lr=0.000459 | ETA 00:31:21
2022-07-25 11:23:37,719 - INFO - [TRAIN] epoch=147/160, iter=17040/18560, loss=2.475449, lr=0.000453 | ETA 00:31:51
2022-07-25 11:23:49,922 - INFO - [TRAIN] epoch=147/160, iter=17050/18560, loss=2.474850, lr=0.000448 | ETA 00:30:44
2022-07-25 11:24:32,190 - INFO - [TRAIN] epoch=148/160, iter=17060/18560, loss=2.339798, lr=0.000442 | ETA 00:34:14
2022-07-25 11:24:44,750 - INFO - [TRAIN] epoch=148/160, iter=17070/18560, loss=2.382632, lr=0.000436 | ETA 00:31:18
2022-07-25 11:24:57,063 - INFO - [TRAIN] epoch=148/160, iter=17080/18560, loss=2.336056, lr=0.000430 | ETA 00:30:24
2022-07-25 11:25:09,339 - INFO - [TRAIN] epoch=148/160, iter=17090/18560, loss=2.557405, lr=0.000424 | ETA 00:29:55
2022-07-25 11:25:21,558 - INFO - [TRAIN] epoch=148/160, iter=17100/18560, loss=2.468296, lr=0.000419 | ETA 00:30:02
2022-07-25 11:25:33,840 - INFO - [TRAIN] epoch=148/160, iter=17110/18560, loss=2.388548, lr=0.000413 | ETA 00:29:25
2022-07-25 11:25:46,055 - INFO - [TRAIN] epoch=148/160, iter=17120/18560, loss=2.342592, lr=0.000408 | ETA 00:29:26
2022-07-25 11:25:58,309 - INFO - [TRAIN] epoch=148/160, iter=17130/18560, loss=2.602294, lr=0.000402 | ETA 00:28:54
2022-07-25 11:26:10,555 - INFO - [TRAIN] epoch=148/160, iter=17140/18560, loss=2.535555, lr=0.000397 | ETA 00:29:13
2022-07-25 11:26:22,846 - INFO - [TRAIN] epoch=148/160, iter=17150/18560, loss=2.553432, lr=0.000391 | ETA 00:29:18
2022-07-25 11:26:35,085 - INFO - [TRAIN] epoch=148/160, iter=17160/18560, loss=2.366557, lr=0.000386 | ETA 00:28:18
2022-07-25 11:27:17,666 - INFO - [TRAIN] epoch=149/160, iter=17170/18560, loss=2.618417, lr=0.000380 | ETA 03:24:39
2022-07-25 11:27:30,428 - INFO - [TRAIN] epoch=149/160, iter=17180/18560, loss=2.477524, lr=0.000375 | ETA 00:29:08
2022-07-25 11:27:43,058 - INFO - [TRAIN] epoch=149/160, iter=17190/18560, loss=2.361242, lr=0.000369 | ETA 00:28:36
2022-07-25 11:27:55,506 - INFO - [TRAIN] epoch=149/160, iter=17200/18560, loss=2.297547, lr=0.000364 | ETA 00:27:39
2022-07-25 11:28:07,728 - INFO - [TRAIN] epoch=149/160, iter=17210/18560, loss=2.474021, lr=0.000359 | ETA 00:27:29
2022-07-25 11:28:20,005 - INFO - [TRAIN] epoch=149/160, iter=17220/18560, loss=2.235734, lr=0.000354 | ETA 00:27:39
2022-07-25 11:28:32,560 - INFO - [TRAIN] epoch=149/160, iter=17230/18560, loss=2.397653, lr=0.000348 | ETA 00:28:04
2022-07-25 11:28:44,967 - INFO - [TRAIN] epoch=149/160, iter=17240/18560, loss=2.399362, lr=0.000343 | ETA 00:27:22
2022-07-25 11:28:57,444 - INFO - [TRAIN] epoch=149/160, iter=17250/18560, loss=2.523354, lr=0.000338 | ETA 00:26:51
2022-07-25 11:29:09,651 - INFO - [TRAIN] epoch=149/160, iter=17260/18560, loss=2.540237, lr=0.000333 | ETA 00:26:27
2022-07-25 11:29:21,897 - INFO - [TRAIN] epoch=149/160, iter=17270/18560, loss=2.389271, lr=0.000328 | ETA 00:26:19
2022-07-25 11:29:34,096 - INFO - [TRAIN] epoch=149/160, iter=17280/18560, loss=2.559793, lr=0.000323 | ETA 00:25:46
2022-07-25 11:30:15,364 - INFO - [TRAIN] epoch=150/160, iter=17290/18560, loss=2.413513, lr=0.000318 | ETA 00:35:45
2022-07-25 11:30:27,855 - INFO - [TRAIN] epoch=150/160, iter=17300/18560, loss=2.245849, lr=0.000313 | ETA 00:25:58
2022-07-25 11:30:40,358 - INFO - [TRAIN] epoch=150/160, iter=17310/18560, loss=2.253937, lr=0.000308 | ETA 00:25:46
2022-07-25 11:30:52,776 - INFO - [TRAIN] epoch=150/160, iter=17320/18560, loss=2.614157, lr=0.000303 | ETA 00:25:53
2022-07-25 11:31:05,183 - INFO - [TRAIN] epoch=150/160, iter=17330/18560, loss=2.485294, lr=0.000299 | ETA 00:25:02
2022-07-25 11:31:17,479 - INFO - [TRAIN] epoch=150/160, iter=17340/18560, loss=2.388159, lr=0.000294 | ETA 00:25:32
2022-07-25 11:31:29,657 - INFO - [TRAIN] epoch=150/160, iter=17350/18560, loss=2.145643, lr=0.000289 | ETA 00:24:17
2022-07-25 11:31:41,827 - INFO - [TRAIN] epoch=150/160, iter=17360/18560, loss=2.352712, lr=0.000284 | ETA 00:24:36
2022-07-25 11:31:54,215 - INFO - [TRAIN] epoch=150/160, iter=17370/18560, loss=2.541248, lr=0.000280 | ETA 00:24:39
2022-07-25 11:32:06,758 - INFO - [TRAIN] epoch=150/160, iter=17380/18560, loss=2.586325, lr=0.000275 | ETA 00:24:58
2022-07-25 11:32:19,147 - INFO - [TRAIN] epoch=150/160, iter=17390/18560, loss=2.416200, lr=0.000270 | ETA 00:23:50
2022-07-25 11:32:31,265 - INFO - [TRAIN] epoch=150/160, iter=17400/18560, loss=2.492123, lr=0.000266 | ETA 00:23:15
2022-07-25 11:32:31,284 - INFO - Pop model from output_nospa/epoch_125
2022-07-25 11:32:31,469 - INFO - Push model to checkpoint output_nospa/epoch_150
2022-07-25 11:33:12,818 - INFO - [TRAIN] epoch=151/160, iter=17410/18560, loss=2.278226, lr=0.000261 | ETA 00:24:05
2022-07-25 11:33:25,214 - INFO - [TRAIN] epoch=151/160, iter=17420/18560, loss=2.299737, lr=0.000257 | ETA 00:23:13
2022-07-25 11:33:37,412 - INFO - [TRAIN] epoch=151/160, iter=17430/18560, loss=2.445905, lr=0.000252 | ETA 00:22:45
2022-07-25 11:33:49,761 - INFO - [TRAIN] epoch=151/160, iter=17440/18560, loss=2.476056, lr=0.000248 | ETA 00:22:59
2022-07-25 11:34:02,021 - INFO - [TRAIN] epoch=151/160, iter=17450/18560, loss=2.354847, lr=0.000244 | ETA 00:22:28
2022-07-25 11:34:14,234 - INFO - [TRAIN] epoch=151/160, iter=17460/18560, loss=2.323349, lr=0.000239 | ETA 00:22:23
2022-07-25 11:34:26,441 - INFO - [TRAIN] epoch=151/160, iter=17470/18560, loss=2.401113, lr=0.000235 | ETA 00:21:56
2022-07-25 11:34:38,777 - INFO - [TRAIN] epoch=151/160, iter=17480/18560, loss=2.557165, lr=0.000231 | ETA 00:22:13
2022-07-25 11:34:51,115 - INFO - [TRAIN] epoch=151/160, iter=17490/18560, loss=2.442869, lr=0.000227 | ETA 00:21:50
2022-07-25 11:35:03,477 - INFO - [TRAIN] epoch=151/160, iter=17500/18560, loss=2.567640, lr=0.000222 | ETA 00:21:44
2022-07-25 11:35:15,652 - INFO - [TRAIN] epoch=151/160, iter=17510/18560, loss=2.365428, lr=0.000218 | ETA 00:21:41
2022-07-25 11:35:59,866 - INFO - [TRAIN] epoch=152/160, iter=17520/18560, loss=2.442748, lr=0.000214 | ETA 00:55:42
2022-07-25 11:36:12,479 - INFO - [TRAIN] epoch=152/160, iter=17530/18560, loss=2.270942, lr=0.000210 | ETA 00:22:13
2022-07-25 11:36:24,760 - INFO - [TRAIN] epoch=152/160, iter=17540/18560, loss=2.346987, lr=0.000206 | ETA 00:20:53
2022-07-25 11:36:37,168 - INFO - [TRAIN] epoch=152/160, iter=17550/18560, loss=2.295838, lr=0.000202 | ETA 00:21:00
2022-07-25 11:36:49,442 - INFO - [TRAIN] epoch=152/160, iter=17560/18560, loss=2.674676, lr=0.000198 | ETA 00:20:22
2022-07-25 11:37:01,825 - INFO - [TRAIN] epoch=152/160, iter=17570/18560, loss=2.407122, lr=0.000194 | ETA 00:20:07
2022-07-25 11:37:14,141 - INFO - [TRAIN] epoch=152/160, iter=17580/18560, loss=2.382856, lr=0.000190 | ETA 00:20:04
2022-07-25 11:37:26,503 - INFO - [TRAIN] epoch=152/160, iter=17590/18560, loss=2.711610, lr=0.000187 | ETA 00:20:20
2022-07-25 11:37:38,733 - INFO - [TRAIN] epoch=152/160, iter=17600/18560, loss=2.476916, lr=0.000183 | ETA 00:19:47
2022-07-25 11:37:50,969 - INFO - [TRAIN] epoch=152/160, iter=17610/18560, loss=2.328329, lr=0.000179 | ETA 00:19:26
2022-07-25 11:38:03,284 - INFO - [TRAIN] epoch=152/160, iter=17620/18560, loss=2.254205, lr=0.000175 | ETA 00:19:11
2022-07-25 11:38:15,454 - INFO - [TRAIN] epoch=152/160, iter=17630/18560, loss=2.429274, lr=0.000172 | ETA 00:18:33
2022-07-25 11:38:58,410 - INFO - [TRAIN] epoch=153/160, iter=17640/18560, loss=2.418985, lr=0.000168 | ETA 00:21:03
2022-07-25 11:39:10,863 - INFO - [TRAIN] epoch=153/160, iter=17650/18560, loss=2.446015, lr=0.000164 | ETA 00:19:01
2022-07-25 11:39:23,193 - INFO - [TRAIN] epoch=153/160, iter=17660/18560, loss=2.269283, lr=0.000161 | ETA 00:18:06
2022-07-25 11:39:35,366 - INFO - [TRAIN] epoch=153/160, iter=17670/18560, loss=2.304738, lr=0.000157 | ETA 00:18:04
2022-07-25 11:39:47,738 - INFO - [TRAIN] epoch=153/160, iter=17680/18560, loss=2.367094, lr=0.000154 | ETA 00:18:11
2022-07-25 11:40:00,143 - INFO - [TRAIN] epoch=153/160, iter=17690/18560, loss=2.166182, lr=0.000150 | ETA 00:18:11
2022-07-25 11:40:12,404 - INFO - [TRAIN] epoch=153/160, iter=17700/18560, loss=2.325011, lr=0.000147 | ETA 00:17:20
2022-07-25 11:40:24,941 - INFO - [TRAIN] epoch=153/160, iter=17710/18560, loss=2.616034, lr=0.000143 | ETA 00:18:00
2022-07-25 11:40:37,488 - INFO - [TRAIN] epoch=153/160, iter=17720/18560, loss=2.221381, lr=0.000140 | ETA 00:17:18
2022-07-25 11:40:49,666 - INFO - [TRAIN] epoch=153/160, iter=17730/18560, loss=2.607089, lr=0.000137 | ETA 00:16:46
2022-07-25 11:41:01,898 - INFO - [TRAIN] epoch=153/160, iter=17740/18560, loss=2.257216, lr=0.000134 | ETA 00:17:03
2022-07-25 11:41:51,521 - INFO - [TRAIN] epoch=154/160, iter=17750/18560, loss=2.402728, lr=0.000130 | ETA 02:22:42
2022-07-25 11:42:04,041 - INFO - [TRAIN] epoch=154/160, iter=17760/18560, loss=2.290321, lr=0.000127 | ETA 00:16:44
2022-07-25 11:42:16,612 - INFO - [TRAIN] epoch=154/160, iter=17770/18560, loss=2.297022, lr=0.000124 | ETA 00:16:11
2022-07-25 11:42:29,303 - INFO - [TRAIN] epoch=154/160, iter=17780/18560, loss=2.427579, lr=0.000121 | ETA 00:16:38
2022-07-25 11:42:41,766 - INFO - [TRAIN] epoch=154/160, iter=17790/18560, loss=2.539531, lr=0.000118 | ETA 00:16:17
2022-07-25 11:42:54,177 - INFO - [TRAIN] epoch=154/160, iter=17800/18560, loss=2.179120, lr=0.000115 | ETA 00:15:33
2022-07-25 11:43:06,355 - INFO - [TRAIN] epoch=154/160, iter=17810/18560, loss=2.281892, lr=0.000112 | ETA 00:15:20
2022-07-25 11:43:18,543 - INFO - [TRAIN] epoch=154/160, iter=17820/18560, loss=2.425623, lr=0.000109 | ETA 00:15:00
2022-07-25 11:43:30,893 - INFO - [TRAIN] epoch=154/160, iter=17830/18560, loss=2.598502, lr=0.000106 | ETA 00:15:04
2022-07-25 11:43:43,080 - INFO - [TRAIN] epoch=154/160, iter=17840/18560, loss=2.433475, lr=0.000103 | ETA 00:14:26
2022-07-25 11:43:55,498 - INFO - [TRAIN] epoch=154/160, iter=17850/18560, loss=2.474093, lr=0.000100 | ETA 00:14:43
2022-07-25 11:44:07,672 - INFO - [TRAIN] epoch=154/160, iter=17860/18560, loss=2.538239, lr=0.000098 | ETA 00:14:10
2022-07-25 11:44:51,457 - INFO - [TRAIN] epoch=155/160, iter=17870/18560, loss=2.410008, lr=0.000095 | ETA 00:20:25
2022-07-25 11:45:04,085 - INFO - [TRAIN] epoch=155/160, iter=17880/18560, loss=2.398698, lr=0.000092 | ETA 00:14:02
2022-07-25 11:45:16,450 - INFO - [TRAIN] epoch=155/160, iter=17890/18560, loss=2.349261, lr=0.000089 | ETA 00:13:38
2022-07-25 11:45:28,766 - INFO - [TRAIN] epoch=155/160, iter=17900/18560, loss=2.410124, lr=0.000087 | ETA 00:13:27
2022-07-25 11:45:41,042 - INFO - [TRAIN] epoch=155/160, iter=17910/18560, loss=2.477190, lr=0.000084 | ETA 00:13:16
2022-07-25 11:45:53,353 - INFO - [TRAIN] epoch=155/160, iter=17920/18560, loss=2.432264, lr=0.000082 | ETA 00:13:23
2022-07-25 11:46:05,681 - INFO - [TRAIN] epoch=155/160, iter=17930/18560, loss=2.385686, lr=0.000079 | ETA 00:13:00
2022-07-25 11:46:18,170 - INFO - [TRAIN] epoch=155/160, iter=17940/18560, loss=2.592744, lr=0.000077 | ETA 00:13:02
2022-07-25 11:46:30,744 - INFO - [TRAIN] epoch=155/160, iter=17950/18560, loss=2.413785, lr=0.000074 | ETA 00:12:54
2022-07-25 11:46:43,205 - INFO - [TRAIN] epoch=155/160, iter=17960/18560, loss=2.278495, lr=0.000072 | ETA 00:12:17
2022-07-25 11:46:55,435 - INFO - [TRAIN] epoch=155/160, iter=17970/18560, loss=2.427830, lr=0.000069 | ETA 00:12:00
2022-07-25 11:47:07,568 - INFO - [TRAIN] epoch=155/160, iter=17980/18560, loss=2.480835, lr=0.000067 | ETA 00:11:43
2022-07-25 11:47:07,588 - INFO - Pop model from output_nospa/epoch_130
2022-07-25 11:47:07,778 - INFO - Push model to checkpoint output_nospa/epoch_155
2022-07-25 11:47:50,910 - INFO - [TRAIN] epoch=156/160, iter=17990/18560, loss=2.415217, lr=0.000065 | ETA 00:11:53
2022-07-25 11:48:03,144 - INFO - [TRAIN] epoch=156/160, iter=18000/18560, loss=2.218254, lr=0.000063 | ETA 00:11:28
2022-07-25 11:48:15,416 - INFO - [TRAIN] epoch=156/160, iter=18010/18560, loss=2.270191, lr=0.000060 | ETA 00:11:08
2022-07-25 11:48:27,661 - INFO - [TRAIN] epoch=156/160, iter=18020/18560, loss=2.521739, lr=0.000058 | ETA 00:10:49
2022-07-25 11:48:39,896 - INFO - [TRAIN] epoch=156/160, iter=18030/18560, loss=2.162685, lr=0.000056 | ETA 00:10:52
2022-07-25 11:48:52,084 - INFO - [TRAIN] epoch=156/160, iter=18040/18560, loss=2.363710, lr=0.000054 | ETA 00:10:25
2022-07-25 11:49:04,277 - INFO - [TRAIN] epoch=156/160, iter=18050/18560, loss=2.295409, lr=0.000052 | ETA 00:10:24
2022-07-25 11:49:16,554 - INFO - [TRAIN] epoch=156/160, iter=18060/18560, loss=2.768045, lr=0.000050 | ETA 00:10:22
2022-07-25 11:49:29,003 - INFO - [TRAIN] epoch=156/160, iter=18070/18560, loss=2.207337, lr=0.000048 | ETA 00:10:17
2022-07-25 11:49:41,564 - INFO - [TRAIN] epoch=156/160, iter=18080/18560, loss=2.338698, lr=0.000046 | ETA 00:09:46
2022-07-25 11:49:53,820 - INFO - [TRAIN] epoch=156/160, iter=18090/18560, loss=2.426579, lr=0.000044 | ETA 00:09:38
2022-07-25 11:50:44,268 - INFO - [TRAIN] epoch=157/160, iter=18100/18560, loss=2.418967, lr=0.000042 | ETA 00:27:47
2022-07-25 11:50:56,545 - INFO - [TRAIN] epoch=157/160, iter=18110/18560, loss=2.308566, lr=0.000041 | ETA 00:09:18
2022-07-25 11:51:08,872 - INFO - [TRAIN] epoch=157/160, iter=18120/18560, loss=2.230700, lr=0.000039 | ETA 00:08:57
2022-07-25 11:51:21,234 - INFO - [TRAIN] epoch=157/160, iter=18130/18560, loss=2.349017, lr=0.000037 | ETA 00:08:43
2022-07-25 11:51:33,531 - INFO - [TRAIN] epoch=157/160, iter=18140/18560, loss=2.530039, lr=0.000035 | ETA 00:08:37
2022-07-25 11:51:46,876 - INFO - [TRAIN] epoch=157/160, iter=18150/18560, loss=2.283750, lr=0.000034 | ETA 00:12:04
2022-07-25 11:52:00,324 - INFO - [TRAIN] epoch=157/160, iter=18160/18560, loss=2.235580, lr=0.000032 | ETA 00:08:30
2022-07-25 11:52:13,169 - INFO - [TRAIN] epoch=157/160, iter=18170/18560, loss=2.364787, lr=0.000030 | ETA 00:08:03
2022-07-25 11:52:25,297 - INFO - [TRAIN] epoch=157/160, iter=18180/18560, loss=2.544666, lr=0.000029 | ETA 00:07:41
2022-07-25 11:52:38,084 - INFO - [TRAIN] epoch=157/160, iter=18190/18560, loss=2.568426, lr=0.000027 | ETA 00:08:02
2022-07-25 11:52:52,548 - INFO - [TRAIN] epoch=157/160, iter=18200/18560, loss=2.468578, lr=0.000026 | ETA 00:07:40
2022-07-25 11:53:05,187 - INFO - [TRAIN] epoch=157/160, iter=18210/18560, loss=2.519248, lr=0.000025 | ETA 00:07:10
2022-07-25 11:53:52,589 - INFO - [TRAIN] epoch=158/160, iter=18220/18560, loss=2.393024, lr=0.000023 | ETA 00:10:30
2022-07-25 11:54:05,270 - INFO - [TRAIN] epoch=158/160, iter=18230/18560, loss=2.398245, lr=0.000022 | ETA 00:06:46
2022-07-25 11:54:17,751 - INFO - [TRAIN] epoch=158/160, iter=18240/18560, loss=2.185709, lr=0.000021 | ETA 00:06:34
2022-07-25 11:54:31,603 - INFO - [TRAIN] epoch=158/160, iter=18250/18560, loss=2.618774, lr=0.000019 | ETA 00:06:42
2022-07-25 11:54:44,012 - INFO - [TRAIN] epoch=158/160, iter=18260/18560, loss=2.508224, lr=0.000018 | ETA 00:06:09
2022-07-25 11:54:57,893 - INFO - [TRAIN] epoch=158/160, iter=18270/18560, loss=2.306517, lr=0.000017 | ETA 00:06:23
2022-07-25 11:55:10,081 - INFO - [TRAIN] epoch=158/160, iter=18280/18560, loss=2.405667, lr=0.000016 | ETA 00:05:40
2022-07-25 11:55:22,390 - INFO - [TRAIN] epoch=158/160, iter=18290/18560, loss=2.441481, lr=0.000015 | ETA 00:05:28
2022-07-25 11:55:36,107 - INFO - [TRAIN] epoch=158/160, iter=18300/18560, loss=2.190574, lr=0.000014 | ETA 00:06:49
2022-07-25 11:55:52,562 - INFO - [TRAIN] epoch=158/160, iter=18310/18560, loss=2.689454, lr=0.000013 | ETA 00:06:57
2022-07-25 11:56:05,866 - INFO - [TRAIN] epoch=158/160, iter=18320/18560, loss=2.497480, lr=0.000012 | ETA 00:05:01
2022-07-25 11:56:57,067 - INFO - [TRAIN] epoch=159/160, iter=18330/18560, loss=2.421674, lr=0.000011 | ETA 00:41:58
2022-07-25 11:57:09,559 - INFO - [TRAIN] epoch=159/160, iter=18340/18560, loss=2.484238, lr=0.000010 | ETA 00:04:29
2022-07-25 11:57:21,980 - INFO - [TRAIN] epoch=159/160, iter=18350/18560, loss=2.289275, lr=0.000009 | ETA 00:04:20
2022-07-25 11:57:34,343 - INFO - [TRAIN] epoch=159/160, iter=18360/18560, loss=2.338437, lr=0.000008 | ETA 00:04:06
2022-07-25 11:57:46,936 - INFO - [TRAIN] epoch=159/160, iter=18370/18560, loss=2.418895, lr=0.000007 | ETA 00:03:56
2022-07-25 11:57:59,332 - INFO - [TRAIN] epoch=159/160, iter=18380/18560, loss=2.265922, lr=0.000007 | ETA 00:03:39
2022-07-25 11:58:11,600 - INFO - [TRAIN] epoch=159/160, iter=18390/18560, loss=2.351922, lr=0.000006 | ETA 00:03:26
2022-07-25 11:58:24,052 - INFO - [TRAIN] epoch=159/160, iter=18400/18560, loss=2.470027, lr=0.000005 | ETA 00:03:18
2022-07-25 11:58:36,556 - INFO - [TRAIN] epoch=159/160, iter=18410/18560, loss=2.574253, lr=0.000005 | ETA 00:03:07
2022-07-25 11:58:49,025 - INFO - [TRAIN] epoch=159/160, iter=18420/18560, loss=2.567105, lr=0.000004 | ETA 00:02:56
2022-07-25 11:59:01,326 - INFO - [TRAIN] epoch=159/160, iter=18430/18560, loss=2.305873, lr=0.000004 | ETA 00:02:38
2022-07-25 11:59:13,886 - INFO - [TRAIN] epoch=159/160, iter=18440/18560, loss=2.230552, lr=0.000003 | ETA 00:02:30
2022-07-25 12:00:07,206 - INFO - [TRAIN] epoch=160/160, iter=18450/18560, loss=2.478170, lr=0.000003 | ETA 00:03:25
2022-07-25 12:00:19,576 - INFO - [TRAIN] epoch=160/160, iter=18460/18560, loss=2.473450, lr=0.000002 | ETA 00:02:05
2022-07-25 12:00:32,058 - INFO - [TRAIN] epoch=160/160, iter=18470/18560, loss=2.184308, lr=0.000002 | ETA 00:01:50
2022-07-25 12:00:44,644 - INFO - [TRAIN] epoch=160/160, iter=18480/18560, loss=2.271593, lr=0.000001 | ETA 00:01:39
2022-07-25 12:00:57,261 - INFO - [TRAIN] epoch=160/160, iter=18490/18560, loss=2.288844, lr=0.000001 | ETA 00:01:30
2022-07-25 12:01:09,676 - INFO - [TRAIN] epoch=160/160, iter=18500/18560, loss=2.307953, lr=0.000001 | ETA 00:01:14
2022-07-25 12:01:22,075 - INFO - [TRAIN] epoch=160/160, iter=18510/18560, loss=2.578577, lr=0.000001 | ETA 00:01:02
2022-07-25 12:01:34,506 - INFO - [TRAIN] epoch=160/160, iter=18520/18560, loss=2.595699, lr=0.000000 | ETA 00:00:50
LAUNCH INFO 2022-07-25 12:02:29,673 Pod completed
LAUNCH INFO 2022-07-25 12:02:29,675 Exit code 0
2022-07-25 12:01:47,184 - INFO - [TRAIN] epoch=160/160, iter=18530/18560, loss=2.395863, lr=0.000000 | ETA 00:00:38
2022-07-25 12:01:59,892 - INFO - [TRAIN] epoch=160/160, iter=18540/18560, loss=2.435884, lr=0.000000 | ETA 00:00:25
2022-07-25 12:02:12,609 - INFO - [TRAIN] epoch=160/160, iter=18550/18560, loss=2.436638, lr=0.000000 | ETA 00:00:12
2022-07-25 12:02:25,259 - INFO - [TRAIN] epoch=160/160, iter=18560/18560, loss=2.522927, lr=0.000000 | ETA 00:00:00
2022-07-25 12:02:25,288 - INFO - Pop model from output_nospa/epoch_135
2022-07-25 12:02:25,750 - INFO - Push model to checkpoint output_nospa/epoch_160
2022-07-25 12:02:25,862 - INFO - Training is complete.