-
Notifications
You must be signed in to change notification settings - Fork 6
/
Copy path2020.06.16.txt
1710 lines (1402 loc) · 127 KB
/
2020.06.16.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
==========New Papers==========
1, TITLE: Coherent Reconstruction of Multiple Humans from a Single Image
http://arxiv.org/abs/2006.08586
AUTHORS: Wen Jiang ; Nikos Kolotouros ; Georgios Pavlakos ; Xiaowei Zhou ; Kostas Daniilidis
COMMENTS: CVPR 2020. Project Page: https://jiangwenpl.github.io/multiperson/
HIGHLIGHT: In this work, we address the problem of multi-person 3D pose estimation from a single image.
2, TITLE: Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEncoder
http://arxiv.org/abs/2006.08101
AUTHORS: Daya Guo ; Duyu Tang ; Nan Duan ; Jian Yin ; Daxin Jiang ; Ming Zhou
COMMENTS: Accepted by ACL 2020
HIGHLIGHT: To address this, we propose an approach that automatically finds evidence for an event from a large text corpus, and leverages the evidence to guide the generation of inferential texts.
3, TITLE: Spectral DiffuserCam: lensless snapshot hyperspectral imaging with a spectral filter array
http://arxiv.org/abs/2006.08565
AUTHORS: Kristina Monakhova ; Kyrollos Yanny ; Neerja Aggarwal ; Laura Waller
HIGHLIGHT: In this paper, we propose a novel, compact, and inexpensive computational camera for snapshot hyperspectral imaging.
4, TITLE: Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction
http://arxiv.org/abs/2006.08558
AUTHORS: Yaodong Yu ; Kwan Ho Ryan Chan ; Chong You ; Chaobing Song ; Yi Ma
HIGHLIGHT: To learn intrinsic low-dimensional structures from high-dimensional data that most discriminate between classes, we propose the principle of Maximal Coding Rate Reduction ($\text{MCR}^2$), an information-theoretic measure that maximizes the coding rate difference between the whole dataset and the sum of each individual class.
5, TITLE: Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
http://arxiv.org/abs/2006.08555
AUTHORS: Stephen McAleer ; John Lanier ; Roy Fox ; Pierre Baldi
COMMENTS: SM and JL contributed equally
HIGHLIGHT: We introduce Pipeline PSRO (P2SRO), the first scalable general method for finding approximate Nash equilibria in large zero-sum imperfect-information games.
6, TITLE: Now that I can see, I can improve: Enabling data-driven finetuning of CNNs on the edge
http://arxiv.org/abs/2006.08554
AUTHORS: Aditya Rajagopal ; Christos-Savvas Bouganis
COMMENTS: Accepted for publication at CVPR2020 workshop - Efficient Deep Learning for Computer Vision
HIGHLIGHT: It explores the performance gains and costs of doing so and presents an extensible open-source framework that allows the deployment of such approaches on a wide range of network architectures and devices.
7, TITLE: Visibility Guided NMS: Efficient Boosting of Amodal Object Detection in Crowded Traffic Scenes
http://arxiv.org/abs/2006.08547
AUTHORS: Nils Gählert ; Niklas Hanselmann ; Uwe Franke ; Joachim Denzler
COMMENTS: Machine Learning for Autonomous Driving Workshop at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada
HIGHLIGHT: Modern 2D object detection frameworks such as Yolo, SSD or Faster R-CNN predict multiple bounding boxes per object that are refined using Non-Maximum-Suppression (NMS) to suppress all but one bounding box.
8, TITLE: Transferring Monolingual Model to Low-Resource Language: The Case of Tigrinya
http://arxiv.org/abs/2006.07698
AUTHORS: Abrhalei Tela ; Abraham Woubie ; Ville Hautamaki
HIGHLIGHT: In this work, we propose a cost-effective transfer learning method to adopt a strong source language model, trained from a large monolingual corpus to a low-resource language.
9, TITLE: Sensorless Freehand 3D Ultrasound Reconstruction via Deep Contextual Learning
http://arxiv.org/abs/2006.07694
AUTHORS: Hengtao Guo ; Sheng Xu ; Bradford Wood ; Pingkun Yan
COMMENTS: Provisionally accepted by MICCAI 2020
HIGHLIGHT: In this paper, we propose a deep contextual learning network (DCL-Net), which can efficiently exploit the image feature relationship between US frames and reconstruct 3D US volumes without any tracking device.
10, TITLE: Efficient Black-Box Adversarial Attack Guided by the Distribution of Adversarial Perturbations
http://arxiv.org/abs/2006.08538
AUTHORS: Yan Feng ; Baoyuan Wu ; Yanbo Fan ; Zhifeng Li ; Shutao Xia
HIGHLIGHT: In this work, we propose to transform the Gaussian-distributed variable to another space through a conditional flow-based model, to enhance the capability and flexibility of capturing the intrinsic distribution of adversarial perturbations conditioned on the benign example.
11, TITLE: Improved Conditional Flow Models for Molecule to Image Synthesis
http://arxiv.org/abs/2006.08532
AUTHORS: Karren Yang ; Samuel Goldman ; Wengong Jin ; Alex Lu ; Regina Barzilay ; Tommi Jaakkola ; Caroline Uhler
HIGHLIGHT: In this paper, we aim to synthesize cell microscopy images under different molecular interventions, motivated by practical applications to drug development. To evaluate our model, we propose a new set of metrics for biological image generation that are robust, interpretable, and relevant to practitioners.
12, TITLE: Go-CaRD -- Generic, Optical Car Part Recognition and Detection: Collection, Insights, and Applications
http://arxiv.org/abs/2006.08521
AUTHORS: Lukas Stappen ; Xinchen Du ; Vincent Karas ; Stefan Müller ; Björn W. Schuller
COMMENTS: submitted to IEEE MMSP 2020
HIGHLIGHT: In this paper, we present three suitable datasets as well as quantitatively and qualitatively explore the efficacy of state-of-the-art deep learning architectures for the localisation of 29 interior and exterior vehicle regions, independent of brand, model, and environment.
13, TITLE: Pure Pattern Calculus à la de Bruijn
http://arxiv.org/abs/2006.07674
AUTHORS: Alexis Martín ; Alejandro Ríos ; Andrés Viso
HIGHLIGHT: This paper extends de Bruijn's ideas to properly overcome the multi-binding problem by introducing a novel presentation of PPC with bidimensional indices, in an effort to implement a prototype for a typed functional programming language based on PPC that captures path polymorphism.
14, TITLE: The Limit of the Batch Size
http://arxiv.org/abs/2006.08517
AUTHORS: Yang You ; Yuhui Wang ; Huan Zhang ; Zhao Zhang ; James Demmel ; Cho-Jui Hsieh
HIGHLIGHT: In this paper, we focus on studying the limit of the batch size.
15, TITLE: AutoGAN-Distiller: Searching to Compress Generative Adversarial Networks
http://arxiv.org/abs/2006.08198
AUTHORS: Yonggan Fu ; Wuyang Chen ; Haotao Wang ; Haoran Li ; Yingyan Lin ; Zhangyang Wang
COMMENTS: Accepted by ICML2020
HIGHLIGHT: Inspired by the recent success of AutoML in deep compression, we introduce AutoML to GAN compression and develop an AutoGAN-Distiller (AGD) framework.
16, TITLE: Extracting N-ary Cross-sentence Relations using Constrained Subsequence Kernel
http://arxiv.org/abs/2006.08185
AUTHORS: Sachin Pawar ; Pushpak Bhattacharyya ; Girish K. Palshikar
COMMENTS: Appeared in 20th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2019), https://www.cicling.org/2019/
HIGHLIGHT: We propose a new formulation of the relation extraction task where the relations are more general than intra-sentence relations in the sense that they may span multiple sentences and may have more than two arguments.
17, TITLE: Infinite Feature Selection: A Graph-based Feature Filtering Approach
http://arxiv.org/abs/2006.08184
AUTHORS: Giorgio Roffo ; Simone Melzi ; Umberto Castellani ; Alessandro Vinciarelli ; Marco Cristani
COMMENTS: TPAMI PREPRINT 2020
HIGHLIGHT: We propose a filtering feature selection framework that considers subsets of features as paths in a graph, where a node is a feature and an edge indicates pairwise (customizable) relations among features, dealing with relevance and redundancy principles.
18, TITLE: Derivative-free global minimization for a class of multiple minima problems
http://arxiv.org/abs/2006.08181
AUTHORS: Xiaopeng Luo ; Xin Xu ; Daoyi Dong
COMMENTS: 14 pages, 3 figures
HIGHLIGHT: We prove that the finite-difference based derivative-free descent (FD-DFD) methods have a capability to find the global minima for a class of multiple minima problems.
19, TITLE: Neural gradients are lognormally distributed: understanding sparse and quantized training
http://arxiv.org/abs/2006.08173
AUTHORS: Brian Chmiel ; Liad Ben-Uri ; Moran Shkolnik ; Elad Hoffer ; Ron Banner ; Daniel Soudry
HIGHLIGHT: Taking this into account, we suggest two methods to reduce the computational and memory burdens of neural gradients.
20, TITLE: Learn to Effectively Explore in Context-Based Meta-RL
http://arxiv.org/abs/2006.08170
AUTHORS: Jin Zhang ; Jianhao Wang ; Hao Hu ; Yingfeng Chen ; Changjie Fan ; Chongjie Zhang
COMMENTS: Committed to NeurIPS 2020
HIGHLIGHT: In this paper, we present a novel off-policy context-based meta-RL approach that efficiently learns a separate exploration policy to support fast adaptation, as well as a context-aware exploitation policy to maximize extrinsic return.
21, TITLE: Binary DAD-Net: Binarized Driveable Area Detection Network for Autonomous Driving
http://arxiv.org/abs/2006.08178
AUTHORS: Alexander Frickenstein ; Manoj Rohit Vemparala ; Jakob Mayr ; Naveen Shankar Nagaraja ; Christian Unger ; Federico Tombari ; Walter Stechele
COMMENTS: IEEE International Conference on Robotics and Automation (ICRA) 2020
HIGHLIGHT: This paper proposes a novel binarized driveable area detection network (binary DAD-Net), which uses only binary weights and activations in the encoder, the bottleneck, and the decoder part.
22, TITLE: Dissimilarity Mixture Autoencoder for Deep Clustering
http://arxiv.org/abs/2006.08177
AUTHORS: Juan S. Lara ; Fabio A. González
COMMENTS: 8 pages (5 additional pages for broader impact, references and supplementary material)
HIGHLIGHT: In this paper, we introduce the Dissimilarity Mixture Autoencoder (DMAE), a novel neural network model that uses a dissimilarity function to generalize a family of density estimation and clustering methods.
23, TITLE: Public Willingness to Get Vaccinated Against COVID-19: How AI-Developed Vaccines Can Affect Acceptance
http://arxiv.org/abs/2006.08164
AUTHORS: Gabriel Lima ; Hyeyoung Hwang ; Chiyoung Cha ; Meeyoung Cha
HIGHLIGHT: We used a between-subjects study design (N=572 adults in the UK and UK) to understand the public willingness towards vaccination against the novel coronavirus under various circumstances.
24, TITLE: Filter design for small target detection on infrared imagery using normalized-cross-correlation layer
http://arxiv.org/abs/2006.08162
AUTHORS: H. Seçkin Demir ; Erdem Akagunduz
HIGHLIGHT: In this paper, we introduce a machine learning approach to the problem of infrared small target detection filter design.
25, TITLE: Intelligent Decision Support System for Updating Control Plans
http://arxiv.org/abs/2006.08153
AUTHORS: Fadwa Oukhay ; Pascale Zaraté ; Taieb Romdhane
HIGHLIGHT: This paper proposes an intelligent DSS for quality control planning.
26, TITLE: Group Decision Support for agriculture planning by a combination of Mathematical Model and Collaborative Tool
http://arxiv.org/abs/2006.08151
AUTHORS: Pascale Zaraté ; Alemany Mme ; Ana Esteso Alvarez ; Amir Sakka ; Guy Camilleri
HIGHLIGHT: In the current work the same experiment has been conducted with real business (farmers) in order to benefit from their expertise.
27, TITLE: A VIKOR and TOPSIS focused reanalysis of the MADM methods based on logarithmic normalization
http://arxiv.org/abs/2006.08150
AUTHORS: Sarfaraz Zolfani ; Morteza Yazdani ; Dragan Pamucar ; Pascale Zaraté
HIGHLIGHT: This normalization method had never been applied in any MADM methods before.
28, TITLE: Survey on Deep Multi-modal Data Analytics: Collaboration, Rivalry and Fusion
http://arxiv.org/abs/2006.08159
AUTHORS: Yang Wang
COMMENTS: Appearing at ACM TOMM, 26 pages
HIGHLIGHT: In this paper, we provide a substantial overview of the existing state-of-the-arts on the filed of multi-modal data analytics from shallow to deep spaces.
29, TITLE: Solution Subset Selection for Final Decision Making in Evolutionary Multi-Objective Optimization
http://arxiv.org/abs/2006.08156
AUTHORS: Hisao Ishibuchi ; Lie Meng Pang ; Ke Shang
HIGHLIGHT: In this paper, we discuss subset selection from a viewpoint of the final decision making.
30, TITLE: Selection of an Integrated Security Area for locating a State Military Organization (SMO) based on group decision system: a multicriteria approach
http://arxiv.org/abs/2006.08155
AUTHORS: Jean Gomes Turet ; Ana Paula Cabral Seixtas Cabral ; Pascale Zaraté
HIGHLIGHT: Thus, this paper aims to identify the best ISA to deploy a police battalion using group decision techniques and tools.
31, TITLE: Anomalous Motion Detection on Highway Using Deep Learning
http://arxiv.org/abs/2006.08143
AUTHORS: Harpreet Singh ; Emily M. Hand ; Kostas Alexis
COMMENTS: to be published in IEEE ICIP 2020
HIGHLIGHT: We evaluate state-of-the-art deep learning anomaly detection models and propose novel variations to these methods. This paper presents a new anomaly detection dataset - the Highway Traffic Anomaly (HTA) dataset - for the problem of detecting anomalous traffic patterns from dash cam videos of vehicles on highways.
32, TITLE: The Social Contract for AI
http://arxiv.org/abs/2006.08140
AUTHORS: Mirka Snyder Caron ; Abhishek Gupta
COMMENTS: Accepted paper for presentation at the IJCAI 2019 AI for Social Good workshop
HIGHLIGHT: For the purpose of this paper, we consider that a social contract arises when there is sufficient consensus within society to adopt and implement this new technology.
33, TITLE: Classifying degraded images over various levels of degradation
http://arxiv.org/abs/2006.08145
AUTHORS: Kazuki Endo ; Masayuki Tanaka ; Masatoshi Okutomi
COMMENTS: Accepted by the 27th IEEE International Conference on Image Processing (ICIP 2020)
HIGHLIGHT: This paper proposes a convolutional neural network to classify degraded images by using a restoration network and an ensemble learning.
34, TITLE: Emotion Recognition in Audio and Video Using Deep Neural Networks
http://arxiv.org/abs/2006.08129
AUTHORS: Mandeep Singh ; Yuan Fang
COMMENTS: 9 pages, 9 figures, 3 tables
HIGHLIGHT: In this work, we attempt to explore different neural networks to improve accuracy of emotion recognition.
35, TITLE: FinBERT: A Pretrained Language Model for Financial Communications
http://arxiv.org/abs/2006.08097
AUTHORS: Yi Yang ; Mark Christopher Siy UY ; Allen Huang
COMMENTS: https://github.com/yya518/FinBERT
HIGHLIGHT: In this work,we address the need by pretraining a financial domain specific BERT models, FinBERT, using a large scale of financial communication corpora.
36, TITLE: Neural Execution Engines: Learning to Execute Subroutines
http://arxiv.org/abs/2006.08084
AUTHORS: Yujun Yan ; Kevin Swersky ; Danai Koutra ; Parthasarathy Ranganathan ; Milad Heshemi
HIGHLIGHT: To address the issue, we propose a learned conditional masking mechanism, which enables the model to strongly generalize far outside of its training range with near-perfect accuracy on a variety of algorithms.
37, TITLE: Generalized Adversarially Learned Inference
http://arxiv.org/abs/2006.08089
AUTHORS: Yatin Dandi ; Homanga Bharadhwaj ; Abhishek Kumar ; Piyush Rai
COMMENTS: Under review
HIGHLIGHT: Within our proposed framework, we introduce a novel set of techniques for providing self-supervised feedback to the model based on properties, such as patch-level correspondence and cycle consistency of reconstructions.
38, TITLE: Words ranking and Hirsch index for identifying the core of the hapaxes in political texts
http://arxiv.org/abs/2006.07667
AUTHORS: Valerio Ficcadenti ; Roy Cerqueti ; Marcel Ausloos ; Gurjeet Dhesi
HIGHLIGHT: This paper deals with a quantitative analysis of the content of official political speeches.
39, TITLE: Uncertainty-aware Score Distribution Learning for Action Quality Assessment
http://arxiv.org/abs/2006.07665
AUTHORS: Yansong Tang ; Zanlin Ni ; Jiahuan Zhou ; Danyang Zhang ; Jiwen Lu ; Ying Wu ; Jie Zhou
COMMENTS: CVPR2020
HIGHLIGHT: To address this issue, we propose an uncertainty-aware score distribution learning (USDL) approach for action quality assessment (AQA).
40, TITLE: Convolutional Generation of Textured 3D Meshes
http://arxiv.org/abs/2006.07660
AUTHORS: Dario Pavllo ; Graham Spinks ; Thomas Hofmann ; Marie-Francine Moens ; Aurelien Lucchi
HIGHLIGHT: A key contribution of our work is the encoding of the mesh and texture as 2D representations, which are semantically aligned and can be easily modeled by a 2D convolutional GAN.
41, TITLE: APQ: Joint Search for Network Architecture, Pruning and Quantization Policy
http://arxiv.org/abs/2006.08509
AUTHORS: Tianzhe Wang ; Kuan Wang ; Han Cai ; Ji Lin ; Zhijian Liu ; Song Han
COMMENTS: Accepted by CVPR 2020
HIGHLIGHT: We present APQ for efficient deep learning inference on resource-constrained hardware.
42, TITLE: Regularized Forward-Backward Decoder for Attention Models
http://arxiv.org/abs/2006.08506
AUTHORS: Tobias Watzel ; Ludwig Kürzinger ; Lujun Li ; Gerhard Rigoll
COMMENTS: Under review for Interspeech 2020
HIGHLIGHT: In this paper, we propose a novel regularization technique incorporating a second decoder during the training phase.
43, TITLE: QD-RL: Efficient Mixing of Quality and Diversity in Reinforcement Learning
http://arxiv.org/abs/2006.08505
AUTHORS: Geoffrey Cideron ; Thomas Pierrot ; Nicolas Perrin ; Karim Beguir ; Olivier Sigaud
HIGHLIGHT: We propose a novel reinforcement learning algorithm,QD-RL, that incorporates the strengths of off-policy RL algorithms into Quality Diversity (QD) approaches.
44, TITLE: Quota-based debiasing can decrease representation of already underrepresented groups
http://arxiv.org/abs/2006.07647
AUTHORS: Ivan Smirnov ; Florian Lemmerich ; Markus Strohmaier
HIGHLIGHT: For the case of two correlated binary attributes we show that quota-based debiasing based on a single attribute can worsen the representation of already underrepresented groups and decrease overall fairness of selection.
45, TITLE: RoadNet-RT: High Throughput CNN Architecture and SoC Design for Real-Time Road Segmentation
http://arxiv.org/abs/2006.07644
AUTHORS: Lin Bai ; Yecheng Lyu ; Xinming Huang
COMMENTS: has been submitted for peer review
HIGHLIGHT: In order to reach real-time process speed, a light-weight, high-throughput CNN architecture namely RoadNet-RT is proposed for road segmentation in this paper.
46, TITLE: SE-MelGAN -- Speaker Agnostic Rapid Speech Enhancement
http://arxiv.org/abs/2006.07637
AUTHORS: Luka Chkhetiani ; Levan Bejanidze
COMMENTS: 4 pages, 1 image, 1 table, 1 page for references
HIGHLIGHT: We propose that it is possible to transfer the MelGAN's [3] robustness in learning speech features to speech enhancement and noise reduction domain without any model modification tasks.
47, TITLE: DeepRhythm: Exposing DeepFakes with Attentional Visual Heartbeat Rhythms
http://arxiv.org/abs/2006.07634
AUTHORS: Hua Qi ; Qing Guo ; Felix Juefei-Xu ; Xiaofei Xie ; Lei Ma ; Wei Feng ; Yang Liu ; Jianjun Zhao
COMMENTS: 11 pages, 7 figures
HIGHLIGHT: In this work, we show that our conjecture holds true and the proposed method indeed can very effectively exposeDeepFakes by monitoring the heartbeat rhythms, which is termedasDeepRhythm.
48, TITLE: Equivariant Neural Rendering
http://arxiv.org/abs/2006.07630
AUTHORS: Emilien Dupont ; Miguel Angel Bautista ; Alex Colburn ; Aditya Sankar ; Carlos Guestrin ; Josh Susskind ; Qi Shan
COMMENTS: ICML 2020 camera ready
HIGHLIGHT: We propose a framework for learning neural scene representations directly from images, without 3D supervision. In addition, we introduce two challenging new datasets for scene representation and neural rendering, including scenes with complex lighting and backgrounds.
49, TITLE: Special-case Algorithms for Blackbox Radical Membership, Nullstellensatz and Transcendence Degree
http://arxiv.org/abs/2006.07613
AUTHORS: Abhibhav Garg ; Nitin Saxena
HIGHLIGHT: We identify a useful case of these problems where practical algorithms, and improved bounds, could be given, when the transcendence degree $r$ of the input polynomials is smaller than the number of variables $n$.
50, TITLE: DTG-Net: Differentiated Teachers Guided Self-Supervised Video Action Recognition
http://arxiv.org/abs/2006.07609
AUTHORS: Ziming Liu ; Guangyu Gao ; A. K. Qin ; Jinyang Li
HIGHLIGHT: To reduce such dependency, we propose a self-supervised teacher-student architecture, i.e., the Differentiated Teachers Guided self-supervised Network (DTG-Net).
51, TITLE: HRDNet: High-resolution Detection Network for Small Objects
http://arxiv.org/abs/2006.07607
AUTHORS: Ziming Liu ; Guangyu Gao ; Lin Sun ; Zhiyuan Fang
HIGHLIGHT: To keep the benefits of high-resolution images without bringing up new problems, we proposed the High-Resolution Detection Network (HRDNet).
52, TITLE: Faces à la Carte: Text-to-Face Generation via Attribute Disentanglement
http://arxiv.org/abs/2006.07606
AUTHORS: Tianren Wang ; Teng Zhang ; Brian Lovell
COMMENTS: 8 pages, 4 figures
HIGHLIGHT: In this paper, we propose a Text-to-Face model that not only produces images in high resolution (1024x1024) with text-to-image consistency, but also outputs multiple diverse faces to cover a wide range of unspecified facial features in a natural way.
53, TITLE: Dynamic gesture retrieval: searching videos by human pose sequence
http://arxiv.org/abs/2006.07604
AUTHORS: Cheng Zhang
HIGHLIGHT: We propose a novel method for querying videos containing a designated sequence of human poses, whereas previous works only designate a single static pose.
54, TITLE: NoPeopleAllowed: The Three-Step Approach to Weakly Supervised Semantic Segmentation
http://arxiv.org/abs/2006.07601
AUTHORS: Mariia Dobko ; Ostap Viniavskyi ; Oles Dobosevych
COMMENTS: This short-paper was submitted to Learning from Imperfect Data workshop at CVPR 2020
HIGHLIGHT: We propose a novel approach to weakly supervised semantic segmentation, which consists of three consecutive steps.
55, TITLE: Improved algorithm for permutation testing
http://arxiv.org/abs/2006.08473
AUTHORS: Xiaojin Zhang
HIGHLIGHT: In this paper, we provide a simple adaptive algorithm with one-sided error for testing monotone permutation.
56, TITLE: Towards Incorporating Contextual Knowledge into the Prediction of Driving Behavior
http://arxiv.org/abs/2006.08470
AUTHORS: Florian Wirthmüller ; Julian Schlechtriemen ; Jochen Hipp ; Manfred Reichert
COMMENTS: the article has been accepted for publication during the 23rd IEEE Intelligent Transportation Systems Conference (ITSC), 7 pages, 6 figures, 1 table
HIGHLIGHT: Extending former studies, we investigate how predictions are affected by external conditions.
57, TITLE: Parametrized Fixed Points on O-Categories and Applications to Session Types
http://arxiv.org/abs/2006.08479
AUTHORS: Ryan Kavanagh
COMMENTS: Accepted at the 36th International Conference on Mathematical Foundations of Programming Semantics --- MFPS 2020
HIGHLIGHT: We generalize existing techniques to define a functorial dagger operation on locally continuous functors between O-categories.
58, TITLE: Comparing Alternative Route Planning Techniques: A Web-based Demonstration and User Study
http://arxiv.org/abs/2006.08475
AUTHORS: Lingxiao Li ; Muhammad Aamir Cheema ; Hua Lu ; Mohammed Eunus Ali ; Adel N. Toosi
HIGHLIGHT: Motivated by this, in this paper, we present the first user study that compares the quality of the alternative routes generated by four of the most popular existing approaches including the routes provided by Google Maps.
59, TITLE: Algorithmically Optimal Outer Measures
http://arxiv.org/abs/2006.08468
AUTHORS: Jack H. Lutz ; Neil Lutz
HIGHLIGHT: We introduce global and local optimality conditions for lower semicomputable outer measures.
60, TITLE: Oblivious and Semi-Oblivious Boundedness for Existential Rules
http://arxiv.org/abs/2006.08467
AUTHORS: Pierre Bourhis ; Michel Leclère ; Marie-Laure Mugnier ; Sophie Tison ; Federico Ulliana ; Lily Galois
HIGHLIGHT: We study the notion of boundedness in the context of positive existential rules, that is, whether there exists an upper bound to the depth of the chase procedure, that is independent from the initial instance.
61, TITLE: 3D-ZeF: A 3D Zebrafish Tracking Benchmark Dataset
http://arxiv.org/abs/2006.08466
AUTHORS: Malte Pedersen ; Joakim Bruslund Haurum ; Stefan Hein Bengtson ; Thomas B. Moeslund
COMMENTS: CVPR 2020. Project webpage: https://vap.aau.dk/3d-zef/
HIGHLIGHT: In this work we present a novel publicly available stereo based 3D RGB dataset for multi-object zebrafish tracking, called 3D-ZeF.
62, TITLE: Evolution of Group-Theoretic Cryptology Attacks using Hyper-heuristics
http://arxiv.org/abs/2006.08458
AUTHORS: Matthew J. Craven ; John R. Woodward
COMMENTS: 16 pages
HIGHLIGHT: The contribution of this paper is thus a framework to automatically build algorithms to attack cryptology problems.
63, TITLE: Interaction Networks: Using a Reinforcement Learner to train other Machine Learning algorithms
http://arxiv.org/abs/2006.08457
AUTHORS: Florian Dietz
HIGHLIGHT: In this paper, thought experiments are used to explore how the additional abilities of Interaction Networks could be used to improve various existing types of neural networks.
64, TITLE: Optimal Transport Kernels for Sequential and Parallel Neural Architecture Search
http://arxiv.org/abs/2006.07593
AUTHORS: Vu Nguyen ; Tam Le ; Makoto Yamada ; Michael A Osborne
COMMENTS: 21 pages
HIGHLIGHT: Building upon tree-Wasserstein (TW), which is a negative definite variant of OT, we develop a novel discrepancy for neural architectures, and demonstrate it within a Gaussian process surrogate model for the sequential NAS settings.
65, TITLE: Attribute-aware Identity-hard Triplet Loss for Video-based Person Re-identification
http://arxiv.org/abs/2006.07597
AUTHORS: Zhiyuan Chen ; Annan Li ; Shilu Jiang ; Yunhong Wang
HIGHLIGHT: In this paper, we address this issue by introducing a new metric learning method called Attribute-aware Identity-hard Triplet Loss (AITL), which reduces the intra-class variation among positive samples via calculating attribute distance.
66, TITLE: Mining Implicit Relevance Feedback from User Behavior forWeb Question Answering
http://arxiv.org/abs/2006.07581
AUTHORS: Linjun Shou ; Shining Bo ; Feixiang Cheng ; Ming Gong ; Jian Pei ; Daxin Jiang
COMMENTS: Accepted by KDD 2020
HIGHLIGHT: In this paper, we make the first study to explore the correlation between user behavior and passage relevance, and propose a novel approach for mining training data for Web QA.
67, TITLE: Adversarial Self-Supervised Contrastive Learning
http://arxiv.org/abs/2006.07589
AUTHORS: Minseon Kim ; Jihoon Tack ; Sung Ju Hwang
COMMENTS: Under review
HIGHLIGHT: In this paper, we propose a novel adversarial attack for unlabeled data, which makes the model confuse the instance-level identities of the perturbed data samples.
68, TITLE: Semantic-driven Colorization
http://arxiv.org/abs/2006.07587
AUTHORS: Man M. Ho ; Lu Zhang ; TU Ilmenau ; Jinjia Zhou
COMMENTS: This work is available at https://minhmanho.github.io/semantic-driven_colorization/
HIGHLIGHT: In this study, we simulate that human-like action to firstly let our network learn to segment what is in the photo, then colorize it.
69, TITLE: SD-RSIC: Summarization Driven Deep Remote Sensing Image Captioning
http://arxiv.org/abs/2006.08432
AUTHORS: Gencer Sumbul ; Sonali Nayak ; Begüm Demir
HIGHLIGHT: To overcome this limitation, in this paper we present a novel Summarization Driven Remote Sensing Image Captioning (SD-RSIC) approach.
70, TITLE: Learning from the Scene and Borrowing from the Rich: Tackling the Long Tail in Scene Graph Generation
http://arxiv.org/abs/2006.07585
AUTHORS: Tao He ; Lianli Gao ; Jingkuan Song ; Jianfei Cai ; Yuan-Fang Li
HIGHLIGHT: In this paper, we tackle this issue from another two aspects: (1) scene-object interaction aiming at learning specific knowledge from a scene via an additive attention mechanism; and (2) long-tail knowledge transfer which tries to transfer the rich knowledge learned from the head into the tail.
71, TITLE: Mitigating Face Recognition Bias via Group Adaptive Classifier
http://arxiv.org/abs/2006.07576
AUTHORS: Sixue Gong ; Xiaoming Liu ; Anil K. Jain
HIGHLIGHT: This work aims to learn a fair face representation, where faces of every group could be equally well-represented.
72, TITLE: GIPFA: Generating IPA Pronunciation from Audio
http://arxiv.org/abs/2006.07573
AUTHORS: Xavier Marjou
COMMENTS: 8 pages, 2 figures, 7 tables
HIGHLIGHT: In this study, we instead examined the use of an Artificial Neural Network (ANN) model to automatically extract the IPA pronunciation of a word based on its audio pronunciation, hence its name Generating IPA Pronunciation From Audio (GIPFA).
73, TITLE: Accurate Anchor Free Tracking
http://arxiv.org/abs/2006.07560
AUTHORS: Shengyun Peng ; Yunxuan Yu ; Kun Wang ; Lei He
COMMENTS: 10 pages, 11 figures
HIGHLIGHT: This paper develops the first Anchor Free Siamese Network (AFSN).
74, TITLE: Unbiased Auxiliary Classifier GANs with MINE
http://arxiv.org/abs/2006.07567
AUTHORS: Ligong Han ; Anastasis Stathopoulos ; Tao Xue ; Dimitris Metaxas
COMMENTS: Accepted at CVPRW-20
HIGHLIGHT: To this end, we propose an Unbiased Auxiliary GANs (UAC-GAN) that utilizes the Mutual Information Neural Estimator (MINE) to estimate the mutual information between the generated data distribution and labels.
75, TITLE: Machine Common Sense
http://arxiv.org/abs/2006.08409
AUTHORS: Alexander Gavrilenko ; Katerina Morozova
HIGHLIGHT: This article deals with the aspects of modeling commonsense reasoning focusing on such domain as interpersonal interactions.
76, TITLE: Ethical Considerations for AI Researchers
http://arxiv.org/abs/2006.07558
AUTHORS: Kyle Dent
HIGHLIGHT: Ethical Considerations for AI Researchers
77, TITLE: Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
http://arxiv.org/abs/2006.07554
AUTHORS: Yunhao Tang ; Krzysztof Choromanski
HIGHLIGHT: In this work, we propose a framework which entails the application of Evolutionary Strategies to online hyper-parameter tuning in off-policy learning.
78, TITLE: Sparse Separable Nonnegative Matrix Factorization
http://arxiv.org/abs/2006.07553
AUTHORS: Nicolas Nadisic ; Arnaud Vandaele ; Jeremy E. Cohen ; Nicolas Gillis
COMMENTS: 20 pages, accepted in ECML 2020
HIGHLIGHT: We introduce an algorithm to solve SSNMF, based on the successive nonnegative projection algorithm (SNPA, an effective algorithm for separable NMF), and an exact sparse nonnegative least squares solver.
79, TITLE: Guided Transformer: Leveraging Multiple External Sources for Representation Learning in Conversational Search
http://arxiv.org/abs/2006.07548
AUTHORS: Helia Hashemi ; Hamed Zamani ; W. Bruce Croft
COMMENTS: To appear in the Proceedings of ACM SIGIR 2020. 10 pages
HIGHLIGHT: In this paper, we enrich the representations learned by Transformer networks using a novel attention mechanism from external information sources that weights each term in the conversation.
80, TITLE: Geo-PIFu: Geometry and Pixel Aligned Implicit Functions for Single-view Human Reconstruction
http://arxiv.org/abs/2006.08072
AUTHORS: Tong He ; John Collomosse ; Hailin Jin ; Stefano Soatto
HIGHLIGHT: We propose Geo-PIFu, a method to recover a 3D mesh from a monocular color image of a clothed person.
81, TITLE: Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution
http://arxiv.org/abs/2006.08070
AUTHORS: Xianhang Cheng ; Zhenzhong Chen
HIGHLIGHT: In this paper, we try to solve these problems and propose a novel approach that we refer to as enhanced deformable separable convolution (EDSC) to estimate not only adaptive kernels, but also offsets, masks and biases to make the network obtain information from non-local neighborhood.
82, TITLE: Multi-Purchase Behavior: Modeling and Optimization
http://arxiv.org/abs/2006.08055
AUTHORS: Theja Tulabandhula ; Deeksha Sinha ; Prasoon Patidar
COMMENTS: 40 pages
HIGHLIGHT: We present a parsimonious multi-purchase family of choice models called the BundleMVL-K family, and develop a binary search based iterative strategy that efficiently computes optimized recommendations for this model.
83, TITLE: RasterNet: Modeling Free-Flow Speed using LiDAR and Overhead Imagery
http://arxiv.org/abs/2006.08021
AUTHORS: Armin Hadzic ; Hunter Blanton ; Weilian Song ; Mei Chen ; Scott Workman ; Nathan Jacobs
HIGHLIGHT: We propose a fully automated approach, RasterNet, for estimating free-flow speed without the need for explicit geometric features. To support training and evaluation, we introduce a novel dataset combining free-flow speeds of road segments, overhead imagery, and LiDAR point clouds across the state of Kentucky.
84, TITLE: Symbolic Logic meets Machine Learning: A Brief Survey in Infinite Domains
http://arxiv.org/abs/2006.08480
AUTHORS: Vaishak Belle
HIGHLIGHT: In this article, we survey work that provides further evidence for the connections between logic and learning.
85, TITLE: CompressNet: Generative Compression at Extremely Low Bitrates
http://arxiv.org/abs/2006.08003
AUTHORS: Suraj Kiran Raman ; Aditya Ramesh ; Vijayakrishna Naganoor ; Shubham Dash ; Giridharan Kumaravelu ; Honglak Lee
HIGHLIGHT: To address this need, we propose a novel network called CompressNet which augments a Stacked Autoencoder with a Switch Prediction Network (SAE-SPN).
86, TITLE: UWSpeech: Speech to Speech Translation for Unwritten Languages
http://arxiv.org/abs/2006.07926
AUTHORS: Chen Zhang ; Xu Tan ; Yi Ren ; Tao Qin ; Kejun Zhang ; Tie-Yan Liu
HIGHLIGHT: We propose a method called XL-VAE, which enhances vector quantized variational autoencoder (VQ-VAE) with cross-lingual (XL) speech recognition, to train the converter and inverter of UWSpeech jointly.
87, TITLE: Categorical anomaly detection in heterogeneous data using minimum description length clustering
http://arxiv.org/abs/2006.07916
AUTHORS: James Cheney ; Xavier Gombau ; Ghita Berrada ; Sidahmed Benabderrahmane
HIGHLIGHT: Fast and effective unsupervised anomaly detection algorithms have been proposed for categorical data based on the minimum description length (MDL) principle.
88, TITLE: Fair Influence Maximization: A Welfare Optimization Approach
http://arxiv.org/abs/2006.07906
AUTHORS: Aida Rahmattalabi ; Shahin Jabbari ; Himabindu Lakkaraju ; Phebe Vayanos ; Eric Rice ; Milind Tambe
HIGHLIGHT: Algorithmic influence maximization techniques have been proposed to aid with the choice of influencers (or peer leaders) in such interventions.
89, TITLE: Leveraging Multimodal Behavioral Analytics for Automated Job Interview Performance Assessment and Feedback
http://arxiv.org/abs/2006.07909
AUTHORS: Anumeha Agrawal ; Rosa Anil George ; Selvan Sunitha Ravi ; Sowmya Kamath S ; Anand Kumar M
COMMENTS: 9 pages, ACL 2020
HIGHLIGHT: We propose a multimodal analytical framework that analyzes the candidate in an interview scenario and provides feedback for predefined labels such as engagement, speaking rate, eye contact, etc.
90, TITLE: GAN Memory with No Forgetting
http://arxiv.org/abs/2006.07543
AUTHORS: Yulai Cong ; Miaoyun Zhao ; Jianqiao Li ; Sijia Wang ; Lawrence Carin
HIGHLIGHT: Seeking to address the fundamental issue of memory in lifelong learning, we propose a GAN memory that is capable of realistically remembering a stream of generative processes with \emph{no} forgetting.
91, TITLE: FakePolisher: Making DeepFakes More Detection-Evasive by Shallow Reconstruction
http://arxiv.org/abs/2006.07533
AUTHORS: Yihao Huang ; Felix Juefei-Xu ; Run Wang ; Qing Guo ; Lei Ma ; Xiaofei Xie ; Jianwen Li ; Weikai Miao ; Yang Liu ; Geguang Pu
COMMENTS: 10 pages
HIGHLIGHT: In this paper, we devise a simple yet powerful approach termed FakePolisher that performs shallow reconstruction of fake images through learned linear dictionary, intending to effectively and efficiently reduce the artifacts introduced during image synthesis.
92, TITLE: Online Bayesian Goal Inference for Boundedly-Rational Planning Agents
http://arxiv.org/abs/2006.07532
AUTHORS: Tan Zhi-Xuan ; Jordyn L. Mann ; Tom Silver ; Joshua B. Tenenbaum ; Vikash K. Mansinghka
COMMENTS: 11 pages, 5 figures (supplement: 8 pages, 8 figures)
HIGHLIGHT: Here we present an architecture capable of inferring an agent's goals online from both optimal and non-optimal sequences of actions.
93, TITLE: Rethinking the Value of Labels for Improving Class-Imbalanced Learning
http://arxiv.org/abs/2006.07529
AUTHORS: Yuzhe Yang ; Zhi Xu
HIGHLIGHT: In this work, we systematically investigate these two facets of labels.
94, TITLE: CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1)
http://arxiv.org/abs/2006.07526
AUTHORS: Xiang Wang ; Baiteng Ma ; Zhiwu Qing ; Yongpeng Sang ; Changxin Gao ; Shiwei Zhang ; Nong Sang
COMMENTS: ActivityNet Challenge 2020 Temporal Action Localization (Task 1) Champion Solution (Rank 1)
HIGHLIGHT: In this report, we present our solution for the task of temporal action localization (detection) (task 1) in ActivityNet Challenge 2020.
95, TITLE: Self-Supervised Discovery of Anatomical Shape Landmarks
http://arxiv.org/abs/2006.07525
AUTHORS: Riddhish Bhalodia ; Ladislav Kavan ; Ross Whitaker
COMMENTS: Early accept at MICCAI 2020
HIGHLIGHT: Using this assumption we propose a self-supervised, neural network approach for automatically positioning and detecting landmarks in images that can be used for subsequent analysis.
96, TITLE: Temporal Fusion Network for Temporal Action Localization:Submission to ActivityNet Challenge 2020 (Task E)
http://arxiv.org/abs/2006.07520
AUTHORS: Zhiwu Qing ; Xiang Wang ; Yongpeng Sang ; Changxin Gao ; Shiwei Zhang ; Nong Sang
COMMENTS: To appear on CVPR 2020 HACS Workshop (Rank 1st)
HIGHLIGHT: This technical report analyzes a temporal action localization method we used in the HACS competition which is hosted in Activitynet Challenge 2020.
97, TITLE: A Multifunction Printer CUI for the Blind
http://arxiv.org/abs/2006.07519
AUTHORS: Kyle Dent ; Kalai Ramea
HIGHLIGHT: This paper presents a case study of our work to develop a conversational user interface for accessibility for multifunction printers (MFP).
98, TITLE: BatVision with GCC-PHAT Features for Better Sound to Vision Predictions
http://arxiv.org/abs/2006.07995
AUTHORS: Jesper Haahr Christensen ; Sascha Hornauer ; Stella Yu
HIGHLIGHT: We compare and present both quantitative and qualitative improvements over our previous BatVision model.
99, TITLE: Do Dogs have Whiskers? A New Knowledge Base of hasPart Relations
http://arxiv.org/abs/2006.07510
AUTHORS: Sumithra Bhakthavatsalam ; Kyle Richardson ; Niket Tandon ; Peter Clark
HIGHLIGHT: We present a new knowledge-base of hasPart relationships, extracted from a large corpus of generic statements.
100, TITLE: Road Mapping in Low Data Environments with OpenStreetMap
http://arxiv.org/abs/2006.07993
AUTHORS: John Kamalu ; Benjamin Choi
HIGHLIGHT: This work investigates the viability of high resolution satellite imagery and crowd-sourced resources like OpenStreetMap in the construction of such a mapping.
101, TITLE: Emergent Properties of Foveated Perceptual Systems
http://arxiv.org/abs/2006.07991
AUTHORS: Arturo Deza ; Talia Konkle
COMMENTS: A pre-print. Currently under review at the Conference on Neural Information Processing Systems (NeurIPS 2020). Themes: Foveation, Perception & Representational Learning
HIGHLIGHT: We introduce foveated perceptual systems, inspired by human biological systems, and examine the impact that this foveation stage has on the nature and robustness of subsequently learned visual representation.
102, TITLE: Realistic Physics Based Character Controller
http://arxiv.org/abs/2006.07508
AUTHORS: Joe Booth ; Vladimir Ivanov
COMMENTS: 5 pages
HIGHLIGHT: The paper aims at closing the gap between the researchers and users by introducing an open source implementation of physics based character control in Unity framework that has a low entry barrier and a steep learning curve.
103, TITLE: GradAug: A New Regularization Method for Deep Neural Networks
http://arxiv.org/abs/2006.07989
AUTHORS: Taojiannan Yang ; Sijie Zhu ; Chen Chen
HIGHLIGHT: We propose a new regularization method to alleviate over-fitting in deep neural networks.
104, TITLE: Weakly-supervised Any-shot Object Detection
http://arxiv.org/abs/2006.07502
AUTHORS: Siddhesh Khandelwal ; Raghav Goyal ; Leonid Sigal
COMMENTS: 19 Pages, 8 Figures, 8 Tables
HIGHLIGHT: In this work, we aim to bridge this divide by proposing an intuitive weakly-supervised model that is applicable to a range of supervision: from zero to a few instance-level samples per novel class.
105, TITLE: Domain Generalization using Causal Matching
http://arxiv.org/abs/2006.07500
AUTHORS: Divyat Mahajan ; Shruti Tople ; Amit Sharma
COMMENTS: Preprint. Under review
HIGHLIGHT: In this work, we propose a causal interpretation of domain generalization that defines domains as interventions under a data-generating process.
106, TITLE: ShapeFlow: Learnable Deformations Among 3D Shapes
http://arxiv.org/abs/2006.07982
AUTHORS: Chiyu "Max" Jiang ; Jingwei Huang ; Andrea Tagliasacchi ; Leonidas Guibas
HIGHLIGHT: We present ShapeFlow, a flow-based model for learning a deformation space for entire classes of 3D shapes with large intra-class variations.
107, TITLE: Geodesic-HOF: 3D Reconstruction Without Cutting Corners
http://arxiv.org/abs/2006.07981
AUTHORS: Ziyun Wang ; Eric A. Mitchell ; Volkan Isler ; Daniel D. Lee
HIGHLIGHT: To address this issue, we propose learning an image-conditioned mapping function from a canonical sampling domain to a high dimensional space where the Euclidean distance is equal to the geodesic distance on the object.
108, TITLE: Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization
http://arxiv.org/abs/2006.07976
AUTHORS: Junting Pan ; Siyu Chen ; Zheng Shou ; Jing Shao ; Hongsheng Li
HIGHLIGHT: In this paper, we propose to explicitly model the Actor-Context-Actor Relation, which can capture indirect high-order supportive information for effectively reasoning actors' actions in complex scenes.
109, TITLE: Tackling Morpion Solitaire with AlphaZero-likeRanked Reward Reinforcement Learning
http://arxiv.org/abs/2006.07970
AUTHORS: Hui Wang ; Mike Preuss ; Michael Emmerich ; Aske Plaat
COMMENTS: 4 pages, 2 figures. the first/ongoing attempt to tackle Morpion Solitaire using ranked reward reinforcement learning. submitted to SYNASC2020
HIGHLIGHT: In this paper we take the recent impressive performance of deep self-learning reinforcement learning approaches from AlphaGo/AlphaZero as inspiration to design a searcher for Morpion Solitaire.
110, TITLE: Relational reasoning and generalization using non-symbolic neural networks
http://arxiv.org/abs/2006.07968
AUTHORS: Atticus Geiger ; Alexandra Carstensen Michael C. Frank ; Christopher Potts
HIGHLIGHT: This paper revisits the question of whether equality can be learned by neural networks that do not encode explicit symbolic structure.
111, TITLE: Meta Approach to Data Augmentation Optimization
http://arxiv.org/abs/2006.07965
AUTHORS: Ryuichiro Hataya ; Jan Zdenek ; Kazuki Yoshizoe ; Hideki Nakayama
HIGHLIGHT: In this paper, we propose to optimize image recognition models and data augmentation policies simultaneously to improve the performance using gradient descent.
112, TITLE: OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page Text Recognition by learning to unfold
http://arxiv.org/abs/2006.07491
AUTHORS: Mohamed Yousef ; Tom E. Bishop
COMMENTS: Accepted to CVPR 2020
HIGHLIGHT: We propose a novel and simple neural network module, termed \textbf{OrigamiNet}, that can augment any CTC-trained, fully convolutional single line text recognizer, to convert it into a multi-line version by providing the model with enough spatial capacity to be able to properly collapse a 2D input signal into 1D without losing information.
113, TITLE: Understanding Unintended Memorization in Federated Learning
http://arxiv.org/abs/2006.07490
AUTHORS: Om Thakkar ; Swaroop Ramaswamy ; Rajiv Mathews ; Françoise Beaufays
HIGHLIGHT: In this paper, we initiate a formal study to understand the effect of different components of canonical FL on unintended memorization in trained models, comparing with the central learning setting.
114, TITLE: A Generative Model for Joint Natural Language Understanding and Generation
http://arxiv.org/abs/2006.07499
AUTHORS: Bo-Hsiang Tseng ; Jianpeng Cheng ; Yimai Fang ; David Vandyke
COMMENTS: The 58th Annual Meeting of the Association for Computational Linguistics, ACL2020
HIGHLIGHT: In this work, we propose a generative model which couples NLU and NLG through a shared latent variable.
115, TITLE: Multi-Modal Fingerprint Presentation Attack Detection: Evaluation On A New Dataset
http://arxiv.org/abs/2006.07498
AUTHORS: Leonidas Spinoulas ; Hengameh Mirzaalian ; Mohamed Hussein ; Wael AbdAlmageed
HIGHLIGHT: In this work, rather than relying on legacy fingerprint images, which are widely used in the community, we study the usefulness of multiple recently introduced sensing modalities.
116, TITLE: Open Questions in Creating Safe Open-ended AI: Tensions Between Control and Creativity
http://arxiv.org/abs/2006.07495
AUTHORS: Adrien Ecoffet ; Jeff Clune ; Joel Lehman
HIGHLIGHT: This paper proposes that open-ended evolution and artificial life have much to contribute towards the understanding of open-ended AI, focusing here in particular on the safety of open-ended search.
117, TITLE: Multispectral Biometrics System Framework: Application to Presentation Attack Detection
http://arxiv.org/abs/2006.07489
AUTHORS: Leonidas Spinoulas ; Mohamed Hussein ; David Geissbühler ; Joe Mathai ; Oswin G. Almeida ; Guillaume Clivaz ; Sébastien Marcel ; Wael AbdAlmageed
HIGHLIGHT: In this work, we present a general framework for building a biometrics system capable of capturing multispectral data from a series of sensors synchronized with active illumination sources.
118, TITLE: Learning-to-Learn Personalised Human Activity Recognition Models
http://arxiv.org/abs/2006.07472
AUTHORS: Anjana Wijekoon ; Nirmalie Wiratunga
COMMENTS: 17 pages
HIGHLIGHT: We introduce two algorithms, Personalised MAML and Personalised Relation Networks inspired by existing Meta-Learning algorithms but optimised for learning HAR models that are adaptable to any person in health and well-being applications.
119, TITLE: ORD: Object Relationship Discovery for Visual Dialogue Generation
http://arxiv.org/abs/2006.08322
AUTHORS: Ziwei Wang ; Zi Huang ; Yadan Luo ; Huimin Lu
HIGHLIGHT: In this paper, we propose an object relationship discovery (ORD) framework to preserve the object interactions for visual dialogue generation.
120, TITLE: Early Blindness Detection Based on Retinal Images Using Ensemble Learning
http://arxiv.org/abs/2006.07475
AUTHORS: Niloy Sikder ; Md. Sanaullah Chowdhury ; Abu Shamim Mohammad Arif ; Abdullah-Al Nahid
COMMENTS: 6 pages, 22nd International Conference of Computer and Information Technology (ICCIT), 18-20 December, 2019
HIGHLIGHT: In this study, a novel early blind detection method has been proposed based on the color information extracted from retinal images using an ensemble learning algorithm.
121, TITLE: On the Preservation of Spatio-temporal Information in Machine Learning Applications
http://arxiv.org/abs/2006.08321
AUTHORS: Yigit Oktar ; Mehmet Turkan
HIGHLIGHT: In this paper, the problem of orthogonality is first investigated through conventional $k$-means of images, where images are to be processed as vectors.
122, TITLE: Learning Causal Models Online
http://arxiv.org/abs/2006.07461
AUTHORS: Khurram Javed ; Martha White ; Yoshua Bengio
COMMENTS: Spurious features, causal models, online learning, random search, non-iid
HIGHLIGHT: In this work, we take an indirect approach to discovering causal models.
123, TITLE: TURB-Rot. A large database of 3d and 2d snapshots from turbulent rotating flows
http://arxiv.org/abs/2006.07469
AUTHORS: L. Biferale ; F. Bonaccorso ; M. Buzzicotti ; P. Clark di Leoni
HIGHLIGHT: We present TURB-Rot, a new open database of 3d and 2d snapshots of turbulent velocity fields, obtained by Direct Numerical Simulations (DNS) of the original Navier-Stokes equations in the presence of rotation.
124, TITLE: Mitigating Gender Bias in Captioning Systems
http://arxiv.org/abs/2006.08315
AUTHORS: Ruixiang Tang ; Mengnan Du ; Yuening Li ; Zirui Liu ; Xia Hu
HIGHLIGHT: In this work, we specifically focus on the gender bias problem.
125, TITLE: Existential Theory of the Reals Completeness of Stationary Nash Equilibria in Perfect Information Stochastic Games
http://arxiv.org/abs/2006.08314
AUTHORS: Kristoffer Arnsfelt Hansen ; Steffan Christ Sølvsten
COMMENTS: 19 pages, 9 figures, submitted to 45th International Symposium on Mathematical Foundations of Computer Science
HIGHLIGHT: We show that the problem of deciding whether in a multi-player perfect information recursive game (i.e. a stochastic game with terminal rewards) there exists a stationary Nash equilibrium ensuring each player a certain payoff is Existential Theory of the Reals complete.
126, TITLE: Safety-guaranteed Reinforcement Learning based on Multi-class Support Vector Machine
http://arxiv.org/abs/2006.07446
AUTHORS: Kwangyeon Kim ; Akshita Gupta ; Hong-Cheol Choi ; Inseok Hwang
HIGHLIGHT: In this work, we address the problem of satisfying hard state constraints in a model-free RL setting with the deterministic system dynamics.
127, TITLE: Algorithm for Computing Approximate Nash equilibrium in Continuous Games with Application to Continuous Blotto
http://arxiv.org/abs/2006.07443
AUTHORS: Sam Ganzfried
HIGHLIGHT: We present a new algorithm for computing Nash equilibrium strategies in continuous games.
128, TITLE: Pointer Data Structure Synthesis from Answer Set Programming Specifications
http://arxiv.org/abs/2006.07440
AUTHORS: Sarat Chandra Varanasi ; Neeraj Mittal ; Gopal Gupta
HIGHLIGHT: We argue in this paper that ASP's reliance on negation-as-failure makes it a better formalism than those based on first-order logic for writing formal specifications.
129, TITLE: On the Skolem Problem and Prime Powers
http://arxiv.org/abs/2006.07432
AUTHORS: George Kenison ; Richard Lipton ; Joël Ouaknine ; James Worrell
COMMENTS: 13 pages, ISSAC 2020
HIGHLIGHT: In this paper we consider the following specialisation of the problem: given in addition $c\in\mathbb{N}$, determine whether there exists $n\in\mathbb{N}$ of the form $n=lp^k$, with $k,l\leq c$ and $p$ any prime number, such that $u_n=0$.
130, TITLE: Continuous Control for Searching and Planning with a Learned Model
http://arxiv.org/abs/2006.07430
AUTHORS: Xuxi Yang ; Werner Duvaud ; Peng Wei
HIGHLIGHT: In this paper, we provide a way and the necessary theoretical results to extend the MuZero algorithm to more generalized environments with continuous action space.
131, TITLE: Pixel Invisibility: Detecting Objects Invisible in Color Images
http://arxiv.org/abs/2006.08383
AUTHORS: Yongxin ; Wang ; Duminda Wijesekera
COMMENTS: 8 pages, 7 figures, submitted to NIPS 2020
HIGHLIGHT: We propose a novel use of cross modal knowledge distillation from color to infra-red domain using weakly-aligned image pairs from the day and construct indicators for the pixel-level invisibility based on the distances of their intermediate-level features.
132, TITLE: DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning
http://arxiv.org/abs/2006.08381
AUTHORS: Kevin Ellis ; Catherine Wong ; Maxwell Nye ; Mathias Sable-Meyer ; Luc Cary ; Lucas Morales ; Luke Hewitt ; Armando Solar-Lezama ; Joshua B. Tenenbaum
HIGHLIGHT: We present DreamCoder, a system that learns to solve problems by writing programs.
133, TITLE: Causal Inference with Deep Causal Graphs
http://arxiv.org/abs/2006.08380
AUTHORS: Álvaro Parafita ; Jordi Vitrià
COMMENTS: Supplementary material can be found in https://github.com/aparafita/dcg-paper
HIGHLIGHT: We propose Deep Causal Graphs, an abstract specification of the required functionality for a neural network to model causal distributions, and provide a model that satisfies this contract: Normalizing Causal Flows.
134, TITLE: Catplayinginthesnow: Impact of Prior Segmentation on a Model of Visually Grounded Speech
http://arxiv.org/abs/2006.08387
AUTHORS: William N. Havard ; Jean-Pierre Chevrot ; Laurent Besacier
HIGHLIGHT: We introduce a simple way to introduce such information in an RNN-based model and investigate which type of boundary enables a better mapping between an image and its spoken description.
135, TITLE: Generating Master Faces for Use in Performing Wolf Attacks on Face Recognition Systems
http://arxiv.org/abs/2006.08376
AUTHORS: Huy H. Nguyen ; Junichi Yamagishi ; Isao Echizen ; Sébastien Marcel
COMMENTS: Accepted to be Published in Proceedings of the 2020 International Joint Conference on Biometrics (IJCB 2020), Houston, USA
HIGHLIGHT: In this work, we demonstrated that wolf (generic) faces, which we call "master faces," can also compromise face recognition systems and that the master face concept can be generalized in some cases.
136, TITLE: Adaptively Meshed Video Stabilization
http://arxiv.org/abs/2006.07820
AUTHORS: Minda Zhao ; Qiang Ling
HIGHLIGHT: To resolve the above issues, this paper proposes an adaptively meshed method to stabilize a shaky video based on all of its feature trajectories and an adaptive blocking strategy.
137, TITLE: Alternating ConvLSTM: Learning Force Propagation with Alternate State Updates
http://arxiv.org/abs/2006.07818
AUTHORS: Congyue Deng ; Tai-Jiang Mu ; Shi-Min Hu
HIGHLIGHT: In this paper, we introduce the alternating convolutional Long Short-Term Memory (Alt-ConvLSTM) that models the force propagation mechanisms in a deformable object with near-uniform material properties.
138, TITLE: 2D Image Relighting with Image-to-Image Translation
http://arxiv.org/abs/2006.07816
AUTHORS: Paul Gafton ; Erick Maraz
COMMENTS: 12 pages, 52 Postscript figures, uses cvpr_eso.sty eso-pic.sty ruler.sty
HIGHLIGHT: Here we provide our attempt to solve this problem using GANs.
139, TITLE: Disentanglement for Discriminative Visual Recognition
http://arxiv.org/abs/2006.07810
AUTHORS: Xiaofeng Liu
COMMENTS: Manuscript for book "Recognition and perception of images" Willy
HIGHLIGHT: In this chapter, these problems are casted as either a deep metric learning problem or an adversarial minimax game in the latent space.
140, TITLE: Relative Pose Estimation for Stereo Rolling Shutter Cameras
http://arxiv.org/abs/2006.07807
AUTHORS: Ke Wang ; Bin Fan ; Yuchao Dai
COMMENTS: Accepted by International Conference on Image Processing (ICIP 2020)
HIGHLIGHT: In this paper, we present a novel linear algorithm to estimate the 6 DoF relative pose from consecutive frames of stereo rolling shutter (RS) cameras.
141, TITLE: Vietnamese Word Segmentation with SVM: Ambiguity Reduction and Suffix Capture
http://arxiv.org/abs/2006.07804
AUTHORS: Duc-Vu Nguyen ; Dang Van Thin ; Kiet Van Nguyen ; Ngan Luu-Thuy Nguyen
COMMENTS: In Proceedings of the 16th International Conference of the Pacific Association for Computational Linguistics (PACLING 2019)
HIGHLIGHT: In this paper, we approach Vietnamese word segmentation as a binary classification by using the Support Vector Machine classifier.
142, TITLE: Geometry-Aware Instance Segmentation with Disparity Maps
http://arxiv.org/abs/2006.07802
AUTHORS: Cho-Ying Wu ; Xiaoyan Hu ; Michael Happold ; Qiangeng Xu ; Ulrich Neumann
COMMENTS: CVPR 2020 Workshop of Scalability in Autonomous Driving (WSAD). Please refer to WSAD site for details
HIGHLIGHT: Most previous works of outdoor instance segmentation for images only use color information. We collect and utilize High-Quality Driving Stereo (HQDS) dataset, using much longer baseline and focal length with higher resolution.
143, TITLE: ReLGAN: Generalization of Consistency for GAN with Disjoint Constraints and Relative Learning of Generative Processes for Multiple Transformation Learning
http://arxiv.org/abs/2006.07809
AUTHORS: Chiranjib Sur
HIGHLIGHT: In this work, we have introduced a generalized scheme for consistency for GAN architectures with two new concepts of Transformation Learning (TL) and Relative Learning (ReL) for enhanced learning image transformations.
144, TITLE: Measuring Forecasting Skill from Text
http://arxiv.org/abs/2006.07425
AUTHORS: Shi Zong ; Alan Ritter ; Eduard Hovy
COMMENTS: Accepted at ACL 2020
HIGHLIGHT: In this paper we explore connections between the language people use to describe their predictions and their forecasting skill.
145, TITLE: Defending against GAN-based Deepfake Attacks via Transformation-aware Adversarial Faces
http://arxiv.org/abs/2006.07421
AUTHORS: Chaofei Yang ; Lei Ding ; Yiran Chen ; Hai Li
HIGHLIGHT: This work aims to take an offensive measure to impede the generation of high-quality fake images or videos.
146, TITLE: FinEst BERT and CroSloEngual BERT: less is more in multilingual models
http://arxiv.org/abs/2006.07890
AUTHORS: Matej Ulčar ; Marko Robnik-Šikonja
COMMENTS: 10 pages, accepted at TSD 2020 conference
HIGHLIGHT: We train two trilingual BERT-like models, one for Finnish, Estonian, and English, the other for Croatian, Slovenian, and English.
147, TITLE: Team RUC_AIM3 Technical Report at Activitynet 2020 Task 2: Exploring Sequential Events Detection for Dense Video Captioning
http://arxiv.org/abs/2006.07896
AUTHORS: Yuqing Song ; Shizhe Chen ; Yida Zhao ; Qin Jin
COMMENTS: Winner solution in CVPR 2020 Activitynet Dense Video Captioning challenge
HIGHLIGHT: In this work, we propose a novel and simple model for event sequence generation and explore temporal relationships of the event sequence in the video.
148, TITLE: BI-MAML: Balanced Incremental Approach for Meta Learning
http://arxiv.org/abs/2006.07412
AUTHORS: Yang Zheng ; Jinlin Xiang ; Kun Su ; Eli Shlizerman
COMMENTS: Please see associated video at: https://youtu.be/4qlb-iG5SFo
HIGHLIGHT: We present a novel Balanced Incremental Model Agnostic Meta Learning system (BI-MAML) for learning multiple tasks.
149, TITLE: How to Avoid Being Eaten by a Grue: Structured Exploration Strategies for Textual Worlds
http://arxiv.org/abs/2006.07409
AUTHORS: Prithviraj Ammanabrolu ; Ethan Tien ; Matthew Hausknecht ; Mark O. Riedl
HIGHLIGHT: We introduce Q*BERT, an agent that learns to build a knowledge graph of the world by answering questions, which leads to greater sample efficiency.
150, TITLE: Examining the Role of Mood Patterns in Predicting Self-Reported Depressive symptoms
http://arxiv.org/abs/2006.07887
AUTHORS: Lucia Lushi Chen ; Walid Magdy ; Heather Whalley ; Maria Wolters
COMMENTS: Accepted at The Web Science Conference 2020
HIGHLIGHT: In this work, we attempt to enrich current technology for detecting symptoms of potential depression by constructing a 'mood profile' for social media users.
151, TITLE: Optical Music Recognition: State of the Art and Major Challenges
http://arxiv.org/abs/2006.07885
AUTHORS: Elona Shatri ; György Fazekas
COMMENTS: Author manuscript for TENOR 2020 conference. 11 pages with references, 3 figures
HIGHLIGHT: In this paper, we review relevant works in OMR, including fundamental methods and significant outcomes, and highlight different stages of the OMR pipeline.
152, TITLE: FenceMask: A Data Augmentation Approach for Pre-extracted Image Features
http://arxiv.org/abs/2006.07877
AUTHORS: Pu Li ; Xiangyang Li ; Xiang Long
HIGHLIGHT: We propose a novel data augmentation method named 'FenceMask' that exhibits outstanding performance in various computer vision tasks.
153, TITLE: Explicitly Modeled Attention Maps for Image Classification
http://arxiv.org/abs/2006.07872
AUTHORS: Andong Tan ; Duc Tam Nguyen ; Maximilian Dax ; Matthias Niessner ; Thomas Brox
COMMENTS: 12 pages, 3 figures
HIGHLIGHT: To mitigate this problem, we propose a novel self-attention module with explicitly modeled attention-maps using only a single learnable parameter for low computational overhead.
154, TITLE: Comparative Evaluation of Multi-Agent Deep Reinforcement Learning Algorithms
http://arxiv.org/abs/2006.07869
AUTHORS: Georgios Papoudakis ; Filippos Christianos ; Lukas Schäfer ; Stefano V. Albrecht
HIGHLIGHT: In this work, we evaluate and compare three different classes of MARL algorithms (independent learners, centralised training with decentralised execution, and value decomposition) in a diverse range of multi-agent learning tasks.
155, TITLE: Cityscapes 3D: Dataset and Benchmark for 9 DoF Vehicle Detection
http://arxiv.org/abs/2006.07864
AUTHORS: Nils Gählert ; Nicolas Jourdan ; Marius Cordts ; Uwe Franke ; Joachim Denzler
COMMENTS: 2020 "Scalability in Autonomous Driving" CVPR Workshop
HIGHLIGHT: To this end, we propose Cityscapes 3D, extending the original Cityscapes dataset with 3D bounding box annotations for all types of vehicles.
156, TITLE: Continual General Chunking Problem and SyncMap
http://arxiv.org/abs/2006.07853
AUTHORS: Danilo Vasconcellos Vargas ; Toshitake Asabuki
HIGHLIGHT: Here, we propose a continual generalization of the chunking problem (an unsupervised problem), encompassing fixed and probabilistic chunks, discovery of temporal and causal structures and their continual variations.
157, TITLE: Pot, kettle: Nonliteral titles aren't (natural) science
http://arxiv.org/abs/2006.07849
AUTHORS: Mike Thelwall
COMMENTS: Quantitative Science Studies, to appear
HIGHLIGHT: This article investigates the prevalence of poetic expressions in journal article titles 1996-2019 in 3.3 million articles from all 27 Scopus broad fields.
158, TITLE: An adversarial learning algorithm for mitigating gender bias in face recognition
http://arxiv.org/abs/2006.07845
AUTHORS: Prithviraj Dhar ; Joshua Gleason ; Hossein Souri ; Carlos D. Castillo ; Rama Chellappa
HIGHLIGHT: Therefore, we present a novel approach called `Adversarial Gender De-biasing (AGD)' to reduce the strength of gender information in face recognition features.
159, TITLE: A Generalized Asymmetric Dual-front Model for Active Contours and Image Segmentation
http://arxiv.org/abs/2006.07839
AUTHORS: Da~Chen ; Jack Spencer ; Jean-Marie Mirebeau ; Ke Chen ; Ming-Lei Shu ; Laurent D. Cohen
HIGHLIGHT: In this paper, we introduce a new type of geodesic metrics that encodes the edge-based anisotropy features, the region-based homogeneity penalization and asymmetric enhancement.
160, TITLE: Representative Committees of Peers
http://arxiv.org/abs/2006.07837
AUTHORS: Reshef Meir ; Fedor Sandomirskiy ; Moshe Tennenholtz
HIGHLIGHT: We show that a k-sortition (a random committee of k voters with the majority vote within the committee) leads to an outcome within the factor 1+O(1/k) of the optimal social cost for any number of voters n, any number of issues $m$, and any preference profile.
161, TITLE: Multi-Miner: Object-Adaptive Region Mining for Weakly-Supervised Semantic Segmentation
http://arxiv.org/abs/2006.07834
AUTHORS: Kuangqi Zhou ; Qibin Hou ; Zun Li ; Jiashi Feng
HIGHLIGHT: In this paper, we propose a novel multi-miner framework to perform a region mining process that adapts to diverse object sizes and is thus able to mine more integral and finer object regions.
162, TITLE: On Saliency Maps and Adversarial Robustness
http://arxiv.org/abs/2006.07828
AUTHORS: Puneet Mangla ; Vedant Singh ; Vineeth N Balasubramanian
COMMENTS: Accepted at ECML-PKDD 2020
HIGHLIGHT: In this work, we provide a different perspective to this coupling, and provide a method, Saliency based Adversarial training (SAT), to use saliency maps to improve adversarial robustness of a model.
163, TITLE: PCAAE: Principal Component Analysis Autoencoder for organising the latent space of generative networks
http://arxiv.org/abs/2006.07827
AUTHORS: Chi-Hieu Pham ; Saïd Ladjal ; Alasdair Newson
COMMENTS: Preprint with Appendix
HIGHLIGHT: Drawing inspiration from principal component analysis and autoencoder, we propose the Principal Component Analysis Autoencoder (PCAAE).
164, TITLE: Few-shot Object Detection on Remote Sensing Images
http://arxiv.org/abs/2006.07826
AUTHORS: Jingyu Deng ; Xiang Li ; Yi Fang
COMMENTS: 12pages, 7 figures
HIGHLIGHT: In this paper, we deal with the problem of object detection on remote sensing images.
165, TITLE: Working with scale: 2nd place solution to Product Detection in Densely Packed Scenes [Technical Report]
http://arxiv.org/abs/2006.07825
AUTHORS: Artem Kozlov
HIGHLIGHT: This report describes a 2nd place solution of the detection challenge which is held within CVPR 2020 Retail-Vision workshop.
166, TITLE: Differentiable Neural Architecture Transformation for Reproducible Architecture Improvement
http://arxiv.org/abs/2006.08231
AUTHORS: Do-Guk Kim ; Heung-Chang Lee
HIGHLIGHT: In this paper, we propose differentiable neural architecture transformation that is reproducible and efficient.
167, TITLE: Piecewise-Stationary Off-Policy Optimization
http://arxiv.org/abs/2006.08236
AUTHORS: Joey Hong ; Branislav Kveton ; Manzil Zaheer ; Yinlam Chow ; Amr Ahmed
COMMENTS: 16 pages, 2 figures
HIGHLIGHT: To address this challenge, we study the novel problem of off-policy optimization in piecewise-stationary contextual bandits.
168, TITLE: Finding trainable sparse networks through Neural Tangent Transfer
http://arxiv.org/abs/2006.08228
AUTHORS: Tianlin Liu ; Friedemann Zenke
COMMENTS: Accepted by ICML 2020
HIGHLIGHT: In this article, we introduce Neural Tangent Transfer, a method that instead finds trainable sparse networks in a label-free manner.
169, TITLE: Slowing Down the Weight Norm Increase in Momentum-based Optimizers
http://arxiv.org/abs/2006.08217
AUTHORS: Byeongho Heo ; Sanghyuk Chun ; Seong Joon Oh ; Dongyoon Han ; Sangdoo Yun ; Youngjung Uh ; Jung-Woo Ha
COMMENTS: 16 pages, 9 figures. First two authors contributed equally
HIGHLIGHT: We propose a simple and effective solution: at each iteration of momentum-based GD optimizers (e.g. SGD or Adam) applied on scale-invariant weights (e.g. Conv weights preceding a BN layer), we remove the radial component (i.e. parallel to the weight vector) from the update vector.
170, TITLE: Structural Autoencoders Improve Representations for Generation and Transfer
http://arxiv.org/abs/2006.07796
AUTHORS: Felix Leeb ; Yashas Annadani ; Stefan Bauer ; Bernhard Schölkopf
COMMENTS: Submitted to NeurIPS 2020
HIGHLIGHT: We propose a self-attention based architecture to make the encoder explicitly associate parts of the representation with parts of the input observation.
171, TITLE: Hyper RPCA: Joint Maximum Correntropy Criterion and Laplacian Scale Mixture Modeling On-the-Fly for Moving Object Detection
http://arxiv.org/abs/2006.07795
AUTHORS: Zerui Shao ; Yifei Pu ; Jiliu Zhou ; Bihan Wen ; Yi Zhang
HIGHLIGHT: In this paper, we propose a novel RPCA-based model, called Hyper RPCA, to detect moving objects on the fly.
172, TITLE: Generative 3D Part Assembly via Dynamic Graph Learning
http://arxiv.org/abs/2006.07793
AUTHORS: Jialei Huang ; Guanqi Zhan ; Qingnan Fan ; Kaichun Mo ; Lin Shao ; Baoquan Chen ; Leonidas Guibas ; Hao Dong
HIGHLIGHT: In this paper, we focus on the pose estimation subproblem from the vision side involving geometric and relational reasoning over the input part geometry.
173, TITLE: PrimA6D: Rotational Primitive Reconstruction for Enhanced and Robust 6D Pose Estimation
http://arxiv.org/abs/2006.07789
AUTHORS: MyungHwan Jeon ; Ayoung Kim
COMMENTS: Accepted at RA-L
HIGHLIGHT: In this paper, we introduce a rotational primitive prediction based 6D object pose estimation using a single image as an input.
174, TITLE: Fine-grained Human Evaluation of Transformer and Recurrent Approaches to Neural Machine Translation for English-to-Chinese
http://arxiv.org/abs/2006.08297
AUTHORS: Yuying Ye ; Antonio Toral
COMMENTS: Accepted at the 22nd Annual Conference of the European Association for Machine Translation (EAMT 2020)
HIGHLIGHT: This research presents a fine-grained human evaluation to compare the Transformer and recurrent approaches to neural machine translation (MT), on the translation direction English-to-Chinese.
175, TITLE: Deep-CAPTCHA: a deep learning based CAPTCHA solver for vulnerability assessment
http://arxiv.org/abs/2006.08296
AUTHORS: Zahra Noury ; Mahdi Rezaei
HIGHLIGHT: In this research, we investigate a way to crack visual CAPTCHA tests by an automated deep learning based solution. In this paper, we present our customised deep neural network model, the research gaps and the existing challenges, and the solutions to overcome the issues.
176, TITLE: Efficient Reasoning in Regular Boardgames
http://arxiv.org/abs/2006.08295
AUTHORS: Jakub Kowalski ; Radosław Miernik ; Maksymilian Mika ; Wojciech Pawlik ; Jakub Sutowicz ; Marek Szykuła ; Andrzej Tkaczyk
COMMENTS: IEEE Conference on Games 2020
HIGHLIGHT: We present the technical side of reasoning in Regular Boardgames (RBG) language -- a universal General Game Playing (GGP) formalism for the class of finite deterministic games with perfect information, encoding rules in the form of regular expressions.
177, TITLE: On the Multi-Property Extraction and Beyond
http://arxiv.org/abs/2006.08281
AUTHORS: Tomasz Dwojak ; Michał Pietruszka ; Łukasz Borchmann ; Filip Graliński ; Jakub Chłędowski
COMMENTS: 5 pages
HIGHLIGHT: In this paper, we investigate the Dual-source Transformer architecture on the WikiReading information extraction and machine reading comprehension dataset.
178, TITLE: Exploration of End-to-End ASR for OpenSTT -- Russian Open Speech-to-Text Dataset
http://arxiv.org/abs/2006.08274
AUTHORS: Andrei Andrusenko ; Aleksandr Laptev ; Ivan Medennikov
COMMENTS: Submitted to SPECOM 2020
HIGHLIGHT: This paper presents an exploration of end-to-end automatic speech recognition systems (ASR) for the largest open-source Russian language data set -- OpenSTT.
179, TITLE: Randomized polynomial-time equivalence between determinant and trace-IMM equivalence tests
http://arxiv.org/abs/2006.08272
AUTHORS: Janaky Murthy ; Vineet Nair ; Chandan Saha
COMMENTS: 36 pages, 2 figures
HIGHLIGHT: We show that the answer is 'yes' for Det and Tr-IMM (modulo the use of randomness).
180, TITLE: AMENet: Attentive Maps Encoder Network for Trajectory Prediction
http://arxiv.org/abs/2006.08264
AUTHORS: Hao Cheng ; Wentong Liao ; Michael Ying Yang ; Bodo Rosenhahn ; Monika Sester
HIGHLIGHT: To this end, we propose an end-to-end generative model named Attentive Maps Encoder Network (AMENet) for accurate and realistic multi-path trajectory prediction.
181, TITLE: Polynomial time deterministic identity testingalgorithm for $Σ^{[3]}ΠΣΠ^{[2]}$ circuits via Edelstein-Kelly type theorem for quadratic polynomials
http://arxiv.org/abs/2006.08263
AUTHORS: Shir Peleg ; Amir Shpilka
HIGHLIGHT: In this work we resolve conjectures of Beecken, Mitmann and Saxena [BMS13] and Gupta [Gup14], by proving an analog of a theorem of Edelstein and Kelly for quadratic polynomials.
182, TITLE: The PSPACE-hardness of understanding neural circuits
http://arxiv.org/abs/2006.08266