doc. Ing. Petr Červa, Ph.D.

doc. Ing. Petr Červa, Ph.D. QR VCARD
LinePositionDepartmentOffice number
+420 48535 3778EmployeeInstitute of Information Technology and ElectronicsA 02018
+420 48535 3778MemberAcademic Senate of Faculty of MechatronicsA 02018

Publications

  1. L. Matějů, J. Nouza, P. Červa, J. Žďánský, F. Kynych, Combining Multilingual Resources and Models to Develop State-of-the-Art E2E ASR for Swedish, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Dublin, ISCA, p. 3252 - 3256, 5 pages, ISSN: 2308-457X, [Online], 2023
  2. J. Nouza, L. Matějů, P. Červa, J. Žďánský, Developing State-of-the-Art End-to-End ASR for Norwegian, Lecture Notes in Computer Science - including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, Springer Science and Business, ISBN: 978-303140497-9, p. 200-213, 14 pages, ISSN: 03029743, [Online], 2023
  3. M. Poláček, P. Červa, J. Žďánský, L. Weingartová, Online Punctuation Restoration using ELECTRA Model for streaming ASR Systems, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Irsko, ISCA, p. 446-450, 5 pages, ISSN: 2308-457X, [Online], 2023
  4. F. Kynych, J. Žďánský, P. Červa, L. Matějů, Online Speaker Diarization Using Optimized SE-ResNet Architecture, Lecture Notes in Computer Science, Německo, Springer, ISBN: 978-303140497-9, p. 176-187, 12 pages, ISSN: 03029743, [Online], 2023
  5. J. Nouza, P. Červa, J. Žďánský, Lexicon-based vs. Lexicon-free ASR for Norwegian Parliament Speech Transcription, Lecture Notes in Computer Science, SPRINGER-VERLAG BERLIN, ISBN: 978-303116269-5, p. 401-409, 9 pages, ISSN: 0302-9743, [Online], 2022
  6. L. Matějů, F. Kynych, P. Červa, J. Málek, J. Žďánský, Overlapped Speech Detection in Broadcast Streams Using X-vectors, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Jižní Korea, ISCA, p. 4606 - 4610, 4 pages, ISSN: 2308-457X, [Online], 2022
  7. J. Chaloupka, K. Paleček, P. Červa, Audio-visual Broadcast Transcription System Using Artificial Neural Networks, 2021 IEEE International Workshop of Electronics, Control, Measurement, Signals and their Application to Mechatronics, ECMSM 2021, IEEE, ISBN: 978-153861757-1, 5 pages, [Online], 2021
  8. P. Červa, L. Matějů, J. Žďánský, R. Šafařík, J. Nouza, Identification of related languages from spoken data: Moving from off-line to on-line scenario, Computer Speech and Language, Elsevier, 19 pages, ISSN: 0885-2308, [Online], 2021
  9. P. Červa, L. Matějů, F. Kynych, J. Žďánský, J. Nouza, Identification of Scandinavian Languages from Speech Using Bottleneck Features and X-vectors, Lecture Notes in Computer Science, Switzerland, Springer Nature Switzerland AG, ISBN: 978-303083526-2, p. 371-381, 11 pages, ISSN: 0302-9743, [Online], 2021
  10. P. Červa, J. Nouza, J. Václ, L. Weingartová, Multilingvální softwarová technologie pro detekci a včasné upozornění, 2021
  11. L. Matějů, F. Kynych, P. Červa, J. Žďánský, J. Málek, Using X-vectors for Speech Activity Detection in Broadcast Streams, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISCA, ISBN: 978-171383690-2, p. 4161 - 4165, 5 pages, ISSN: 2308-457X, [Online], 2021
  12. P. Červa, V. Volná, L. Weingartová, Dealing with Newly Emerging OOVs in Broadcast Programs by Daily Updates of the Lexicon and Language Model, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 22nd International Conference on Speech and Computer, SPECOM 2020, Switzerland, Springer Nature Switzerland, 1, ISBN: 978-303060275-8, p. 97-107, 11 pages, ISSN: 0302-9743, [Online], 2020
  13. J. Chaloupka, P. Červa, J. Nouza, MyVoice verze 2.0, [Online], 2020
  14. J. Chaloupka, K. Paleček, P. Červa, J. Žďánský, Optical Character Recognition for Audio-Visual Broadcast Transcription System, 11th IEEE International Conference on Cognitive Infocommunications, CogInfoCom 2020 - Proceedings, Finsko, IEEE, 1, ISBN: 978-172818213-1, p. 229-232, 4 pages, [Online], 2020
  15. J. Nouza, P. Červa, J. Žďánský, Very Fast Keyword Spotting System with Real Time Factor below 0.01, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 23rd International Conference on Text, Speech, and Dialogue, TSD 2020, Switzerland, Springer Nature Switzerland, 1, ISBN: 978-303058322-4, p. 426-436, 11 pages, ISSN: 0302-9743, [Online], 2020
  16. L. Matějů, P. Červa, J. Žďánský, An Approach to Online Speaker Change Point Detection Using DNNs and WFSTs, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Austria, ISCA, 1, p. 649-653, 5 pages, ISSN: 2308-457X, 2019
  17. J. Málek, J. Žďánský, P. Červa, Robust Recognition of Conversational Telephone Speech via Multi-Condition Training and Data Augmentation, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 21st International Conference on Text, Speech, and Dialogue, TSD 2018, Springer Verlag, ISBN: 978-303000793-5, p. 324-333, 10 pages, ISSN: 0302-9743, 2018
  18. J. Málek, J. Žďánský, P. Červa, Robust Recognition of Speech with Background Music in Acoustically Under-Resourced Scenarios, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Kanada, IEEE, 1, ISBN: 978-153864658-8, p. 5624-5628, 5 pages, ISSN: 1520-6149, 2018
  19. L. Matějů, P. Červa, J. Žďánský, R. Šafařík, Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Indie, ISCA, 1, p. 1803-1807, 5 pages, ISSN: 2308-457X, 2018
  20. L. Matějů, P. Červa, J. Žďánský, Investigation into the Use of WFSTs and DNNs for Speech Activity Detection in Broadcast Data Transcription, Communications in Computer and Information Science, Spolková republika Německo, Springer Verlag, ISBN: 978-331967875-7, p. 341-358, 18 pages, ISSN: 1865-0929, 2017
  21. J. Nouza, P. Červa, J. Žďánský, S. Čihák, K. Bureš, Multilingvální platforma pro monitoring a analýzu multimédií, 2017
  22. J. Málek, J. Žďánský, P. Červa, Robust Automatic Recognition of Speech with Background Music, 16 June 2017, Article number 7953150, Pages 5210-52142017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017; Hilton New Orleans RiversideNew Orleans; United States; 5 March 2017 through 9 March 2017; Category numberCFP, USA, Institute of Electrical and Electronics Engineers Inc., ISBN: 978-1-5090-4117-6, p. 5210-5214, 5 pages, ISSN: 1520-6149, 2017
  23. L. Matějů, P. Červa, J. Žďánský, J. Málek, Speech Activity Detection in Online Broadcast Transcription Using Deep Neural Networks and Weighted Finite State Transducers, 2017 IEEE IICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedingsnternational Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017, USA, Institute of Electrical and Electronics Engineers Inc., ISBN: 978-1-5090-4117-6, p. 5460-5464, 5 pages, ISSN: 1520-6149, 2017
  24. J. Nouza, R. Šafařík, P. Červa, ASR for south slavic languages developed in almost automated way, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, USA, International Speech and Communication Association, p. 3868-3872, 5 pages, ISSN: 2308-457X, 2016
  25. M. Rott, P. Červa, Speech-to-text summarization using automatic phrase extraction from recognized text, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Switzerland, Springer International Publishing, ISBN: 978-3-319-45509-9, p. 101-108, 8 pages, ISSN: 0302-9743, 2016
  26. J. Málek, P. Červa, L. Šeps, J. Nouza, Study on the use and adaptation of bottleneck features for robust speech recognition of nonlinearly distorted speech, ICETE 2016 - Proceedings of the 13th International Joint Conference on e-Business and Telecommunications, Lisabon, Portugalsko, SciTePress, ISBN: 978-989-758-196-0, p. 65-71, 7 pages, 2016
  27. L. Matějů, P. Červa, J. Žďánský, Study on the use of deep neural networks for speech activity detection in broadcast recordings, ICETE 2016 - Proceedings of the 13th International Joint Conference on e-Business and Telecommunications, Lisabon, Portugalsko, SciTePress, ISBN: 978-989-758-196-0, p. 45-51, 7 pages, 2016
  28. J. Málek, J. Silovský, P. Červa, Z. Koldovský, J. Nouza, J. Žďánský, Compensation of Nonlinear Distortions in Speech for Automatic Recognition, 38th International Conference on Telecommunications and Signal Processing, TSP 2015, Praha, Česká Republika, Institute of Electrical and Electronics Engineers Inc., 1, ISBN: 978-1-4799-8498-5, p. 419-423, 5 pages, 2015
  29. J. Nouza, P. Červa, R. Šafařík, Cross-Lingual Adaptation of Broadcast Transcription System to Polish Language Using Public Data Sources, 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Polsko, Fundancja Uniwersytetu im. Adama Mickiewicza w Poznaniu, 1, ISBN: 978-83-932640-8-7, p. 181-185, 5 pages, 2015
  30. L. Matějů, P. Červa, J. Žďánský, Investigation into the use of deep neural networks for LVCSR of Czech, 2015 IEEE International Workshop of Electronics, Control, Measurement, Signals and their application to Mechatronics, Česká Republika, IEEE, 1, ISBN: 978-1-4799-6972-2, p. 38-41, 4 pages, 2015
  31. M. Rott, P. Červa, Study on Methods for Vector Representation of Text for Topic-based Clustering of News Articles, 7th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Polsko, Fundancja Uniwersytetu im. Adama Mickiewicza w Poznaniu, 1, ISBN: 978-83-932640-8-7, p. 530-534, 5 pages, 2015
  32. J. Nouza, K. Blavka, M. Boháč, P. Červa, J. Málek, System for Producing Subtitles to Internet Audio-Visual Documents, 38th International Conference on Telecommunications and Signal Processing, TSP 2015, Praha, Česká Republika, Institute of Electrical and Electronics Engineers Inc., 1, ISBN: 978-1-4799-8498-5, p. 437-441, 5 pages, 2015
  33. J. Nouza, P. Červa, J. Žďánský, K. Blavka, M. Boháč, J. Silovský, J. Chaloupka, M. Kuchařová, J. Málek, Unikátní softwarová technologická platforma pro přepisy archivů historických i současných pořadů ČRo a jejich zpřístupnění pomocí webu, 2014