liu.seSearch for publications in DiVA
Change search
Link to record
Permanent link

Direct link
Alternative names
Publications (10 of 192) Show all publications
Bae, C. & Gustafsson, O. (2025). Architectural Trade-Offs for High-Speed Real-Time Chromatic Dispersion Compensation FIR Filters With Time-Multiplexed Streaming FFTs. Journal of Lightwave Technology, 43(14), 6744-6753
Open this publication in new window or tab >>Architectural Trade-Offs for High-Speed Real-Time Chromatic Dispersion Compensation FIR Filters With Time-Multiplexed Streaming FFTs
2025 (English)In: Journal of Lightwave Technology, ISSN 0733-8724, E-ISSN 1558-2213, Vol. 43, no 14, p. 6744-6753Article in journal (Refereed) Published
Abstract [en]

Fiber-optic communication faces challenges to address impairment such as chromatic dispersion, which distorts signals, necessitating efficient chromatic dispersion compensation (CDC) solutions. In this work, architectural trade-offs when implementing finite-length impulse response (FIR) filters in the frequency-domain for high-speed CDC is presented. Most earlier works have considered a fully parallel FFT. However, as shown in this work, it is possible to reduce the power consumption by increasing the FFTs size, leading to time-multiplexed streaming FFT architectures. The results, implemented for a tentative 400G 16-QAM system, illustrates that traditionally used complexity measures, such as number of multiplications per sample, are not suitable when determining the best FFT size. It is also shown that this size is not necessarily a power of two. Instead, a ratio between the block length and the filter length of between one and two provides the best results in this case, lower than the commonly used multiplications per sample estimate. The results highlights the impact of the data shuffling, required for time-multiplexed FFTs, on the power consumption, but also the possibilities to lower the power consumption by not restricting to power-of-two FFTs.

Place, publisher, year, edition, pages
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 2025
Keywords
Finite impulse response filters, Optical fiber filters, Frequency-domain analysis, Complexity theory, Clocks, Transforms, Optical fibers, Chromatic dispersion, Parallel processing, Optical signal processing
National Category
Signal Processing
Identifiers
urn:nbn:se:liu:diva-216638 (URN)10.1109/jlt.2025.3558510 (DOI)001530265700020 ()2-s2.0-105002488744 (Scopus ID)
Funder
ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Available from: 2025-08-19 Created: 2025-08-19 Last updated: 2025-09-03
Henriksson, M., Lindberg, T. & Gustafsson, O. (2024). APyTypes: Algorithmic Data Types in Python for Efficient Simulation of Finite Word-Length Effects. In: Proceedings of the IEEE 31st Symposium on Computer Arithmetic (ARITH): . Paper presented at IEEE 31st Symposium on Computer Arithmetic (ARITH), Malaga, Spain (pp. 72-75). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>APyTypes: Algorithmic Data Types in Python for Efficient Simulation of Finite Word-Length Effects
2024 (English)In: Proceedings of the IEEE 31st Symposium on Computer Arithmetic (ARITH), Institute of Electrical and Electronics Engineers (IEEE), 2024, p. 72-75Conference paper, Published paper (Refereed)
Abstract [en]

A new Python library, APyTypes, suitable for simulating and exploring finite word-length effects is presented. The library supports configurable bit-accurate fixed- and floating-point types of both scalars and multidimensional arrays and uses a C++ backend to accelerate runtime performance. The underlying design principles of the library are introduced and examples show how it can be used. We argue that APyTypes have significant advantages over existing arithmetic libraries, especially from a hardware design perspective. Finally, some directions for further work are outlined.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
National Category
Computer Systems
Identifiers
urn:nbn:se:liu:diva-205936 (URN)10.1109/ARITH61463.2024.00021 (DOI)001267256100021 ()2-s2.0-85198640146 (Scopus ID)9798350384321 (ISBN)9798350384338 (ISBN)
Conference
IEEE 31st Symposium on Computer Arithmetic (ARITH), Malaga, Spain
Note

Funding Agencies|ELLIIT strategic research environment through the project D3 ACRE - Approximate Computing Reducing Energy; Swedish Foundation for Strategic Research through the project Large Intelligent Surfaces - Architecture and Hardware

Available from: 2024-07-12 Created: 2024-07-12 Last updated: 2025-08-26
Henriksson, M., Winbladh, H. & Gustafsson, O. (2024). Multi-Stream FFT Architectures for a Distributed MIMO Large Intelligent Surfaces Testbed. In: Proceeding of the IEEE Nordic Circuits and Systems Conference (NorCAS): . Paper presented at IEEE Nordic Circuits and Systems Conference (NorCAS), Lund, Sweden, 29-30 October, 2024. Lund, Sweden
Open this publication in new window or tab >>Multi-Stream FFT Architectures for a Distributed MIMO Large Intelligent Surfaces Testbed
2024 (English)In: Proceeding of the IEEE Nordic Circuits and Systems Conference (NorCAS), Lund, Sweden, 2024Conference paper, Published paper (Refereed)
Abstract [en]

A modern communication testbed based on distributed multiple-input multiple-output (MIMO) is constructed at Lund University. This testbed uses 16-antenna panels as a base for the distributed MIMO system. As the mixers in these panels are fully digital, there is a rational relationship between the sample frequency and the clock frequency of the field-programmable gate-array that performs the digital baseband processing. This rational relationship, combined with the multiple data streams in the panel, makes for an interesting design challenge when implementing high-utilization circuits. Three unique 16 multi-stream fast Fourier transform architectures tailored toward this particular distributed MIMO testbed are presented. The proposed architectures take the rational relationship between clock frequency and sample frequency into consideration and achieve 96.25%–100% butterfly utilization in the multi-streamFFT.

Place, publisher, year, edition, pages
Lund, Sweden: , 2024
Keywords
fast Fourier transform (FFT), field-programmable gate-array (FPGA), distributed multiple-input multiple-output (MIMO), multi-stream
National Category
Computer Engineering Signal Processing
Identifiers
urn:nbn:se:liu:diva-209881 (URN)10.1109/NorCAS64408.2024.10752445 (DOI)001444043400008 ()2-s2.0-85211892823 (Scopus ID)9798331517670 (ISBN)9798331517663 (ISBN)
Conference
IEEE Nordic Circuits and Systems Conference (NorCAS), Lund, Sweden, 29-30 October, 2024
Funder
Swedish Foundation for Strategic Research, CHI19-0001ELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Note

Funding Agencies|Swedish Foundation for Strategic Research (SSF) [CHI19-0001]; Excellence Center at Linkoping-Lund in Information Technology (ELLIIT)

Available from: 2024-11-19 Created: 2024-11-19 Last updated: 2025-04-17
Skarman, F. & Gustafsson, O. (2023). Abstraction in the Spade Hardware Description Language. In: : . Paper presented at LATTE ’23 - Workshop on Languages, Tools, and Techniques for Accelerator Design, Vancouver, BC, Canada, March 26, 2023.
Open this publication in new window or tab >>Abstraction in the Spade Hardware Description Language
2023 (English)Conference paper, Oral presentation only (Other academic)
Abstract [en]

Spade is an HDL that enhances the productivity of HDL designers byadding useful abstractions for hardware design. These abstractionsare zero- or low-cost, meaning that the designer still has full controlover what hardware gets generated.

Keywords
Hardware description languages, languages and compilers, Design automation
National Category
Computer Engineering
Identifiers
urn:nbn:se:liu:diva-198342 (URN)
Conference
LATTE ’23 - Workshop on Languages, Tools, and Techniques for Accelerator Design, Vancouver, BC, Canada, March 26, 2023
Available from: 2023-10-05 Created: 2023-10-05 Last updated: 2024-12-05Bibliographically approved
Khan, M. T. & Gustafsson, O. (2023). Analyzing Step-Size Approximation for Fixed-Point Implementation of LMS and BLMS Algorithms. In: 2023 IEEE Nordic Circuits and Systems Conference (NorCAS): . Paper presented at 31 October 2023 - 01 November 2023, Aalborg, Denmark. IEEE
Open this publication in new window or tab >>Analyzing Step-Size Approximation for Fixed-Point Implementation of LMS and BLMS Algorithms
2023 (English)In: 2023 IEEE Nordic Circuits and Systems Conference (NorCAS), IEEE, 2023Conference paper, Published paper (Refereed)
Abstract [en]

In this work, we analyze the step-size approximation for fixed-point least-mean-square (LMS) and block LMS (BLMS) algorithms. Our primary focus is on investigating how step size approximation impacts the convergence rate and steady-state mean square error (MSE) across varying block sizes and filter lengths. We consider three different FP quantized LMS and BLMS algorithms. The results demonstrate that the algorithm with two quantizers in single precision behaves approximately the same as one quantizer under quantized weights, regardless of block size and filter lengths. Subsequently, we explore the approximation effects of nearest power-of-two and their combinations with different design parameters on the convergence performance. Simulation results for within the context of a system identification problem under these approximations reveal intriguing insights. For instance, a single quantizer algorithm without quantized error is more robust than its counterpart under these approximations. Additionally, both single quantizer algorithms with combined power-of-two approximations matches the behavior of the actual step-size.

Place, publisher, year, edition, pages
IEEE, 2023
National Category
Signal Processing
Identifiers
urn:nbn:se:liu:diva-199293 (URN)10.1109/NorCAS58970.2023.10305481 (DOI)001103249500039 ()9798350337570 (ISBN)9798350337587 (ISBN)
Conference
31 October 2023 - 01 November 2023, Aalborg, Denmark
Available from: 2023-11-24 Created: 2023-11-24 Last updated: 2024-01-17Bibliographically approved
Skarman, F., Klemmer, L., Gustafsson, O. & Große, D. (2023). Enhancing Compiler-Driven HDL Design withAutomatic Waveform Analysis. In: Forum on Specification, Verification and Design Languages, FDL: . Paper presented at Forum on Specification, Verification and Design Languages, FDL. IEEE conference proceedings
Open this publication in new window or tab >>Enhancing Compiler-Driven HDL Design withAutomatic Waveform Analysis
2023 (English)In: Forum on Specification, Verification and Design Languages, FDL, IEEE conference proceedings, 2023Conference paper, Published paper (Refereed)
Abstract [en]

The time-to-market of a new product is one of its most crucial factors for success, therefore, reducing this time is of utter importance. However, this reduction must not come at the expense of a less thorough development process. This paper presents a compiler-driven approach for automatically analyzing metrics such as transaction delays or bus throughput on simulation waveforms of projects developed in the Spade Hardware Description Language (HDL). By utilizing the Spade compiler’s knowledge about design internals, an automatic analysis of the waveforms created during simulation is possible using the Waveform Analysis Language (WAL). Analysis programs can be bundled with Spade projects or libraries, such that they are automatically detected by Spade and can be reused by other projects using simple annotations. We call these bundled WAL programs analysis passes, since they fit into the Spade workflow and provide thorough analysis at no additional cost to the users of these libraries. In a detailed description, we present how new analysis passes can be defined using the example of a data streaming interface. Additionally, we highlight the possibilities of analysis passes in two case studies, including Finite State Machine (FSM) and Wishbone protocol analysis.

Place, publisher, year, edition, pages
IEEE conference proceedings, 2023
Keywords
Performance Analysis, Hardware Description Languages, Debugging
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-209367 (URN)10.1109/FDL59689.2023.10272204 (DOI)
Conference
Forum on Specification, Verification and Design Languages, FDL
Available from: 2024-11-11 Created: 2024-11-11 Last updated: 2025-06-03
Bae, C. & Gustafsson, O. (2023). FFT-Size Implementation Tradeoffs for Chromatic Dispersion Compensation Filters. In: Advanced Photonics Congress 2023: . Paper presented at Signal Processing in Photonic Communications 2023, Busan, Republic of Korea, 9–13 July, 2023.. Optical Society of America
Open this publication in new window or tab >>FFT-Size Implementation Tradeoffs for Chromatic Dispersion Compensation Filters
2023 (English)In: Advanced Photonics Congress 2023, Optical Society of America, 2023Conference paper, Published paper (Refereed)
Abstract [en]

FIR filtering realized in frequency domain can use different FFT sizes leading to different arithmetic complexities. The implementation results indicate that not only arithmetic complexities must be considered for minimal power consumption.

Place, publisher, year, edition, pages
Optical Society of America, 2023
Series
Technical Digest Series ; SpTu3E.1
National Category
Signal Processing
Identifiers
urn:nbn:se:liu:diva-199284 (URN)10.1364/SPPCOM.2023.SpTu3E.1 (DOI)9781957171265 (ISBN)
Conference
Signal Processing in Photonic Communications 2023, Busan, Republic of Korea, 9–13 July, 2023.
Available from: 2023-11-23 Created: 2023-11-23 Last updated: 2024-11-07Bibliographically approved
Skarman, F. & Gustafsson, O. (2023). Spade: An Expression-Based HDL With Pipelines. In: Proceedings of the 3rd Workshop on Open-Source Design Automation (OSDA), 2023: . Paper presented at 3rd Workshop on Open-Source Design Automation (OSDA) (pp. 7-12).
Open this publication in new window or tab >>Spade: An Expression-Based HDL With Pipelines
2023 (English)In: Proceedings of the 3rd Workshop on Open-Source Design Automation (OSDA), 2023, 2023, p. 7-12Conference paper, Published paper (Refereed)
Abstract [en]

Spade is a new open source hardware descriptionlanguage (HDL) designed to increase developer productivitywithout sacrificing the low-level control offered by HDLs. Itis a standalone language which takes inspiration from modernsoftware languages, and adds useful abstractions for commonhardware constructs. It also comes with a convenient set of tool-ing, such as a helpful compiler, a build system with dependencymanagement, tools for debugging, and editor integration.

Keywords
Hardware description languages, languages and compilers, Design automation
National Category
Computer Engineering
Identifiers
urn:nbn:se:liu:diva-198341 (URN)10.48550/arXiv.2304.03079 (DOI)
Conference
3rd Workshop on Open-Source Design Automation (OSDA)
Available from: 2023-10-05 Created: 2023-10-05 Last updated: 2025-02-06Bibliographically approved
Henriksson, M. & Gustafsson, O. (2023). Streaming Matrix Transposition on FPGAs Using Distributed Memories. In: Proceeding of the IEEE Nordic Circuits and Systems Conference (NorCAS): . Paper presented at 2023 IEEE Nordic Circuits and Systems Conference (NorCAS), Aalborg, Denmark, 31 October - 01 November 2023. Aalborg, Denmark: Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Streaming Matrix Transposition on FPGAs Using Distributed Memories
2023 (English)In: Proceeding of the IEEE Nordic Circuits and Systems Conference (NorCAS), Aalborg, Denmark: Institute of Electrical and Electronics Engineers (IEEE), 2023Conference paper, Published paper (Refereed)
Abstract [en]

Matrix transposition, the procedure of swapping rows and columns of a matrix, has applications in various signal processing applications, such as massive multiple-input multiple-output (MIMO) communication systems, data compression, and multidimensional fast Fourier transforms – which are used in MIMO radar systems. In low-latency high-throughput streaming applications, specialized circuits for matrix transposition are needed in order to perform transposition in real-time. This is in contrast to "slower" applications, where transposition can be adequately performed by storing a matrix in a shared memory and afterward reading it back in a transposed order. In this paper, a design procedure for streaming matrix transposition on field-programmable gate arrays (FPGAs) using distributed memories is presented. It is shown that significantly fewer FPGA resources are required for small- to medium-sized streaming matrix transpositions compared to recent related works.

Place, publisher, year, edition, pages
Aalborg, Denmark: Institute of Electrical and Electronics Engineers (IEEE), 2023
Keywords
streaming data processing, matrix transposition, linear arithmetic, interleaver, FPGA, distributed memories
National Category
Computer Engineering
Identifiers
urn:nbn:se:liu:diva-199025 (URN)10.1109/NorCAS58970.2023.10305472 (DOI)001103249500031 ()9798350337570 (ISBN)9798350337587 (ISBN)
Conference
2023 IEEE Nordic Circuits and Systems Conference (NorCAS), Aalborg, Denmark, 31 October - 01 November 2023
Funder
Swedish Foundation for Strategic ResearchELLIIT - The Linköping‐Lund Initiative on IT and Mobile Communications
Note

Funding: Swedish Foundation for Strategic Research (SSF) [CHI19-0001]; Excellence Center at Linkoping-Lund in Information Technology (ELLIIT)

Available from: 2023-11-07 Created: 2023-11-07 Last updated: 2024-12-03Bibliographically approved
Khan, M. T. & Gustafsson, O. (2022). ASIC Implementation Trade-Offs for High-Speed LMS and Block LMS Adaptive Filters. In: 65th International Midwest Symposium on Circuits and Systems (MWSCAS): . Paper presented at 2022 IEEE 65th International Midwest Symposium on Circuits and Systems (MWSCAS), Fukuoka, Japan, 07-10 August, 2022 (pp. 1-4). Fukuoka, Japan: Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>ASIC Implementation Trade-Offs for High-Speed LMS and Block LMS Adaptive Filters
2022 (English)In: 65th International Midwest Symposium on Circuits and Systems (MWSCAS), Fukuoka, Japan: Institute of Electrical and Electronics Engineers (IEEE), 2022, p. 1-4Conference paper, Published paper (Other academic)
Abstract [en]

In this work, implementation trade-offs for ASIC-implementation of least-mean-square (LMS) and block LMS (BLMS) adaptive filters are presented. We explore the design trade-offs by increasing the block size and/or relying on the synthesis tool for increased sample rate. For area, lower block size is advantageous as long as the synthesis tool can meet timing. Energy optimum is however found at a different point in design space. Simulation confirms that longer block sizes leads to lower MSE errors for identical step-size. Hence, the design-point should be decided based on weighted requirements for area, energy and MSE.

Place, publisher, year, edition, pages
Fukuoka, Japan: Institute of Electrical and Electronics Engineers (IEEE), 2022
Series
Midwest Symposium on Circuits and Systems. Conference Proceedings, E-ISSN 1558-3899
National Category
Engineering and Technology Signal Processing
Identifiers
urn:nbn:se:liu:diva-188813 (URN)10.1109/MWSCAS54063.2022.9859296 (DOI)000861351300017 ()9781665402798 (ISBN)
Conference
2022 IEEE 65th International Midwest Symposium on Circuits and Systems (MWSCAS), Fukuoka, Japan, 07-10 August, 2022
Available from: 2022-09-26 Created: 2022-09-26 Last updated: 2023-11-16Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-3470-3911

Search in DiVA

Show all publications