The article reports three experiments conducted to determine whether musicians possess better ability of recognising the sources of natural sounds than non-musicians. The study was inspired by reports which indicate that musical training develops not only musical hearing, but also enhances various non-musical auditory capabilities. Recognition and detection thresholds were measured for recordings of environmental sounds presented in quiet (Experiment 1) and in the background of a noise masker (Experiment 2). The listener’s ability of sound source recognition was inferred from the recognition-detection threshold gap (RDTG) defined as the difference in signal level between the thresholds of sound recognition and sound detection. Contrary to what was expected from reports of enhanced auditory abilities of musicians, the RDTGs were not smaller for musicians than for non-musicians. In Experiment 3, detection thresholds were measured with an adaptive procedure comprising three interleaved stimulus tracks with different sounds. It was found that the threshold elevation caused by stimulus interleaving was similar for musicians and non-musicians. The lack of superiority of musicians over non-musicians in the auditory tasks explored in this study is explained in terms of a listening strategy known as casual listening mode, which is a basis for auditory orientation in the environment.
Speech and music signals are multifractal phenomena. The time displacement profile of speech and music signal show strikingly different scaling behaviour. However, a full complexity analysis of their frequency and amplitude has not been made so far. We propose a novel complex network based approach (Visibility Graph) to study the scaling behaviour of frequency wise amplitude variation of speech and music signals over time and then extract their PSVG (Power of Scale freeness of Visibility Graph). From this analysis it emerges that the scaling behaviour of amplitude-profile of music varies a lot from frequency to frequency whereas it’s almost consistent for the speech signal. Our left auditory cortical areas are proposed to be neurocognitively specialised in speech perception and right ones in music. Hence we can conclude that human brain might have adapted to the distinctly different scaling behaviour of speech and music signals and developed different decoding mechanisms, as if following the so called Fractal Darwinism. Using this method, we can capture all non-stationary aspects of the acoustic properties of the source signal to the deepest level, which has huge neurocognitive significance. Further, we propose a novel non-invasive application to detect neurological illness (here autism spectrum disorder, ASD), using the quantitative parameters deduced from the variation of scaling behaviour for speech and music.
The aim of the study was to determine the signal-to-noise ratio (SNR) for the Speech Reception Threshold (SRT) for young persons with normal hearing. The following three tests available for Polish language were used: the New Articulation Lists (NAL-93) version of 2011, the Polish Sentence Test (PST) and the Polish Sentence Matrix Test (PSMT). When using PST and PSMT the masking signal was babble noise made of the language material contained in the test. For NAL-93 the masking signal was speech noise. The speech reception threshold (SRT) was found to be (−6:8 ± 1.1), (−4:8 ± 1.6), (−3:5 ± 1.8) and (−3:4 ± 2.0) dB SNR for PST, PSMT, NAL-93 (constant stimuli method) and NAL-93 (short method), respectively. The values of SRT depend on semantic redundancy of the language material. Differences in SRT were statistically non-significant only for NAL-93 (constant stimuli method) and NAL-93 (short method). Moreover, it was shown that the time needed for presentation of a single word list (NAL-93, short method) or single sentence list (PST, PSMT) was comparable and equal to 2–3 minutes. The most uniform SRT values were obtained for PST. The PSMT was the least demanding for the listener, experimenter and equipment.
Listening tests have been carried out to quantify the significance of binaural auralization over monaural auralization in accordance with the acoustic properties of the enclosure. To this end, acoustic rendering of three different rooms were generated based on synthesized monaural (two channels with the same audio material) and binaural room impulse responses. The auralizations were evaluated by means of subjective tests using headphones with non-individualized equalization. Parameters, such as localization, spatial impression and realism, were taken into consideration to determine the relevance of providing binaural information for the auralization of a given room. The analysis of the data has been conducted following a statistical approach based on ANOVA and Pearson correlation. The results indicate that spatial perception is strongly dependent on the acoustic characteristics of the rooms and on the listening condition of the audio material. Furthermore, as expected, advantages of binaural rendering in terms of source localization was also confirmed.
Although the emotions and learning based on emotional reaction are individual-specific, the main features are consistent among all people. Depending on the emotional states of the persons, various physical and physiological changes can be observed in pulse and breathing, blood flow velocity, hormonal balance, sound properties, face expression and hand movements. The diversity, size and grade of these changes are shaped by different emotional states. Acoustic analysis, which is an objective evaluation method, is used to determine the emotional state of people’s voice characteristics. In this study, the reflection of anxiety disorder in people’s voices was investigated through acoustic parameters. The study is a case-control study in cross-sectional quality. Voice recordings were obtained from healthy people and patients. With acoustic analysis, 122 acoustic parameters were obtained from these voice recordings. The relation of these parameters to anxious state was investigated statistically. According to the results obtained, 42 acoustic parameters are variable in the anxious state. In the anxious state, the subglottic pressure increases and the vocalization of the vowels decreases. The MFCC parameter, which changes in the anxious state, indicates that people can perceive this situation while listening to the speech. It has also been shown that text reading is also effective in triggering the emotions. These findings show that there is a change in the voice in the anxious state and that the acoustic parameters are influenced by the anxious state. For this reason, acoustic analysis can be used as an expert decision support system for the diagnosis of anxiety.
Recently, the rapid advancement of the IT industry has resulted in significant changes in audio-system configurations; particularly, the audio over internet protocol (AoIP) network-based audio-transmission technology has received favourable evaluations in this field. Applying the AoIP in a certain section of the multiple-cable zone is advantageous because the installation cost is lower than that for the existing systems, and the original sound is transmitted without any distortion. The existing AoIP-based technology, however, cannot control the audio-signal characteristics of every device and can only transmit multiple audio signals through a network. In this paper, the proposed Audio Network & Control Hierarchy Over peer-to-peer (Anchor) system enables all audio equipment to send and receive signals via a data network, and the receiving device can mix the signals of different IPs. Accordingly, it was possible to improve the system-application flexibility by simplifying the audio-system configuration. The research results confirmed that the received audio signals from different IPs were received, mixed, and output without errors. It is expected that Anchor will become a standard for audio-network protocols.
An original model based on first principles is constructed for the temporal correlation of acoustic waves propagating in random scattering media. The model describes the dynamics of wave fields in a previously unexplored, moderately strong (mesoscopic) scattering regime, intermediate between those of weak scattering, on the one hand, and diffusing waves, on the other. It is shown that by considering the wave vector as a free parameter that can vary at will, one can provide an additional dimension to the data, resulting in a tomographic-type reconstruction of the full space-time dynamics of a complex structure, instead of a plain spectroscopic technique. In Fourier space, the problem is reduced to a spherical mean transform defined for a family of spheres containing the origin, and therefore is easily invertible. The results may be useful in probing the statistical structure of various random media with both spatial and temporal resolution.
Ray tracing simulation of sound field in rooms is a common tool in room acoustic design for predicting impulse response. There are numerous commercial engineering tools utilising ray tracing simulation. A specific problem in the simulation is the modelling of diffuse reflections when contribution of individual surface is prevailing. The paper introduces modelling of scattering which is interesting when the whole impulse response of a room is not a goal but contribution of certain surface. The main goal of the project is to shape directivity characteristics of scattered reflection. Also, an innovative approach is suggested for converting the energy histogram information obtained by ray tracing into an “equivalent impulse response”. The proposed algorithm is tested by comparing the results with measurements in a real sound field, realised in a scaled model where a diffusing surface is hardware-implemented.
This paper proposes an analytical model to describe the interaction of a bounded ultrasonic beam with an immersed plate. This model, based on the Gaussian beams decomposition, takes into account multiple reflections into the plate. It allows predicting three-dimensional spatial distributions of both transmitted and reflected fields. Thereby, it makes it easy to calculate the average pressure over the receiver’s area taking into account diffraction losses. So the acoustical parameters of the plate can be determined more accurately. A Green’s function for the interaction of an ultrasonic beam with the plate is derived. The obtained results are compared to those given by the angular spectrum approach. A good agreement is seen showing the validity of the proposed model.
A mixed pseudo-orthogonal frequency coding (Mixed-POFC) structure is proposed as a new spreadspectrum technique in this paper, which employs frequency and time diversity to enhance tag properties and balances the spectrum utilization and code diversity. The coding method of SAW RFID tags in this paper uses Mixed-POFC with multi-track chip arrangements. The cross-correlation and auto correlation of Mixed-POFC and POFC are calculated to demonstrate the reduced overlap between the adjacent center frequencies with the Mixed-POFC method. The center frequency of the IDT and Bragg reflectors is calculated by a coupling of modes (COM) module. The combination of the calculation results of the Bragg reflectors shows that compared with a 7-chip POFC, the coding number of a 7-chip Mixed-POFC is increased from 120 to 144 with the same fractional bandwidth of 12%. To demonstrate the validity of Mixed-POFC, finite element analysis (FEA) technology is used to analyze the frequency characteristics of Mixed-POFC chips. The maximum error between designed frequencies and simulation frequencies is only 1.7%, which verifies that the Mixed-POFC method is feasible.
Passive noise reduction means are commonly used to reduce noise in the industry but, unfortunately, their effectiveness is poor in the low frequency range. By applying active structural acoustic control to the enclosure walls significant improvement of the insulating properties in this frequency range can be achieved. In this paper a model of double panel structure with ASAC is presented. The structure consists of two aluminium plates separated by an air gap. Two inertial magnetoelectric actuators and two piezoceramic MFC sensors were used for controlling the structure. A multichannel FxLMS algorithm with virtual error microphone technique is used as a control algorithm. The signal of a virtual error microphone is extrapolated basing on signals from MFC sensors. Performance of this actively controlled structure for tonal signals at selected frequencies is presented in the article. During the study, a double panel structure was mounted on one wall of sound insulating enclosure located in an acoustic chamber. During the measurements local and global reduction of noise test signal was investigated.
New Year’s Eve is an example of a situation in which urban residents are exposed to an almost continuous and increased noise level from the impulsive sounds sources – fireworks. This custom has become a source of many controversies related to the protection of human and animal health or environmental pollution. However, current legal regulations only slightly affect the subject of noise of fireworks and its harmfulness. Currently, it does not seem possible to completely prohibit the use of fireworks in urban areas, but this does not mean that it is not possible to decrease the degree of their annoyance.
The paper consists the issues of identification, analysis and assessment of impulsive noise of fireworks and acoustic climate during New Year’s Eve. Material presented refers to measurements of time series, frequency spectrum and values of noise parameters of selected fireworks. It was presented, among others, that the measured values exceed the criteria for occupational noise (LCpeak), due to the direct hazard of hearing loss, from 1.8 dB at a distance of 25 m and 6.2 dB at a distance of 15 m. Also this work discusses results of impulsive noise measurements of fireworks recorded during New Year’s Eve in years 2016–2017. Material refers to measurements at three measurement points spread over the city of Kraków. Obtained results were compared with typical noise levels for night time in urban area, indicating also the main sources of annoyance and hazard from this type of noise.
For building applications, woven fabrics have been widely used as finishing elements of room interior but not in particular aimed for sound absorbers. Considering the micro perforation of the woven fabrics, they should have potential to be used as micro-perforated panel (MPP) absorbers; some measurement results indicated such absorption ability. Hence, it is of importance to have a sound absorption model of the woven fabrics to enable us predicting their sound absorption characteristic that is beneficial in engineering design phase. Treating the woven fabric as a rigid frame, a fluid equivalent model is employed based on the formulation of Johnson-Champoux-Allard (JCA). The model obtained is then validated by measurement results where three kinds of commercially available woven fabrics are evaluated by considering their perforation properties. It is found that the model can reasonably predict their sound absorption coefficients. However, the presence of perturbations in pores give rise to inaccuracy of resistive component of the predicted surface impedance. The use of measured static flow resistive and corrected viscous length in the calculations are useful to cope with such a situation. Otherwise, the use of an optimized simple model as a function of flow resistivity is also applicable for this case.
The locally resonant phononic crystal (LRPC) composite double panel structure (DPS) made of a twodimensional periodic array of a two-component cylindrical LR pillar connected between the upper and lower composite plates is proposed. The plates are composed of two kinds of materials and periodically etched holes. In order to reveal the bandgap properties of structure theoretically, the band structures, displacement fields of eigenmodes and transmission power spectrums of corresponding 8 × 8 finite structure are calculated and displayed by using finite element method (FEM). Numerical results and further analysis demonstrate that if the excitation and response points are picked on different sides of the structure, a wide band gap with low starting frequency is opened, which can be treated as the coupling between dominant vibrations of pillars and plate modes. In addition, the influences of filled-in rubber, etched hole and viscidity of soft material on band gap are studied and understood with the help of “base-spring-mass” simplified model.
Journal bearings are the most common type of bearings in which a shaft freely rotates in a metallic sleeve. They find a lot of applications in industry, especially where extremely high loads are involved. Proper analysis of the various bearing faults and predicting the modes of failure beforehand are essential to increase the working life of the bearing. In the current study, the vibration data of a journal bearing in the healthy condition and in five different fault conditions are collected. A feature extraction method is employed to classify the different fault conditions. Automatic fault classification is performed using artificial neural networks (ANN). As the probability of a correct prediction goes down for a higher number of faults in ANN, the method is made more robust by incorporating deep neural networks (DNN) with the help of autoencoders. Training was done using the scaled conjugate gradient algorithm and the performance was calculated by the cross entropy method. Due to the increased number of hidden layers in DNN, it is possible to achieve a high efficiency of 100% with the feature extraction method.
A plenum window with incorporation of Helmholtz resonators in between two glass panes was tested in a reverberation room. The effects of jagged flap on reducing strength of diffracted sound was also investigated in the present studies where white, traffic and construction noises were examined during each set of experiment. When the noise source was located at the central line of the plenum window, the plenum window with Helmholtz resonators was able to mitigate 8.5 dBA, 8.9 dBA and 8.2 dBA of white, traffic and construction noises, respectively, compared with the case of without window. These amounts of noises that attenuated by the plenum window were slightly higher than the case where noise source was diverged 30º away from the plenum window. The effects of jagged flaps on the acoustical performance of the plenum window were negligible. The Helmholtz resonators had the best performance in the frequency region between 900 Hz to 1300 Hz where in this frequency range, the plenum window with Helmholtz resonators was able to attenuate additional 1.7 dBA, 1.9 dBA and 1.6 dBA of white, traffic and construction noises, respectively, compared with the case of without resonators.