-
Leading edge vortex formation and wake trajectory: Synthesizing measurements, analysis, and machine learning
Authors:
Howon Lee,
Nicholas Simone,
Yunxing Su,
Yuanhang Zhu,
Bernardo Luiz R. Ribeiro,
Jennifer A. Franck,
Kenneth Breuer
Abstract:
The strength and trajectory of a leading edge vortex (LEV) formed by a pitching-heaving hydrofoil (chord $c$) is studied. The LEV is identified using the $Q$-criterion method, which is calculated from the 2D velocity field obtained from PIV measurements. The relative angle of attack at mid-stroke, ${α_{T/4}} $, proves to be an effective method of combining heave amplitude ($h_0/c$), pitch amplitud…
▽ More
The strength and trajectory of a leading edge vortex (LEV) formed by a pitching-heaving hydrofoil (chord $c$) is studied. The LEV is identified using the $Q$-criterion method, which is calculated from the 2D velocity field obtained from PIV measurements. The relative angle of attack at mid-stroke, ${α_{T/4}} $, proves to be an effective method of combining heave amplitude ($h_0/c$), pitch amplitude ($θ_0$), and reduced frequency ($f^*$) into a single variable that predicts the maximum value of $Q$ over a wide range of operating conditions. Once the LEV separates from the foil, it travels downstream and rapidly weakens and diffuses. The downstream trajectory of the LEV has two characteristic shapes. At low values of ${α_{T/4}}$, it travels straight downstream after separating from the foil, while at higher values of ${α_{T/4}} $, an accompanying Trailing Edge Vortex (TEV) forms and the induced velocity generates a cross-stream component to the vortex trajectories. This behavior is accurately predicted using a potential flow model for the LEV and TEV. Supervised machine learning algorithms, namely Support Vector Regression and Gaussian Process Regression, are used to create regression models that predicts the vortex strength, shape and trajectory during growth and after separation. The regression model successfully captures the features of two vortex regimes observed at different values of ${α_{T/4}} $. However, the predicted LEV trajectories are somewhat smoother than observed in the experiments. The strengths of the vortex is often under-predicted. Both of these shortcomings may be attributed to the relatively small size of the training data set.
△ Less
Submitted 25 May, 2022;
originally announced May 2022.
-
A Machine Learning Approach to Classify Vortex Wakes of Energy Harvesting Oscillating Foils
Authors:
Bernardo Luiz R. Ribeiro,
Jennifer A. Franck
Abstract:
A machine learning model is developed to establish wake patterns behind oscillating foils whose kinematics are within the energy harvesting regime. The role of wake structure is particularly important for array deployments of oscillating foils, since the unsteady wake highly influences performance of downstream foils. This work explores 46 oscillating foil kinematics, with the goal of parameterizi…
▽ More
A machine learning model is developed to establish wake patterns behind oscillating foils whose kinematics are within the energy harvesting regime. The role of wake structure is particularly important for array deployments of oscillating foils, since the unsteady wake highly influences performance of downstream foils. This work explores 46 oscillating foil kinematics, with the goal of parameterizing the wake based on the input kinematic variables and grouping vortex wakes through image analysis of vorticity fields. A combination of a convolutional neural network (CNN) with long short-term memory (LSTM) units is developed to classify the wakes into three groups. To fully verify the physical wake differences among foil kinematics, a convolutional autoencoder combined with k-means++ clustering is utilized and four different wake patterns are found. With the classification model, these patterns are associated with a range of foil kinematics. Future work can use these correlations to predict the performance of foils placed in the wake and build optimal foil arrangements for tidal energy harvesting.
△ Less
Submitted 5 November, 2022; v1 submitted 12 May, 2022;
originally announced May 2022.
-
A Machine Learning Approach to Classify Kinematics and Vortex Wake Modes of Oscillating Foils
Authors:
Bernardo Luiz R. Ribeiro,
Jennifer A. Franck
Abstract:
Machine learning techniques have received attention in fluid dynamics in terms of predicting, clustering and classifying complex flow physics. One application has been the classification or clustering of various wake structures that emanate from bluff bodies such as cylinders or flapping foils, creating a rich diversity of vortex formations specific to flow conditions, geometry, and/or kinematics…
▽ More
Machine learning techniques have received attention in fluid dynamics in terms of predicting, clustering and classifying complex flow physics. One application has been the classification or clustering of various wake structures that emanate from bluff bodies such as cylinders or flapping foils, creating a rich diversity of vortex formations specific to flow conditions, geometry, and/or kinematics of the body. When utilizing oscillating foils to harvest energy from tidal or river flows, it is critical to understand the intricate and nonlinear relationship between flapping kinematics and the downstream vortex wake structure for optimal siting and operation of arrays. This paper develops a classification model to obtain groups of kinematics that contain similar wake patterns within the energy harvesting regime. Data is obtained through simulations of 27 unique oscillating foil kinematics for a total of 13,650 samples of the wake vorticity field. Within these samples three groups are visually labeled based on the relative angle of attack. A machine learning approach combining a convolutional neural network (CNN) with long short-term memory (LSTM) units is utilized to automatically classify the wakes into the three groups. The average accuracy on five test data subsets is 80% when the three visually labeled groups are used for classification. After analyzing the test subset with lowest accuracy, an update on the group division boundaries is proposed. With this update, the algorithm achieves an average accuracy of 90%, demonstrating that the three groups are able to discern distinct wake structures within a range of energy harvesting kinematics.
△ Less
Submitted 4 August, 2021;
originally announced August 2021.
-
Moiré-localized interlayer exciton wavefunctions captured by imaging its electron and hole constituents
Authors:
Ouri Karni,
Elyse Barré,
Vivek Pareek,
Johnathan D. Georgaras,
Michael K. L. Man,
Chakradhar Sahoo,
David R. Bacon,
Xing Zhu,
Henrique B. Ribeiro,
Aidan L. O'Beirne,
Jenny Hu,
Abdullah Al-Mahboob,
Mohamed M. M. Abdelrasoul,
Nicholas S. Chan,
Arka Karmakar,
Andrew J. Winchester,
Bumho Kim,
Kenji Watanabe,
Takashi Taniguchi,
Katayun Barmak,
Julien Madéo,
Felipe H. da Jornada,
Tony F. Heinz,
Keshav M. Dani
Abstract:
Interlayer excitons (ILXs) - electron-hole pairs bound across two atomically thin layered semiconductors - have emerged as attractive platforms to study exciton condensation, single-photon emission and other quantum-information applications. Yet, despite extensive optical spectroscopic investigations, critical information about their size, valley configuration and the influence of the moiré potent…
▽ More
Interlayer excitons (ILXs) - electron-hole pairs bound across two atomically thin layered semiconductors - have emerged as attractive platforms to study exciton condensation, single-photon emission and other quantum-information applications. Yet, despite extensive optical spectroscopic investigations, critical information about their size, valley configuration and the influence of the moiré potential remains unknown. Here, we captured images of the time- and momentum-resolved distribution of both the electron and the hole that bind to form the ILX in a WSe2/MoS2 heterostructure. We thereby obtain a direct measurement of the interlayer exciton diameter of ~5.4 nm, comparable to the moiré unit-cell length of 6.1 nm. Surprisingly, this large ILX is well localized within the moiré cell to a region of only 1.8 nm - smaller than the size of the exciton itself. This high degree of localization of the interlayer exciton is backed by Bethe-Salpeter equation calculations and demonstrates that the ILX can be localized within small moiré unit cells. Unlike large moiré cells, these are uniform over large regions, thus allowing the formation of extended arrays of localized excitations for quantum technology.
△ Less
Submitted 4 August, 2021;
originally announced August 2021.
-
Wake-foil interactions and energy harvesting efficiency in tandem oscillating foils
Authors:
Bernardo Luiz R. Ribeiro,
Yunxing Su,
Quentin Guillaumin,
Kenneth S. Breuer,
Jennifer A. Franck
Abstract:
Oscillating foils in synchronized pitch/heave motions can be used to harvest hydrokinetic energy. By understanding the wake structure and its correlation with the foil kinematics, predictive models for how foils can operate in array configurations can be developed. To establish a relationship between foil kinematics and wake characteristics, a wide range of kinematics is explored in a two-foil tan…
▽ More
Oscillating foils in synchronized pitch/heave motions can be used to harvest hydrokinetic energy. By understanding the wake structure and its correlation with the foil kinematics, predictive models for how foils can operate in array configurations can be developed. To establish a relationship between foil kinematics and wake characteristics, a wide range of kinematics is explored in a two-foil tandem configuration with interfoil spacing from four to nine chord lengths separation and multiple interfoil phases. Using data from experiments and simulations, an in-depth wake analysis is performed and the mean velocity and the turbulent kinetic energy are quantified in the wake. With this energy quantification, the trailing foil efficiency is modified to account for the mean flow in addition to the energy transported by the coherent leading edge vortices (LEVs) shed from the leading foil. With the mean wake velocity, a predictive wake model is able to distinguish three regimes through analyzing trailing foil efficiency profiles and the strength of the primary LEV shed from the leading foil. Dividing the wake into regimes is an insightful way to narrow the range of foil kinematics and configurations and improve the energy harvesting in a two-tandem foil array.
△ Less
Submitted 27 July, 2021; v1 submitted 10 March, 2021;
originally announced March 2021.
-
Fluid dynamics in the warp drive spacetime geometry
Authors:
Osvaldo L. Santos-Pereira,
Everton M. C. Abreu,
Marcelo B. Ribeiro
Abstract:
The Alcubierre warp drive metric is a spacetime geometry featuring a spacetime distortion, called warp bubble, where a massive particle inside it acquires global superluminal velocities, or warp speeds. This work presents solutions of the Einstein equations for the Alcubierre metric having fluid matter as gravity source. The energy-momentum tensor considered two fluid contents, the perfect fluid a…
▽ More
The Alcubierre warp drive metric is a spacetime geometry featuring a spacetime distortion, called warp bubble, where a massive particle inside it acquires global superluminal velocities, or warp speeds. This work presents solutions of the Einstein equations for the Alcubierre metric having fluid matter as gravity source. The energy-momentum tensor considered two fluid contents, the perfect fluid and the parametrized perfect fluid (PPF), a tentative more flexible model whose aim is to explore the possibilities of warp drive solutions with positive matter density content. Santos-Pereira et al. (2020; arXiv:2008.06560) have already showed that the Alcubierre metric having dust as source connects this geometry to the Burgers equation, which describes shock waves moving through an inviscid fluid, but led the solutions back to vacuum. The same happened for two out of four solutions subcases for the perfect fluid. Other solutions for the perfect fluid indicate the possibility of warp drive with positive matter density, but at the cost of a complex solution for the warp drive regulating function. Regarding the PPF, solutions were also obtained indicating that warp speeds could be created with positive matter density. Weak, dominant, strong and null energy conditions were calculated for all studied subcases, being satisfied for the perfect fluid and creating constraints in the PPF quantities such that positive matter density is also possible for creating a warp bubble. Summing up all results,energy-momentum tensors describing more complex forms of matter, or field, distributions generate solutions for the Einstein equations with the warp drive metric where negative matter density might not be a strict precondition for attaining warp speeds.
△ Less
Submitted 8 February, 2021; v1 submitted 27 January, 2021;
originally announced January 2021.
-
Front-end control system and precise threshold configuration of the v-Angra experiment
Authors:
Mariana L Migliorini,
Antonio Fernandes Jr,
Joao C Anjos,
Pietro Chimenti,
Igor A Costa,
Luis F G Gonzalez,
Germano P Guedes,
Ernesto Kemp,
Herman P Lima Jr,
Guilherme S P Lopes,
Amaro S Lopes Jr,
Rafael A Nobrega,
Igor F Pains,
Iuri M Pepe,
Dion B S Ribeiro,
David M Souza
Abstract:
The v-Angra experiment aims to estimate the flux of antineutrino particles coming out from the Angra II nuclear reactor. Such flux is proportional to the thermal power released in the fission process and therefore can be used to infer the quantity of fuel that has been burned during a certain period. To do so, the v-Angra Collaboration has developed an antineutrino detector and a complete acquisit…
▽ More
The v-Angra experiment aims to estimate the flux of antineutrino particles coming out from the Angra II nuclear reactor. Such flux is proportional to the thermal power released in the fission process and therefore can be used to infer the quantity of fuel that has been burned during a certain period. To do so, the v-Angra Collaboration has developed an antineutrino detector and a complete acquisition system to readout and store the signals generated by its sensors. The entire detection system has been installed inside a container laboratory placed beside the dome of the nuclear reactor, in a restricted zone of the Angra II site. The system is supposed to work standalone for a few years in order to collect enough data so that the experiment can be validated. The detector's readout electronics and its environmental conditions are crucial parts of the experiment and they should work autonomously and be controlled and monitored remotely. Additionally, threshold configuration is a central issue of the experiment since antineutrino particles produce low energy signals in the detector, being necessary to carefully adjust it for all the detector channels in order to make the system capable of detecting signals as low as those generated by single photons. To this end, an embedded system was developed and integrated to the detection apparatus installed in the container at the Angra II site and is now operational and accessible to the v-Angra Collaboration. This article aims at describing the proposed embedded system and presenting the results obtained during its commissioning phase.
△ Less
Submitted 21 July, 2020;
originally announced July 2020.
-
Electrodialytic removal of tungsten and arsenic from secondary mine resources: Deep eutectic solvents enhancement
Authors:
Joana Almeida,
Rita Craveiro,
Paulina Faria,
Antonio Santos Silva,
Eduardo Mateus,
Susana Barreiros,
Alexandre Paiva,
Alexandra Branco Ribeiro
Abstract:
Tungsten is a critical raw material for European and U.S. economies. Tungsten mine residues, usually considered an environmental burden due to e.g. arsenic content, are also secondary tungsten resources. The electrodialytic (ED) process and deep eutectic solvents (DES) have been successfully and independently applied for the extraction of metals fromdifferent complex environmentalmatrices. In this…
▽ More
Tungsten is a critical raw material for European and U.S. economies. Tungsten mine residues, usually considered an environmental burden due to e.g. arsenic content, are also secondary tungsten resources. The electrodialytic (ED) process and deep eutectic solvents (DES) have been successfully and independently applied for the extraction of metals fromdifferent complex environmentalmatrices. In this study a proof of concept demonstrates that coupling DES in a two-compartment ED set-up enhances the removal and separation of arsenic and tungsten from Panasqueira mine secondary resources. Choline chloride with malonic acid (1:2), and choline chloride with oxalic acid (1:1) were the DES that in batch extracted the average maximum contents of arsenic (16%) and tungsten (9%) from the residues. However, when ED was operated at a current intensity of 100 mA for 4 days, the extraction yields increased 22% for arsenic and 11% for tungsten, comparing to the tests with no current. From the total arsenic and tungsten extracted, 82% and 77% respectively were successfully removed from the matrix compartment, as they electromigrated to the anolyte compartment, from where these elements can be further separated. This achievement potentiates circular economy, as the final treated residue could be incorporated in construction materials production, mitigating current environmental problems in both mining and construction sectors.
△ Less
Submitted 14 February, 2020;
originally announced April 2020.
-
Exploring hydrogen production for self-energy generation in electroremediation: A proof of concept
Authors:
C. Magroa,
J. Almeidaa,
J. M. Paz-Garcia,
E. P. Mateus,
A. B. Ribeiro
Abstract:
Electrodialytic technologies are clean up processes based on the application of a low-level electrical current to produce electrolysis reactions and the consequent electrochemically induced transport of contaminants. These treatments inherently produce electrolytic hydrogen, an energy carrier, at the cathode compartment, in addition to other cathode reactions. However, exploring this by-product fo…
▽ More
Electrodialytic technologies are clean up processes based on the application of a low-level electrical current to produce electrolysis reactions and the consequent electrochemically induced transport of contaminants. These treatments inherently produce electrolytic hydrogen, an energy carrier, at the cathode compartment, in addition to other cathode reactions. However, exploring this by-product for self energy generation in electroremediation has never been researched. In this work we present the study of hydrogen production during the electrodialytic treatment of three different environmental matrices.
△ Less
Submitted 14 February, 2020;
originally announced February 2020.
-
Emerging organic contaminants in wastewater: Understanding electrochemical reactors for triclosan and its by-products degradation
Authors:
Catia Magro,
Eduardo P. Mateus,
Juan M. Paz-Garcia,
Alexandra B. Ribeiro
Abstract:
Degradation technologies applied to emerging organic contaminants from human activities are one of the major water challenges in the contamination legacy. Triclosan is an emerging contaminant, commonly used as antibacterial agent in personal care products. Triclosan is stable, lipophilic and it is proved to have ecotoxicologic effects in organics. This induces great concern since its elimination i…
▽ More
Degradation technologies applied to emerging organic contaminants from human activities are one of the major water challenges in the contamination legacy. Triclosan is an emerging contaminant, commonly used as antibacterial agent in personal care products. Triclosan is stable, lipophilic and it is proved to have ecotoxicologic effects in organics. This induces great concern since its elimination in wastewater treatment plants is not efficient and its by-products (e.g. methyl-triclosan, 2,4-dichlorophenol or 2,4,6-trichlorophenol) are even more hazardous to several environmental compartments. This work provides understanding of two different electrochemical reactors for the degradation of triclosan and its derivative by-products in effluent. A batch reactor and a flow reactor (mimicking a secondary settling tank in a wastewater treatment plant) were tested with two different working anodes: Ti/MMO and Nb/BDD. The degradation efficiency and kinetics were evaluated to find the best combination of current density, electrodes and set-up design. For both reactors the best electrode combination was achieved with Ti/MMO as anode. The batch reactor at 7 mA/cm2 during 4 h attained degradation rates below the detection limit for triclosan and 2,4,6-trichlorophenol and, 94% and 43% for 2,4-dichlorophenol and methyl triclosan, respectively. The flow reactor obtained, in approximately 1 h, degradation efficiencies between 41% and 87% for the four contaminants. This study suggests an alternative technology for emerging organic contaminants degradation, since the combination of a low current density with the flow and matrix induced disturbance increases and speeds up the compounds elimination in a real environmental matrix.
△ Less
Submitted 13 February, 2020;
originally announced February 2020.
-
Brazilian Report on Safeguards Application of Reactor Neutrinos
Authors:
E. Kemp,
J. A. M. Alfonzo,
J. C. Anjos,
G. Cernicchiaro,
P. Chimenti,
I. A. Costa,
P. C. M. A. Farias,
A. Fernandes Jr.,
G. P. Guedes,
L. F. G. Gonzalez,
H. P. Lima Jr.,
A. S. Lopes Jr.,
J. Marcelo,
M. L. Migliorini,
R. A. Nóbrega,
I. M. Pepe,
D. B. S. Ribeiro,
W. V. Santos,
D. M. Souza,
L. R. Teixeira,
A. M. Trzeciak
Abstract:
The Neutrinos Angra Experiment is a water-based Cherenkov detector located in the Angra dos Reis nuclear power plant. The experiment has completed a major step by finishing the commissioning of the detector and the data acquisition system at the experimental site. The experiment was designed to detect the electron antineutrinos produced by the nuclear reactor with the main purpose to demonstrate t…
▽ More
The Neutrinos Angra Experiment is a water-based Cherenkov detector located in the Angra dos Reis nuclear power plant. The experiment has completed a major step by finishing the commissioning of the detector and the data acquisition system at the experimental site. The experiment was designed to detect the electron antineutrinos produced by the nuclear reactor with the main purpose to demonstrate the feasibility of monitoring the reactor activity using an antineutrino detector. This effort is within the context of the International Atomic Energy Agency (IAEA) program to identify potential and novel technologies that can be applied for non-proliferation safeguards. Challenges, such as operating at the surface, therefore with huge noise rates, and the need to build very sensitive but small-scale detectors, make the Angra experiment an excellent platform for developing the application itself, as well as acquiring expertise in new technologies and analysis methods. In this report, we describe the main detector features and the electronics chain (front-end and data acquisition). We also report preliminary physics results obtained from the commissioning phase data. Finally, we address conclusions regarding the future perspectives to keep this program active, due to its importance in the insertion of Latin-American scientists and engineers in a world-scale cutting edge scientific program.
△ Less
Submitted 20 December, 2019;
originally announced December 2019.
-
Neutrinos Angra experiment: commissioning and first operational measurements
Authors:
H. P. Lima Jr,
J. A. M. Alfonzo,
J. C. Anjos,
G. Cernicchiaro,
P. Chimenti,
I. A. Costa,
M. P. Dias,
P. C. M. A. Farias,
A. Fernandes Junior,
G. P. Guedes,
L. F. G. Gonzalez,
E. Kemp,
G. S. Lopes,
J. Marcelo,
M. L. Migliorini,
R. A. Nobrega,
I. M. Pepe,
D. B. S. Ribeiro,
D. M. Souza,
L. R. Teixeira
Abstract:
The Neutrinos Angra Experiment has completed a major step by finishing the comissioning of the detector and the data acquisition system at the experimental site located in the Angra dos Reis nuclear power plant. The experiment consists of a water-based detector and associated electronics, both designed with the goal of detecting the electron antineutrinos produced by the nuclear reactor. The detec…
▽ More
The Neutrinos Angra Experiment has completed a major step by finishing the comissioning of the detector and the data acquisition system at the experimental site located in the Angra dos Reis nuclear power plant. The experiment consists of a water-based detector and associated electronics, both designed with the goal of detecting the electron antineutrinos produced by the nuclear reactor. The detection is possible due to the Inverse Beta Decay, where the final products in the water are photons in the UV-to-visible range of the spectrum. The assembled detector comprises three active volumes filled with water: (i) a cubic target detector for electron antineutrinos, covered by 32 8-inches PMTs, (ii) a lateral layer surrounding the target (veto) equipped with 4 PMTs and (iii) a third volume covering the top of both, also equipped with 4~PMTs. In the present document the main features of the detector assembly as well as the integration of the readout electronics on-site are reported. Finally, some operational characteristics are shown based on straightforward analysis of the first measurements performed during the last months with the fully working detector.
△ Less
Submitted 22 May, 2019; v1 submitted 30 December, 2018;
originally announced December 2018.
-
Vortex dynamics and Reynolds number effects of an oscillating hydrofoil in energy harvesting mode
Authors:
Bernardo Luiz R. Ribeiro,
Sarah L. Frank,
Jennifer A. Franck
Abstract:
The energy extraction and vortex dynamics from the sinusoidal heaving and pitching motion of an elliptical hydrofoil is explored through large-eddy simulations (LES) at a Reynolds number of $50,000$. The LES is able to capture the time-dependent vortex shedding and dynamic stall properties of the foil as it undergoes high relative angles of attack. Results of the computations are validated against…
▽ More
The energy extraction and vortex dynamics from the sinusoidal heaving and pitching motion of an elliptical hydrofoil is explored through large-eddy simulations (LES) at a Reynolds number of $50,000$. The LES is able to capture the time-dependent vortex shedding and dynamic stall properties of the foil as it undergoes high relative angles of attack. Results of the computations are validated against experimental flume data in terms of power extraction and leading edge vortex (LEV) position and trajectory. The kinematics for optimal efficiency are found in the range of heave amplitude $h_o/c=0.5-1$ and pitch amplitude $θ_o=60^{\circ}-65^{\circ}$ for $fc/U_{\infty}=0.1$ and of $h_o/c=1-1.5$ and $θ_o=75^{\circ}-85^{\circ}$ for $fc/U_{\infty}=0.15$. Direct comparison with low Reynolds number simulations and experiments demonstrate strong agreement in energy harvesting performance between Reynolds numbers of $1000$ to $50,000$, with the high Reynolds number flows demonstrating a moderate $0.8-6.7\%$ increase in power compared to the low Reynolds number flow. In the high Reynolds number flows, the coherent LEV, which is critical for high-efficiency energy conversion, forms earlier and is slightly stronger, resulting in more power extraction. After the LEV is shed from the foil, the LEV trajectory is demonstrated to be relatively independent of Reynolds number, but has a very strong nonlinear dependence with kinematics. It is shown that the LEV trajectories are highly influenced by the heave and pitch amplitudes as well as the oscillation frequency. This has strong implications for arrays of oscillating foils since the coherent LEVs can influence the energy extraction efficiency and performance of downstream foils.
△ Less
Submitted 1 February, 2020; v1 submitted 14 February, 2018;
originally announced February 2018.
-
Efficient modeling of higher-order dependencies in networks: from algorithm to application for anomaly detection
Authors:
Mandana Saebi,
Jian Xu,
Lance M. Kaplan,
Bruno Ribeiro,
Nitesh V. Chawla
Abstract:
Complex systems, represented as dynamic networks, comprise of components that influence each other via direct and/or indirect interactions. Recent research has shown the importance of using Higher-Order Networks (HONs) for modeling and analyzing such complex systems, as the typical Markovian assumption in developing the First Order Network (FON) can be limiting. This higher-order network represent…
▽ More
Complex systems, represented as dynamic networks, comprise of components that influence each other via direct and/or indirect interactions. Recent research has shown the importance of using Higher-Order Networks (HONs) for modeling and analyzing such complex systems, as the typical Markovian assumption in developing the First Order Network (FON) can be limiting. This higher-order network representation not only creates a more accurate representation of the underlying complex system, but also leads to more accurate network analysis. In this paper, we first present a scalable and accurate model, BuildHON+, for higher-order network representation of data derived from a complex system with various orders of dependencies. Then, we show that this higher-order network representation modeled by BuildHON+ is significantly more accurate in identifying anomalies than FON, demonstrating a need for the higher-order network representation and modeling of complex systems for deriving meaningful conclusions.
△ Less
Submitted 11 October, 2020; v1 submitted 27 December, 2017;
originally announced December 2017.
-
Oscillations in the Tsallis income distribution
Authors:
Everton M. C. Abreu,
Newton J. Moura Jr.,
Abner D. Soares,
Marcelo B. Ribeiro
Abstract:
Oscillations in the complementary cumulative distribution function (CCDF) of individual income data have been found in the data of various countries studied by different authors at different time periods, but the dynamical origins of this behavior are currently unknown. Although these datasets can be fitted by different functions at different income ranges, the Tsallis distribution has recently be…
▽ More
Oscillations in the complementary cumulative distribution function (CCDF) of individual income data have been found in the data of various countries studied by different authors at different time periods, but the dynamical origins of this behavior are currently unknown. Although these datasets can be fitted by different functions at different income ranges, the Tsallis distribution has recently been found capable of fitting the whole distribution by means of only two parameters. This procedure showed clearly such oscillatory feature in the entire income range feature, but made it particularly visible at the tail of the distribution. Although log-periodic functions fitted to the data are capable of describing this behavior, a different approach to naturally disclose such oscillatory characteristics is to allow the Tsallis $q$-parameter to become complex. In this paper we use this idea in order to describe the behavior of the CCDF of the Brazilian personal income recently studied empirically by Soares et al.\ (2016). Typical elements of periodic motion, such as amplitude and angular frequency coupled to this income analysis, were obtained by means of this approach. A highly non-linear function for the CCDF was obtained through this methodology and a numerical test showed it capable of recovering the main oscillatory feature of the original CCDF of the personal income data of Brazil.
△ Less
Submitted 14 July, 2019; v1 submitted 21 June, 2017;
originally announced June 2017.
-
Characterizing Directed and Undirected Networks via Multidimensional Walks with Jumps
Authors:
Fabricio Murai,
Bruno Ribeiro,
Don Towsley,
Pinghui Wang
Abstract:
Estimating distributions of node characteristics (labels) such as number of connections or citizenship of users in a social network via edge and node sampling is a vital part of the study of complex networks. Due to its low cost, sampling via a random walk (RW) has been proposed as an attractive solution to this task. Most RW methods assume either that the network is undirected or that walkers can…
▽ More
Estimating distributions of node characteristics (labels) such as number of connections or citizenship of users in a social network via edge and node sampling is a vital part of the study of complex networks. Due to its low cost, sampling via a random walk (RW) has been proposed as an attractive solution to this task. Most RW methods assume either that the network is undirected or that walkers can traverse edges regardless of their direction. Some RW methods have been designed for directed networks where edges coming into a node are not directly observable. In this work, we propose Directed Unbiased Frontier Sampling (DUFS), a sampling method based on a large number of coordinated walkers, each starting from a node chosen uniformly at random. It is applicable to directed networks with invisible incoming edges because it constructs, in real-time, an undirected graph consistent with the walkers trajectories, and due to the use of random jumps which prevent walkers from being trapped. DUFS generalizes previous RW methods and is suited for undirected networks and to directed networks regardless of in-edges visibility. We also propose an improved estimator of node label distributions that combines information from the initial walker locations with subsequent RW observations. We evaluate DUFS, compare it to other RW methods, investigate the impact of its parameters on estimation accuracy and provide practical guidelines for choosing them. In estimating out-degree distributions, DUFS yields significantly better estimates of the head of the distribution than other methods, while matching or exceeding estimation accuracy of the tail. Last, we show that DUFS outperforms uniform node sampling when estimating distributions of node labels of the top 10% largest degree nodes, even when sampling a node uniformly has the same cost as RW steps.
△ Less
Submitted 13 July, 2018; v1 submitted 23 March, 2017;
originally announced March 2017.
-
Brownian regime of finite-N corrections to particle motion in the XY hamiltonian mean field model
Authors:
Bruno V Ribeiro,
Marco A Amato,
Yves Elskens
Abstract:
We study the dynamics of the N-particle system evolving in the XY hamiltonian mean field (HMF) model for a repulsive potential, when no phase transition occurs. Starting from a homogeneous distribution, particles evolve in a mean field created by the interaction with all others. This interaction does not change the homogeneous state of the system, and particle motion is approximately ballistic wit…
▽ More
We study the dynamics of the N-particle system evolving in the XY hamiltonian mean field (HMF) model for a repulsive potential, when no phase transition occurs. Starting from a homogeneous distribution, particles evolve in a mean field created by the interaction with all others. This interaction does not change the homogeneous state of the system, and particle motion is approximately ballistic with small corrections. For initial particle data approaching a waterbag, it is explicitly proved that corrections to the ballistic velocities are in the form of independent brownian noises over a time scale diverging not slower than $N^{2/5}$ as $N \to \infty$, which proves the propagation of molecular chaos. Molecular dynamics simulations of the XY-HMF model confirm our analytical findings.
△ Less
Submitted 18 May, 2016;
originally announced May 2016.
-
Birefringence phenomena revisited
Authors:
Dante D. Pereira,
Baltazar J. Ribeiro,
Bruno Gonçalves
Abstract:
The propagation of electromagnetic waves is investigated in the context of the isotropic and nonlinear dielectric media at rest in the eikonal limit of the geometrical optics. Taking into account the functional dependence $\varepsilon=\varepsilon(E,B)$ and $μ=μ(E,B)$ for the dielectric coefficients, a set of phenomena related to the birefringence of the electromagnetic waves induced by external fi…
▽ More
The propagation of electromagnetic waves is investigated in the context of the isotropic and nonlinear dielectric media at rest in the eikonal limit of the geometrical optics. Taking into account the functional dependence $\varepsilon=\varepsilon(E,B)$ and $μ=μ(E,B)$ for the dielectric coefficients, a set of phenomena related to the birefringence of the electromagnetic waves induced by external fields are derived and discussed. Our results contemplate the known cases already reported in the literature: Kerr, Cotton-Mouton, Jones and magnetoelectric effects. Moreover, new effects are presented here as well as the perspectives of its experimental confirmations.
△ Less
Submitted 6 April, 2016;
originally announced April 2016.
-
Tsallis statistics in the income distribution of Brazil
Authors:
Abner D. Soares,
Newton J. Moura Jr.,
Marcelo B. Ribeiro
Abstract:
This paper discusses the empirical evidence of Tsallis statistical functions in the personal income distribution of Brazil. Yearly samples from 1978 to 2014 were linearized by the q-logarithm and straight lines were fitted to the entire range of the income data in all samples, producing a two-parameters-only single function representation of the whole distribution in every year. The results showed…
▽ More
This paper discusses the empirical evidence of Tsallis statistical functions in the personal income distribution of Brazil. Yearly samples from 1978 to 2014 were linearized by the q-logarithm and straight lines were fitted to the entire range of the income data in all samples, producing a two-parameters-only single function representation of the whole distribution in every year. The results showed that the time evolution of the parameters is periodic and plotting one in terms of the other reveals a cycle mostly clockwise. It was also found that the empirical data oscillate periodically around the fitted straight lines with the amplitude growing as the income values increase. Since the entire income data range can be fitted by a single function, this raises questions on previous results claiming that the income distribution is constituted by a well defined two-classes-base income structure, since such a division in two very distinct income classes might not be an intrinsic property of societies, but a consequence of an a priori fitting-choice procedure that may leave aside possibly important income dynamics at the intermediate levels.
△ Less
Submitted 9 March, 2016; v1 submitted 22 February, 2016;
originally announced February 2016.
-
TribeFlow: Mining & Predicting User Trajectories
Authors:
Flavio Figueiredo,
Bruno Ribeiro,
Jussara Almeida,
Christos Faloutsos
Abstract:
Which song will Smith listen to next? Which restaurant will Alice go to tomorrow? Which product will John click next? These applications have in common the prediction of user trajectories that are in a constant state of flux over a hidden network (e.g. website links, geographic location). What users are doing now may be unrelated to what they will be doing in an hour from now. Mindful of these cha…
▽ More
Which song will Smith listen to next? Which restaurant will Alice go to tomorrow? Which product will John click next? These applications have in common the prediction of user trajectories that are in a constant state of flux over a hidden network (e.g. website links, geographic location). What users are doing now may be unrelated to what they will be doing in an hour from now. Mindful of these challenges we propose TribeFlow, a method designed to cope with the complex challenges of learning personalized predictive models of non-stationary, transient, and time-heterogeneous user trajectories. TribeFlow is a general method that can perform next product recommendation, next song recommendation, next location prediction, and general arbitrary-length user trajectory prediction without domain-specific knowledge. TribeFlow is more accurate and up to 413x faster than top competitors.
△ Less
Submitted 19 February, 2016; v1 submitted 3 November, 2015;
originally announced November 2015.
-
Bayesian Inference of Online Social Network Statistics via Lightweight Random Walk Crawls
Authors:
Konstantin Avrachenkov,
Bruno Ribeiro,
Jithin K. Sreedharan
Abstract:
Online social networks (OSN) contain extensive amount of information about the underlying society that is yet to be explored. One of the most feasible technique to fetch information from OSN, crawling through Application Programming Interface (API) requests, poses serious concerns over the the guarantees of the estimates. In this work, we focus on making reliable statistical inference with limited…
▽ More
Online social networks (OSN) contain extensive amount of information about the underlying society that is yet to be explored. One of the most feasible technique to fetch information from OSN, crawling through Application Programming Interface (API) requests, poses serious concerns over the the guarantees of the estimates. In this work, we focus on making reliable statistical inference with limited API crawls. Based on regenerative properties of the random walks, we propose an unbiased estimator for the aggregated sum of functions over edges and proved the connection between variance of the estimator and spectral gap. In order to facilitate Bayesian inference on the true value of the estimator, we derive the approximate posterior distribution of the estimate. Later the proposed ideas are validated with numerical experiments on inference problems in real-world networks.
△ Less
Submitted 18 December, 2015; v1 submitted 19 October, 2015;
originally announced October 2015.
-
Revisit Behavior in Social Media: The Phoenix-R Model and Discoveries
Authors:
Flavio Figueiredo,
Jussara M. Almeida,
Yasuko Matsubara,
Bruno Ribeiro,
Christos Faloutsos
Abstract:
How many listens will an artist receive on a online radio? How about plays on a YouTube video? How many of these visits are new or returning users? Modeling and mining popularity dynamics of social activity has important implications for researchers, content creators and providers. We here investigate the effect of revisits (successive visits from a single user) on content popularity. Using four d…
▽ More
How many listens will an artist receive on a online radio? How about plays on a YouTube video? How many of these visits are new or returning users? Modeling and mining popularity dynamics of social activity has important implications for researchers, content creators and providers. We here investigate the effect of revisits (successive visits from a single user) on content popularity. Using four datasets of social activity, with up to tens of millions media objects (e.g., YouTube videos, Twitter hashtags or LastFM artists), we show the effect of revisits in the popularity evolution of such objects. Secondly, we propose the Phoenix-R model which captures the popularity dynamics of individual objects. Phoenix-R has the desired properties of being: (1) parsimonious, being based on the minimum description length principle, and achieving lower root mean squared error than state-of-the-art baselines; (2) applicable, the model is effective for predicting future popularity values of objects.
△ Less
Submitted 22 June, 2014; v1 submitted 6 May, 2014;
originally announced May 2014.
-
Efficient Network Generation Under General Preferential Attachment
Authors:
James Atwood,
Bruno Ribeiro,
Don Towsley
Abstract:
Preferential attachment (PA) models of network structure are widely used due to their explanatory power and conceptual simplicity. PA models are able to account for the scale-free degree distributions observed in many real-world large networks through the remarkably simple mechanism of sequentially introducing nodes that attach preferentially to high-degree nodes. The ability to efficiently genera…
▽ More
Preferential attachment (PA) models of network structure are widely used due to their explanatory power and conceptual simplicity. PA models are able to account for the scale-free degree distributions observed in many real-world large networks through the remarkably simple mechanism of sequentially introducing nodes that attach preferentially to high-degree nodes. The ability to efficiently generate instances from PA models is a key asset in understanding both the models themselves and the real networks that they represent. Surprisingly, little attention has been paid to the problem of efficient instance generation. In this paper, we show that the complexity of generating network instances from a PA model depends on the preference function of the model, provide efficient data structures that work under any preference function, and present empirical results from an implementation based on these data structures. We demonstrate that, by indexing growing networks with a simple augmented heap, we can implement a network generator which scales many orders of magnitude beyond existing capabilities ($10^6$ -- $10^8$ nodes). We show the utility of an efficient and general PA network generator by investigating the consequences of varying the preference functions of an existing model. We also provide "quicknet", a freely-available open-source implementation of the methods described in this work.
△ Less
Submitted 20 May, 2014; v1 submitted 18 March, 2014;
originally announced March 2014.
-
Modeling Website Popularity Competition in the Attention-Activity Marketplace
Authors:
Bruno Ribeiro,
Christos Faloutsos
Abstract:
How does a new startup drive the popularity of competing websites into oblivion like Facebook famously did to MySpace? This question is of great interest to academics, technologists, and financial investors alike. In this work we exploit the singular way in which Facebook wiped out the popularity of MySpace, Hi5, Friendster, and Multiply to guide the design of a new popularity competition model. O…
▽ More
How does a new startup drive the popularity of competing websites into oblivion like Facebook famously did to MySpace? This question is of great interest to academics, technologists, and financial investors alike. In this work we exploit the singular way in which Facebook wiped out the popularity of MySpace, Hi5, Friendster, and Multiply to guide the design of a new popularity competition model. Our model provides new insights into what Nobel Laureate Herbert A. Simon called the "marketplace of attention," which we recast as the attention-activity marketplace. Our model design is further substantiated by user-level activity of 250,000 MySpace users obtained between 2004 and 2009. The resulting model not only accurately fits the observed Daily Active Users (DAU) of Facebook and its competitors but also predicts their fate four years into the future.
△ Less
Submitted 31 July, 2014; v1 submitted 3 March, 2014;
originally announced March 2014.
-
On the duration and intensity of cumulative advantage competitions
Authors:
Bo Jiang,
Liyuan Sun,
Daniel R. Figueiredo,
Bruno Ribeiro,
Don Towsley
Abstract:
The role of skill (fitness) and luck (randomness) as driving forces on the dynamics of resource accumulation in a myriad of systems have long puzzled scientists. Fueled by undisputed inequalities that emerge from actual competitions, there is a pressing need for better understanding the effects of skill and luck in resource accumulation. When such competitions are driven by externalities such as c…
▽ More
The role of skill (fitness) and luck (randomness) as driving forces on the dynamics of resource accumulation in a myriad of systems have long puzzled scientists. Fueled by undisputed inequalities that emerge from actual competitions, there is a pressing need for better understanding the effects of skill and luck in resource accumulation. When such competitions are driven by externalities such as cumulative advantage (CA), the rich-get-richer effect, little is known with respect to fundamental properties such as their duration and intensity. In this work we provide a mathematical understanding of how CA exacerbates the role of luck in detriment of skill in simple and well-studied competition models. We show, for instance, that if two agents are competing for resources that arrive sequentially at each time unit, an early stroke of luck can place the less skilled in the lead for an extremely long period of time, a phenomenon we call "struggle of the fittest". In the absence of CA, the more skilled quickly prevails despite any early stroke of luck that the less skilled may have. We prove that duration of a simple skill and luck competition model exhibit power law tails when CA is present, regardless of skill difference, which is in sharp contrast to exponential tails when CA is absent. Our findings have important implications to competitions not only in complex social systems but also in contexts that leverage such models.
△ Less
Submitted 13 December, 2014; v1 submitted 18 February, 2014;
originally announced February 2014.
-
Classifying Latent Infection States in Complex Networks
Authors:
Yeon-sup Lim,
Bruno Ribeiro,
Don Towsley
Abstract:
Algorithms for identifying the infection states of nodes in a network are crucial for understanding and containing infections. Often, however, only a relatively small set of nodes have a known infection state. Moreover, the length of time that each node has been infected is also unknown. This missing data -- infection state of most nodes and infection time of the unobserved infected nodes -- poses…
▽ More
Algorithms for identifying the infection states of nodes in a network are crucial for understanding and containing infections. Often, however, only a relatively small set of nodes have a known infection state. Moreover, the length of time that each node has been infected is also unknown. This missing data -- infection state of most nodes and infection time of the unobserved infected nodes -- poses a challenge to the study of real-world cascades.
In this work, we develop techniques to identify the latent infected nodes in the presence of missing infection time-and-state data. Based on the likely epidemic paths predicted by the simple susceptible-infected epidemic model, we propose a measure (Infection Betweenness) for uncovering these unknown infection states. Our experimental results using machine learning algorithms show that Infection Betweenness is the most effective feature for identifying latent infected nodes.
△ Less
Submitted 31 January, 2014;
originally announced February 2014.
-
Online Dating Recommendations: Matching Markets and Learning Preferences
Authors:
Kun Tu,
Bruno Ribeiro,
Hua Jiang,
Xiaodong Wang,
David Jensen,
Benyuan Liu,
Don Towsley
Abstract:
Recommendation systems for online dating have recently attracted much attention from the research community. In this paper we proposed a two-side matching framework for online dating recommendations and design an LDA model to learn the user preferences from the observed user messaging behavior and user profile features. Experimental results using data from a large online dating website shows that…
▽ More
Recommendation systems for online dating have recently attracted much attention from the research community. In this paper we proposed a two-side matching framework for online dating recommendations and design an LDA model to learn the user preferences from the observed user messaging behavior and user profile features. Experimental results using data from a large online dating website shows that two-sided matching improves significantly the rate of successful matches by as much as 45%. Finally, using simulated matchings we show that the the LDA model can correctly capture user preferences.
△ Less
Submitted 30 January, 2014;
originally announced January 2014.
-
Who is Dating Whom: Characterizing User Behaviors of a Large Online Dating Site
Authors:
Peng Xia,
Kun Tu,
Bruno Ribeiro,
Hua Jiang,
Xiaodong Wang,
Cindy Chen,
Benyuan Liu,
Don Towsley
Abstract:
Online dating sites have become popular platforms for people to look for potential romantic partners. It is important to understand users' dating preferences in order to make better recommendations on potential dates. The message sending and replying actions of a user are strong indicators for what he/she is looking for in a potential date and reflect the user's actual dating preferences. We study…
▽ More
Online dating sites have become popular platforms for people to look for potential romantic partners. It is important to understand users' dating preferences in order to make better recommendations on potential dates. The message sending and replying actions of a user are strong indicators for what he/she is looking for in a potential date and reflect the user's actual dating preferences. We study how users' online dating behaviors correlate with various user attributes using a large real-world dateset from a major online dating site in China. Many of our results on user messaging behavior align with notions in social and evolutionary psychology: males tend to look for younger females while females put more emphasis on the socioeconomic status (e.g., income, education level) of a potential date. In addition, we observe that the geographic distance between two users and the photo count of users play an important role in their dating behaviors. Our results show that it is important to differentiate between users' true preferences and random selection. Some user behaviors in choosing attributes in a potential date may largely be a result of random selection. We also find that both males and females are more likely to reply to users whose attributes come closest to the stated preferences of the receivers, and there is significant discrepancy between a user's stated dating preference and his/her actual online dating behavior. These results can provide valuable guidelines to the design of a recommendation engine for potential dates.
△ Less
Submitted 22 January, 2014;
originally announced January 2014.
-
Practical Characterization of Large Networks Using Neighborhood Information
Authors:
Pinghui Wang,
Bruno Ribeiro,
Junzhou Zhao,
John C. S. Lui,
Don Towsley,
Xiaohong Guan
Abstract:
Characterizing large online social networks (OSNs) through node querying is a challenging task. OSNs often impose severe constraints on the query rate, hence limiting the sample size to a small fraction of the total network. Various ad-hoc subgraph sampling methods have been proposed, but many of them give biased estimates and no theoretical basis on the accuracy. In this work, we focus on develop…
▽ More
Characterizing large online social networks (OSNs) through node querying is a challenging task. OSNs often impose severe constraints on the query rate, hence limiting the sample size to a small fraction of the total network. Various ad-hoc subgraph sampling methods have been proposed, but many of them give biased estimates and no theoretical basis on the accuracy. In this work, we focus on developing sampling methods for OSNs where querying a node also reveals partial structural information about its neighbors. Our methods are optimized for NoSQL graph databases (if the database can be accessed directly), or utilize Web API available on most major OSNs for graph sampling. We show that our sampling method has provable convergence guarantees on being an unbiased estimator, and it is more accurate than current state-of-the-art methods. We characterize metrics such as node label density estimation and edge label density estimation, two of the most fundamental network characteristics from which other network characteristics can be derived. We evaluate our methods on-the-fly over several live networks using their native APIs. Our simulation studies over a variety of offline datasets show that by including neighborhood information, our method drastically (4-fold) reduces the number of samples required to achieve the same estimation accuracy of state-of-the-art methods.
△ Less
Submitted 13 November, 2013;
originally announced November 2013.
-
Cosmologia e Representação
Authors:
Marcelo Byrro Ribeiro
Abstract:
This work presents a brief and non-technical description of the main results and concepts of the modern scientific cosmology, viewing it from an epistemological perspective which allows a dialog with other modes of thinking like e.g. history, philosophy, sociology and religion. This epistemological viewpoint is based on the philosophical theses advanced by Ludwig Boltzmann (1844-1906) which states…
▽ More
This work presents a brief and non-technical description of the main results and concepts of the modern scientific cosmology, viewing it from an epistemological perspective which allows a dialog with other modes of thinking like e.g. history, philosophy, sociology and religion. This epistemological viewpoint is based on the philosophical theses advanced by Ludwig Boltzmann (1844-1906) which states that scientific theories are nothing more than representations, or images, of nature (arXiv:physics/0701308v1). By being representations one cannot know how nature really is because the intrinsic and indispensable properties that characterize nature are unreachable by science. In other words, the true essences that constitute nature are unknowable. Therefore, all answers proposed by science are partial, simplified and replaceable. Another way of putting forward this viewpoint is to state that all scientific truths are provisional, a result which naturally leads to the conclusion that the same set of phenomena, or scientific questions, may have various answers, or representations. This conclusion is generally known as theoretical pluralism (arXiv:physics/9806011). It is exactly such a plurality for conceiving, or representing, nature that opens the way for a possibly fruitful dialog among the various forms of thinking, since this dialog can take place in the realm of the representations. A few examples taken from cosmology, sociology and theology are discussed in the context of this epistemological framework.
△ Less
Submitted 13 August, 2013;
originally announced August 2013.
-
Modeling and Predicting the Growth and Death of Membership-based Websites
Authors:
Bruno Ribeiro
Abstract:
Driven by outstanding success stories of Internet startups such as Facebook and The Huffington Post, recent studies have thoroughly described their growth. These highly visible online success stories, however, overshadow an untold number of similar ventures that fail. The study of website popularity is ultimately incomplete without general mechanisms that can describe both successes and failures.…
▽ More
Driven by outstanding success stories of Internet startups such as Facebook and The Huffington Post, recent studies have thoroughly described their growth. These highly visible online success stories, however, overshadow an untold number of similar ventures that fail. The study of website popularity is ultimately incomplete without general mechanisms that can describe both successes and failures. In this work we present six years of the daily number of users (DAU) of twenty-two membership-based websites - encompassing online social networks, grassroots movements, online forums, and membership-only Internet stores - well balanced between successes and failures. We then propose a combination of reaction-diffusion-decay processes whose resulting equations seem not only to describe well the observed DAU time series but also provide means to roughly predict their evolution. This model allows an approximate automatic DAU-based classification of websites into self-sustainable v.s. unsustainable and whether the startup growth is mostly driven by marketing & media campaigns or word-of-mouth adoptions.
△ Less
Submitted 27 January, 2014; v1 submitted 4 July, 2013;
originally announced July 2013.
-
Efficiently Estimating Motif Statistics of Large Networks
Authors:
Pinghui Wang,
John C. S. Lui,
Bruno Ribeiro,
Don Towsley,
Junzhou Zhao,
Xiaohong Guan
Abstract:
Exploring statistics of locally connected subgraph patterns (also known as network motifs) has helped researchers better understand the structure and function of biological and online social networks (OSNs). Nowadays the massive size of some critical networks -- often stored in already overloaded relational databases -- effectively limits the rate at which nodes and edges can be explored, making i…
▽ More
Exploring statistics of locally connected subgraph patterns (also known as network motifs) has helped researchers better understand the structure and function of biological and online social networks (OSNs). Nowadays the massive size of some critical networks -- often stored in already overloaded relational databases -- effectively limits the rate at which nodes and edges can be explored, making it a challenge to accurately discover subgraph statistics. In this work, we propose sampling methods to accurately estimate subgraph statistics from as few queried nodes as possible. We present sampling algorithms that efficiently and accurately estimate subgraph properties of massive networks. Our algorithms require no pre-computation or complete network topology information. At the same time, we provide theoretical guarantees of convergence. We perform experiments using widely known data sets, and show that for the same accuracy, our algorithms require an order of magnitude less queries (samples) than the current state-of-the-art algorithms.
△ Less
Submitted 27 March, 2014; v1 submitted 22 June, 2013;
originally announced June 2013.
-
Collective modes in free plasmas subjected to a radiation field
Authors:
Bruno Vieira Ribeiro,
Daniel Dourado de A. Santos,
Marco A. Amato
Abstract:
In this study we report the effects of an external electromagnetic field on the collective properties of unmagnetized plasmas. The calculations are carried out in the semi-classical approximation, i.e., the electromagnetic field is treated classically and the electrons from a quantum mechanical viewpoint. The results show that the collective modes are damped away more smoothly and in a smaller fre…
▽ More
In this study we report the effects of an external electromagnetic field on the collective properties of unmagnetized plasmas. The calculations are carried out in the semi-classical approximation, i.e., the electromagnetic field is treated classically and the electrons from a quantum mechanical viewpoint. The results show that the collective modes are damped away more smoothly and in a smaller frequency range than those reported by previous studies. An exponential-like decay of the plasmon frequencies as a function of the external field amplitude is readily observed. We successfully recreate the results of previous studies. We also find that the single photon processes has a pronounced effect on the decrease of the frequency range of modulation.
△ Less
Submitted 18 April, 2013;
originally announced April 2013.
-
Testing the Goodwin growth-cycle macroeconomic dynamics in Brazil
Authors:
N. J. Moura Jr,
Marcelo B. Ribeiro
Abstract:
This paper discusses the empirical validity of Goodwin's (1967) macroeconomic model of growth with cycles by assuming that the individual income distribution of the Brazilian society is described by the Gompertz-Pareto distribution (GPD). This is formed by the combination of the Gompertz curve, representing the overwhelming majority of the population (~99%), with the Pareto power law, representing…
▽ More
This paper discusses the empirical validity of Goodwin's (1967) macroeconomic model of growth with cycles by assuming that the individual income distribution of the Brazilian society is described by the Gompertz-Pareto distribution (GPD). This is formed by the combination of the Gompertz curve, representing the overwhelming majority of the population (~99%), with the Pareto power law, representing the tiny richest part (~1%). In line with Goodwin's original model, we identify the Gompertzian part with the workers and the Paretian component with the class of capitalists. Since the GPD parameters are obtained for each year and the Goodwin macroeconomics is a time evolving model, we use previously determined, and further extended here, Brazilian GPD parameters, as well as unemployment data, to study the time evolution of these quantities in Brazil from 1981 to 2009 by means of the Goodwin dynamics. This is done in the original Goodwin model and an extension advanced by Desai et al. (2006). As far as Brazilian data is concerned, our results show partial qualitative and quantitative agreement with both models in the studied time period, although the original one provides better data fit. Nevertheless, both models fall short of a good empirical agreement as they predict single center cycles which were not found in the data. We discuss the specific points where the Goodwin dynamics must be improved in order to provide a more realistic representation of the dynamics of economic systems.
△ Less
Submitted 29 January, 2013; v1 submitted 6 January, 2013;
originally announced January 2013.
-
Online Myopic Network Covering
Authors:
Konstantin Avrachenkov,
Prithwish Basu,
Giovanni Neglia,
Bruno Ribeiro,
Don Towsley
Abstract:
Efficient marketing or awareness-raising campaigns seek to recruit $n$ influential individuals -- where $n$ is the campaign budget -- that are able to cover a large target audience through their social connections. So far most of the related literature on maximizing this network cover assumes that the social network topology is known. Even in such a case the optimal solution is NP-hard. In practic…
▽ More
Efficient marketing or awareness-raising campaigns seek to recruit $n$ influential individuals -- where $n$ is the campaign budget -- that are able to cover a large target audience through their social connections. So far most of the related literature on maximizing this network cover assumes that the social network topology is known. Even in such a case the optimal solution is NP-hard. In practice, however, the network topology is generally unknown and needs to be discovered on-the-fly. In this work we consider an unknown topology where recruited individuals disclose their social connections (a feature known as {\em one-hop lookahead}). The goal of this work is to provide an efficient greedy online algorithm that recruits individuals as to maximize the size of target audience covered by the campaign.
We propose a new greedy online algorithm, Maximum Expected $d$-Excess Degree (MEED), and provide, to the best of our knowledge, the first detailed theoretical analysis of the cover size of a variety of well known network sampling algorithms on finite networks. Our proposed algorithm greedily maximizes the expected size of the cover. For a class of random power law networks we show that MEED simplifies into a straightforward procedure, which we denote MOD (Maximum Observed Degree). We substantiate our analytical results with extensive simulations and show that MOD significantly outperforms all analyzed myopic algorithms. We note that performance may be further improved if the node degree distribution is known or can be estimated online during the campaign.
△ Less
Submitted 20 December, 2012;
originally announced December 2012.
-
Quantifying the effect of temporal resolution on time-varying networks
Authors:
Bruno Ribeiro,
Nicola Perra,
Andrea Baronchelli
Abstract:
Time-varying networks describe a wide array of systems whose constituents and interactions evolve over time. They are defined by an ordered stream of interactions between nodes, yet they are often represented in terms of a sequence of static networks, each aggregating all edges and nodes present in a time interval of size Δt. In this work we quantify the impact of an arbitrary Δt on the descriptio…
▽ More
Time-varying networks describe a wide array of systems whose constituents and interactions evolve over time. They are defined by an ordered stream of interactions between nodes, yet they are often represented in terms of a sequence of static networks, each aggregating all edges and nodes present in a time interval of size Δt. In this work we quantify the impact of an arbitrary Δt on the description of a dynamical process taking place upon a time-varying network. We focus on the elementary random walk, and put forth a simple mathematical framework that well describes the behavior observed on real datasets. The analytical description of the bias introduced by time integrating techniques represents a step forward in the correct characterization of dynamical processes on time-varying graphs.
△ Less
Submitted 22 October, 2013; v1 submitted 29 November, 2012;
originally announced November 2012.
-
Multiple Random Walks to Uncover Short Paths in Power Law Networks
Authors:
Bruno Ribeiro,
Prithwish Basu,
Don Towsley
Abstract:
Consider the following routing problem in the context of a large scale network $G$, with particular interest paid to power law networks, although our results do not assume a particular degree distribution. A small number of nodes want to exchange messages and are looking for short paths on $G$. These nodes do not have access to the topology of $G$ but are allowed to crawl the network within a limi…
▽ More
Consider the following routing problem in the context of a large scale network $G$, with particular interest paid to power law networks, although our results do not assume a particular degree distribution. A small number of nodes want to exchange messages and are looking for short paths on $G$. These nodes do not have access to the topology of $G$ but are allowed to crawl the network within a limited budget. Only crawlers whose sample paths cross are allowed to exchange topological information. In this work we study the use of random walks (RWs) to crawl $G$. We show that the ability of RWs to find short paths bears no relation to the paths that they take. Instead, it relies on two properties of RWs on power law networks: 1) RW's ability observe a sizable fraction of the network edges; and 2) an almost certainty that two distinct RW sample paths cross after a small percentage of the nodes have been visited. We show promising simulation results on several real world networks.
△ Less
Submitted 26 May, 2012;
originally announced May 2012.
-
Characterizing Continuous Time Random Walks on Time Varying Graphs
Authors:
Daniel Figueiredo,
Philippe Nain,
Bruno Ribeiro,
Edmundo de Souza e Silva,
Don Towsley
Abstract:
In this paper we study the behavior of a continuous time random walk (CTRW) on a stationary and ergodic time varying dynamic graph. We establish conditions under which the CTRW is a stationary and ergodic process. In general, the stationary distribution of the walker depends on the walker rate and is difficult to characterize. However, we characterize the stationary distribution in the following c…
▽ More
In this paper we study the behavior of a continuous time random walk (CTRW) on a stationary and ergodic time varying dynamic graph. We establish conditions under which the CTRW is a stationary and ergodic process. In general, the stationary distribution of the walker depends on the walker rate and is difficult to characterize. However, we characterize the stationary distribution in the following cases: i) the walker rate is significantly larger or smaller than the rate in which the graph changes (time-scale separation), ii) the walker rate is proportional to the degree of the node that it resides on (coupled dynamics), and iii) the degrees of node belonging to the same connected component are identical (structural constraints). We provide examples that illustrate our theoretical findings.
△ Less
Submitted 2 December, 2012; v1 submitted 24 December, 2011;
originally announced December 2011.
-
The Gompertz-Pareto Income Distribution
Authors:
F. Chami Figueira,
N. J. Moura Jr,
Marcelo B. Ribeiro
Abstract:
This work analyzes the Gompertz-Pareto distribution (GPD) of personal income, formed by the combination of the Gompertz curve, representing the overwhelming majority of the economically less favorable part of the population of a country, and the Pareto power law, which describes its tiny richest part. Equations for the Lorenz curve, Gini coefficient and the percentage share of the Gompertzian part…
▽ More
This work analyzes the Gompertz-Pareto distribution (GPD) of personal income, formed by the combination of the Gompertz curve, representing the overwhelming majority of the economically less favorable part of the population of a country, and the Pareto power law, which describes its tiny richest part. Equations for the Lorenz curve, Gini coefficient and the percentage share of the Gompertzian part relative to the total income are all written in this distribution. We show that only three parameters, determined by linear data fitting, are required for its complete characterization. Consistency checks are carried out using income data of Brazil from 1981 to 2007 and they lead to the conclusion that the GPD is consistent and provides a coherent and simple analytical tool to describe personal income distribution data.
△ Less
Submitted 11 October, 2010;
originally announced October 2010.
-
Evidence for the Gompertz Curve in the Income Distribution of Brazil 1978-2005
Authors:
Newton J. Moura Jr.,
Marcelo B. Ribeiro
Abstract:
This work presents an empirical study of the evolution of the personal income distribution in Brazil. Yearly samples available from 1978 to 2005 were studied and evidence was found that the complementary cumulative distribution of personal income for 99% of the economically less favorable population is well represented by a Gompertz curve of the form $G(x)=\exp [\exp (A-Bx)]$, where $x$ is the n…
▽ More
This work presents an empirical study of the evolution of the personal income distribution in Brazil. Yearly samples available from 1978 to 2005 were studied and evidence was found that the complementary cumulative distribution of personal income for 99% of the economically less favorable population is well represented by a Gompertz curve of the form $G(x)=\exp [\exp (A-Bx)]$, where $x$ is the normalized individual income. The complementary cumulative distribution of the remaining 1% richest part of the population is well represented by a Pareto power law distribution $P(x)= βx^{-α}$. This result means that similarly to other countries, Brazil's income distribution is characterized by a well defined two class system. The parameters $A$, $B$, $α$, $β$ were determined by a mixture of boundary conditions, normalization and fitting methods for every year in the time span of this study. Since the Gompertz curve is characteristic of growth models, its presence here suggests that these patterns in income distribution could be a consequence of the growth dynamics of the underlying economic system. In addition, we found out that the percentage share of both the Gompertzian and Paretian components relative to the total income shows an approximate cycling pattern with periods of about 4 years and whose maximum and minimum peaks in each component alternate at about every 2 years. This finding suggests that the growth dynamics of Brazil's economic system might possibly follow a Goodwin-type class model dynamics based on the application of the Lotka-Volterra equation to economic growth and cycle.
△ Less
Submitted 15 December, 2008;
originally announced December 2008.
-
Theory assessment and reality in Boltzmann's epistemological thinking
Authors:
Marcelo Byrro Ribeiro,
Antonio Augusto Passos Videira
Abstract:
This paper discusses how theories can be assessed within the epistemological viewpoint advanced by the Austrian physicist Ludwig E. Boltzmann. It builds upon, and further develops, the perspective of Boltzmann's thinking as advanced by Ribeiro and Videira (1998, arXiv:physics/9806011). Boltzmann's epistemological viewpoint accepts that reality is real and proposes that reality can be described by…
▽ More
This paper discusses how theories can be assessed within the epistemological viewpoint advanced by the Austrian physicist Ludwig E. Boltzmann. It builds upon, and further develops, the perspective of Boltzmann's thinking as advanced by Ribeiro and Videira (1998, arXiv:physics/9806011). Boltzmann's epistemological viewpoint accepts that reality is real and proposes that reality can be described by different points of view because his main philosophical thesis states that scientific theories are images of Nature. We present the historical context that witnessed the genesis of Boltzmann's ideas and expand Ribeiro and Videira's (Ibid.) perspective by arguing that later in his life Boltzmann realized the insufficiency of his thesis as justification for theoretical pluralism and avoidance of dogmatism. Consequently, his thinking went beyond epistemology, the nature of scientific knowledge, to include realism, the nature of the represented objects.
△ Less
Submitted 19 December, 2022; v1 submitted 26 January, 2007;
originally announced January 2007.
-
Zipf Law for Brazilian Cities
Authors:
Newton J. Moura Jr.,
Marcelo B. Ribeiro
Abstract:
This work studies the Zipf Law for cities in Brazil. Data from censuses of 1970, 1980, 1991 and 2000 were used to select a sample containing only cities with 30,000 inhabitants or more. The results show that the population distribution in Brazilian cities does follow a power law similar to the ones found in other countries. Estimates of the power law exponent were found to be 2.22 +/- 0.34 for t…
▽ More
This work studies the Zipf Law for cities in Brazil. Data from censuses of 1970, 1980, 1991 and 2000 were used to select a sample containing only cities with 30,000 inhabitants or more. The results show that the population distribution in Brazilian cities does follow a power law similar to the ones found in other countries. Estimates of the power law exponent were found to be 2.22 +/- 0.34 for the 1970 and 1980 censuses, and 2.26 +/- 0.11 for censuses of 1991 and 2000. More accurate results were obtained with the maximum likelihood estimator, showing an exponent equal to 2.41 for 1970 and 2.36 for the other three years.
△ Less
Submitted 29 August, 2006; v1 submitted 25 November, 2005;
originally announced November 2005.
-
Dogmatism and Theoretical Pluralism in Modern Cosmology
Authors:
Marcelo B. Ribeiro,
Antonio A. P. Videira
Abstract:
This work discusses the presence of a dogmatic tendency within modern cosmology, and some ideas capable of neutralizing its negative influence. It is verified that warnings about the dangers of dogmatic thinking in cosmology can be found as early as the 1930's, and we discuss the modern appearance of "scientific dogmatism". The solution proposed to counteract such an influence, which is capable…
▽ More
This work discusses the presence of a dogmatic tendency within modern cosmology, and some ideas capable of neutralizing its negative influence. It is verified that warnings about the dangers of dogmatic thinking in cosmology can be found as early as the 1930's, and we discuss the modern appearance of "scientific dogmatism". The solution proposed to counteract such an influence, which is capable of neutralizing this dogmatic tendency, has its origins in the philosophical thinking of the Austrian physicist Ludwig Boltzmann (1844-1906). In particular we use his two main epistemological theses, scientific theories as representations of nature and theoretical pluralism, to show that once they are embodied in the research practice of modern cosmology, there is no longer any reason for dogmatic behaviours.
△ Less
Submitted 8 June, 1998;
originally announced June 1998.