Learning with large models has driven unprecedented advances across diverse areas of machine learning. As a model's size grows, so does its capacity to memorize, or interpolate, the training data. Learning under interpolation presents new challenges and opportunities that are not addressed by classical statistical learning theory. In this thesis, we explore the performance of learning methods in the interpolation regime across various models, including linear models and neural networks. Our primary goal is to understand how data and model characteristics influence the convergence behavior of gradient-based methods such as gradient descent, and to quantify how well these models generalize to new data.
In the first part, we study linear models, the simplest setting in which learning under interpolation can be analyzed. In particular, we consider empirical risk minimization applied to high-dimensional generalized linear models and Gaussian mixture models. Our goal is to characterize the optimal test error of such models in an asymptotic setup where the data dimension is comparable to the number of training samples. By deriving a system of equations that precisely characterizes the test error, we obtain a lower bound on the test error that holds for any convex loss function and any ridge-regularization parameter. We then show the bound is tight by exhibiting a loss function and regularization parameter that achieve it. As a corollary, we approximately quantify the sub-optimality of least squares as a function of the data model.
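As a schematic illustration (the notation here is generic rather than the thesis's exact formulation), the ridge-regularized ERM estimators studied in this part take the form
\[
\hat{w} \in \arg\min_{w \in \mathbb{R}^d} \; \frac{1}{n}\sum_{i=1}^{n} \mathcal{L}\!\left(y_i,\, x_i^\top w\right) + \frac{\lambda}{2}\,\|w\|_2^2,
\]
where $\mathcal{L}$ is a convex loss and $\lambda \ge 0$ is the ridge parameter, analyzed in the proportional asymptotic regime $n, d \to \infty$ with $d/n \to \gamma \in (0, \infty)$.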
Continuing with linear models, we consider adversarial learning in high-dimensional Gaussian mixture models. Adversarial training, based on empirical risk minimization, is currently one of the main approaches for defending against adversarial attacks, i.e., small but targeted modifications of test data that cause misclassification. We derive precise asymptotic expressions for both the standard and the adversarial test error under $\ell_p$-bounded perturbations in a Gaussian mixture model. The resulting exact error formulas reveal the relationship between adversarial and standard errors, as well as the influence of factors such as the over-parameterization ratio, the data model, and the attack budget.
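Schematically, and again in generic notation, adversarial training under $\ell_p$-bounded perturbations of budget $\varepsilon$ solves a min-max problem of the form
\[
\min_{w} \; \frac{1}{n}\sum_{i=1}^{n} \max_{\|\delta_i\|_p \le \varepsilon} \mathcal{L}\!\left(y_i\,(x_i + \delta_i)^\top w\right),
\]
where the inner maximization models the worst-case attack on each training sample.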
In the next part of the thesis, we extend our theoretical investigation to neural networks. Neural networks are known for their ability to memorize even complex datasets, often reaching near-zero training loss under gradient-descent optimization; despite this, they generalize remarkably well to new data. We investigate the generalization error (i.e., the gap between training and test errors) of neural networks trained with the logistic loss. Our main finding is that, under a specific data-separability condition, optimal test loss bounds are achievable with network width only poly-logarithmic in the number of training samples. Moreover, our analysis framework, which is based on algorithmic stability, yields improved generalization bounds and milder width requirements than prior work employing alternative tools such as uniform convergence via Rademacher complexity.
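For context, the standard notion underlying such stability arguments (due to Bousquet and Elisseeff; the thesis's exact variant may differ) is uniform stability: an algorithm $A$ mapping a dataset $S$ to a model $A(S)$ is $\epsilon$-uniformly stable if, for all pairs of datasets $S, S'$ differing in a single sample,
\[
\sup_{z} \; \mathbb{E}_A\!\left[\ell(A(S); z) - \ell(A(S'); z)\right] \le \epsilon,
\]
and $\epsilon$-uniform stability implies that the expected generalization gap is at most $\epsilon$.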
Next, in Chapter 5, we revisit the problem of learning two-layer neural networks in the interpolating regime, focusing on the role of large step sizes in accelerating training. Specifically, we consider the normalized gradient descent (NGD) algorithm, in which the step size is chosen inversely proportional to the loss. NGD has proven effective in accelerating convergence under exponentially-tailed loss functions, such as the exponential and logistic losses, particularly for linear classifiers on separable data. We show that for exponentially-tailed losses and two-layer neural networks, NGD attains a linear rate of convergence of the training loss to the global optimum, provided the iterates find an interpolating model. The analysis relies on a gradient self-boundedness condition and a log-Lipschitzness property that we establish. Additionally, we study the generalization of normalized GD for convex objectives via an algorithmic-stability analysis, proving finite-time generalization bounds which show that it does not overfit during training.
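Concretely, in generic notation, the NGD update on the training loss $L$ reads
\[
w_{t+1} = w_t - \frac{\eta}{L(w_t)}\,\nabla L(w_t),
\]
so the effective step size grows as the loss decays toward zero, which is what enables the acceleration over constant-step-size gradient descent in this setting.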
In the final part, we consider the decentralized learning scenario, in which the data is kept locally by several computing agents that communicate their parameters over a graph. We focus on decentralized learning in overparameterized settings, where models achieve zero training loss, and in particular on the behavior of decentralized gradient descent (DGD) on separable data. We provide new finite-time generalization bounds for DGD, extending prior results that focus predominantly on the centralized setting. Additionally, we develop improved gradient-based methods for decentralized learning with separable data, demonstrating speed-ups of several orders of magnitude over previous methods.
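In generic notation, with $m$ agents and a mixing matrix $W$ supported on the communication graph ($W_{ij} > 0$ only if agents $i$ and $j$ are neighbors), the standard DGD update at agent $i$ is
\[
w_i^{t+1} = \sum_{j=1}^{m} W_{ij}\, w_j^{t} - \eta\, \nabla L_i(w_i^{t}),
\]
where $L_i$ denotes agent $i$'s local empirical loss; each agent averages its neighbors' parameters and then takes a local gradient step.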
These results offer new insights and tools for understanding and improving learning in the interpolation regime across various model architectures and learning paradigms.