Geometry of statistical manifolds Sample Clauses

Geometry of statistical manifolds. ‌ Consider the parameterized conditional probability distribution πθ(a|s) wholly specified by θ — purposely notated as such since the following analysis will later be applied to stochastic policies in reinforcement learning. Denote S the space of all distributions of this type (e.g. all neural networks of a given structure across the space of θ) and consider θ to be the local coordinate system of S. More accurately, the space under consideration is the quotient space S˜ = S\R where R is defined by the equivalence relation θ ∼ θ′ when πθ(a|s) = πθ′ (a|s) (3.39) i.e. the two parameters specify identical conditional distributions. However, for ease of notation, the space under consideration will simply be denoted S. The divergence between two points θ, θ′ ∈ S is defined by the Kullback-Leibler (KL) divergence between the two distributions they specify as D (π : π′ ) = ∫ π (a|s) log πθ (a|s) KL θ θ θ πθ′ (a|s) Note that the KL-divergence is not symmetric and thus cannot be considered a metric. Choosing DKL as a notion of distance, however, allows for the specification of the Riemannian structure of S. Consider some loss function L(θ). With respect to DKL as the divergence between two coordinates, the steepest descent (natural gradient) direction upon the manifold is F (θ)−1∇L(θ) where F (θ) = ∇θ′ DKL(πθ : πθ′ )|θ=θ′ (3.41) i.e. the Hessian of the KL-divergence at θ. Specifically, the matrix F (θ) is called the F (θ) = Es∼µ,a∼πθ [∇ log πθ(a|s)∇ log πθ(a|s)⊤] (3.42) pre-emptively using the RL notation s ∼ µ to denote the sampling of s from the stationary distribution µ(s). Under certain regularity conditions (for exchanging integration and differntiation), this form of the Fisher matrix can be shown to be equivalent to the Hessian of DKL as follows: F (θ) = ∇θ′ DKL(πθ : πθ′ )|θ=θ′ πθ′ (a|s) θ=θ′ = ∇2 ∫ π (a|s) log πθ (a|s) ds da = − ∫ πθ(a|s) ∇2 log πθ(a|s) ds da (*) = Es∼µ,a∼π [−∇2 log πθ(a|s)] = E − 1 ∇2π (a|s) + E 1 ∇π (a|s)∇π (a|s)⊤ s∼µ,a∼πθ πθ(a|s) s∼µ,a∼πθ πθ(a|s)2

Related to Geometry of statistical manifolds

  • Statistical Sampling Documentation a. A copy of the printout of the random numbers generated by the “Random Numbers” function of the statistical sampling software used by the IRO.‌ b. A description or identification of the statistical sampling software package used by the IRO.‌

  • Usage Statistics The Distributor shall ensure that the Publisher will provide access to both composite system-wide use data and itemized data for the Licensee, the Participating Institutions, individual campuses and labs, on a monthly basis. The statistics shall meet or exceed the most recent project Counting Online Usage of NeTworked Electronic Resources ("COUNTER") Code of Practice Release,3 including but not limited to its provisions on customer confidentiality. When a release of a new COUNTER Code of Practice is issued, the Distributor shall ensure that the Publisher will comply with the implementation time frame specified by COUNTER to provide usage statistics in the new standard format. It is more than desirable that the Standardized Usage Statistics Harvesting Initiative (SUSHI) Protocol4 is available for the Licensee to harvest the statistics.

  • Statistical Data The statistical, industry-related and market-related data included in the Registration Statement, the Sale Preliminary Prospectus, and/or the Prospectus are based on or derived from sources that the Company reasonably and in good faith believes are reliable and accurate, and such data materially agree with the sources from which they are derived.

  • Statistical, Demographic or Market-Related Data All statistical, demographic or market-related data included in the Registration Statement, the Disclosure Package or the Prospectus are based on or derived from sources that the Company believes to be reliable and accurate and all such data included in the Registration Statement, the Disclosure Package or the Prospectus accurately reflects the materials upon which it is based or from which it was derived.

  • Statistics The Parties shall endeavour to promote, in accordance with existing statistical cooperation activities between the Union and ASEAN, the harmonisation of statistical methods and practices including the gathering and dissemination of statistics, thus enabling them to use, on a mutually acceptable basis, statistics on trade in goods and services, foreign direct investment and, more generally, on any other area covered by this Agreement which lends itself to statistical data collection, processing, analysis and dissemination.