Geo self-service framework (alpha)
原文:https://docs.gitlab.com/ee/development/geo/framework.html
Geo self-service framework (alpha)
注意:本文档可能会随时更改. 这是我们正在研究的建议,一旦实施完成,此文档将得到更新. 跟随史诗般的进度.注意: Geo 自助服务框架当前处于 Alpha 状态. 如果您需要复制新的数据类型,请与 Geo 小组联系以讨论选项. 您可以在 Slack 的#g_geo
与他们联系,或在问题或合并请求中提及@geo-team
.
Geo 提供了一个 API,使跨 Geo 节点轻松复制数据类型成为可能. 该 API 以 Ruby 域特定语言(DSL)的形式呈现,旨在使创建数据类型的工程师只需花费很少的精力即可复制数据.
Nomenclature
在深入研究 API 之前,开发人员需要了解一些特定于地理位置的命名约定.
Model
模型是活动模型,在整个 Rails 代码库中都是如此. 它通常与数据库表绑定. 从地理角度来看,模型可以具有一个或多个资源.
Resource
资源是属于模型的一条数据,由 GitLab 功能生成. 使用存储机制将其持久化. 默认情况下,资源不可复制.
Data type
Data type is how a resource is stored. Each resource should fit in one of the data types Geo supports: :- Git repository :- Blob :- Database
有关更多详细信息,请参见数据类型 .
Geo Replicable
可复制资源是 Geo 希望在 Geo 节点之间同步的资源. 受支持的可复制数据类型有限. 实现属于已知数据类型之一的资源的复制所需的工作量很小.
Geo Replicator
地理复制器是知道如何复制可复制对象的对象. 它负责::-触发事件(生产者):-消费事件(消费者)
它与 Geo Replicable 数据类型相关. 所有复制器都有一个公共接口,可用于处理(即产生和使用)事件. 它负责主节点(产生事件的地方)和次节点(消耗事件的地方)之间的通信. 想要将 Geo 纳入其功能的工程师将使用复制器的 API 来实现这一目标.
Geo Domain-Specific Language
语法糖使工程师可以轻松指定应复制哪些资源以及如何复制.
Geo Domain-Specific Language
The replicator
首先,您需要编写一个复制器. 复制器位于ee/app/replicators/geo
. 对于每个需要复制的资源,即使多个资源绑定到同一模型,也应指定一个单独的复制器.
例如,以下复制器复制软件包文件:
module Geo
class PackageFileReplicator < Gitlab::Geo::Replicator
# Include one of the strategies your resource needs
include ::Geo::BlobReplicatorStrategy
# Specify the CarrierWave uploader needed by the used strategy
def carrierwave_uploader
model_record.file
end
# Specify the model this replicator belongs to
def self.model
::Packages::PackageFile
end
end
end
类名应该是唯一的. 它还与注册表的表名紧密相关,因此在此示例中,注册表表将为package_file_registry
.
对于不同的数据类型,Geo 支持包括不同的策略. 选择一个适合您的需求.
Linking to a model
要将此复制器绑定到模型,需要在模型代码中添加以下内容:
class Packages::PackageFile < ApplicationRecord
include ::Gitlab::Geo::ReplicableModel
with_replicator Geo::PackageFileReplicator
end
API
设置好后,可以通过模型轻松访问复制器:
package_file = Packages::PackageFile.find(4) # just a random id as example
replicator = package_file.replicator
或者从复制器取回模型:
replicator.model_record
=> <Packages::PackageFile id:4>
复制器可用于生成事件,例如在 ActiveRecord 挂钩中:
after_create_commit -> { replicator.publish_created_event }
Library
所有这些背后的框架位于ee/lib/gitlab/geo/
.
Existing Replicator Strategies
在编写一种新的复制器策略之前,请检查以下内容,以查看现有策略之一是否已经可以处理您的资源. 如果不确定,请咨询地理团队.
Blob Replicator Strategy
使用Geo::BlobReplicatorStrategy
模块,Geo 可以轻松支持使用CarrierWave 的 Uploader::Base
模型.
首先,每个文件应具有其自己的主要 ID 和模型. Geo 强烈建议将每个文件都视为头等公民,因为根据我们的经验,这大大简化了跟踪复制和验证状态.
例如,要添加对具有Widget
widgets
表的Widget
模型引用的文件的支持,您将执行以下步骤:
Replication
在
Widget
类中包含Gitlab::Geo::ReplicableModel
,并使用with_replicator Geo::WidgetReplicator
指定 Replicator 类.此时,
Widget
类应如下所示:# frozen_string_literal: true
class Widget < ApplicationRecord
include ::Gitlab::Geo::ReplicableModel
with_replicator Geo::WidgetReplicator
mount_uploader :file, WidgetUploader
def self.replicables_for_geo_node
# Should be implemented. The idea of the method is to restrict
# the set of synced items depending on synchronization settings
end
...
end
创建
ee/app/replicators/geo/widget_replicator.rb
. 实现#carrierwave_uploader
方法,该方法应返回CarrierWave::Uploader
. 并实现类方法.model
以返回Widget
类.# frozen_string_literal: true
module Geo
class WidgetReplicator < Gitlab::Geo::Replicator
include ::Geo::BlobReplicatorStrategy
def self.model
::Widget
end
def carrierwave_uploader
model_record.file
end
end
end
创建
ee/spec/replicators/geo/widget_replicator_spec.rb
并执行必要的设置,以定义共享示例的model_record
变量.# frozen_string_literal: true
require 'spec_helper'
RSpec.describe Geo::WidgetReplicator do
let(:model_record) { build(:widget) }
it_behaves_like 'a blob replicator'
end
创建
widget_registry
表,以便 Geo 次要对象可以跟踪每个 Widget 文件的同步和验证状态:# frozen_string_literal: true
class CreateWidgetRegistry < ActiveRecord::Migration[6.0]
DOWNTIME = false
disable_ddl_transaction!
def up
unless table_exists?(:widget_registry)
ActiveRecord::Base.transaction do
create_table :widget_registry, id: :bigserial, force: :cascade do |t|
t.integer :widget_id, null: false
t.integer :state, default: 0, null: false, limit: 2
t.integer :retry_count, default: 0, limit: 2
t.text :last_sync_failure
t.datetime_with_timezone :retry_at
t.datetime_with_timezone :last_synced_at
t.datetime_with_timezone :created_at, null: false
t.index :widget_id
t.index :retry_at
t.index :state
end
end
end
add_text_limit :widget_registry, :last_sync_failure, 255
end
def down
drop_table :widget_registry
end
end
Create
ee/app/models/geo/widget_registry.rb
:# frozen_string_literal: true
class Geo::WidgetRegistry < Geo::BaseRegistry
include Geo::ReplicableRegistry
MODEL_CLASS = ::Widget
MODEL_FOREIGN_KEY = :widget_id
belongs_to :widget, class_name: 'Widget'
end
方法
has_create_events?
在大多数情况下应该返回true
. 但是,如果您添加的实体没有创建事件,则根本不要添加该方法.Update
REGISTRY_CLASSES
inee/app/workers/geo/secondary/registry_consistency_worker.rb
.Create
ee/spec/factories/geo/widget_registry.rb
:# frozen_string_literal: true
FactoryBot.define do
factory :geo_widget_registry, class: 'Geo::WidgetRegistry' do
widget
state { Geo::WidgetRegistry.state_value(:pending) }
trait :synced do
state { Geo::WidgetRegistry.state_value(:synced) }
last_synced_at { 5.days.ago }
end
trait :failed do
state { Geo::WidgetRegistry.state_value(:failed) }
last_synced_at { 1.day.ago }
retry_count { 2 }
last_sync_failure { 'Random error' }
end
trait :started do
state { Geo::WidgetRegistry.state_value(:started) }
last_synced_at { 1.day.ago }
retry_count { 0 }
end
end
end
Create
ee/spec/models/geo/widget_registry_spec.rb
:# frozen_string_literal: true
require 'spec_helper'
RSpec.describe Geo::WidgetRegistry, :geo, type: :model do
let_it_be(:registry) { create(:geo_widget_registry) }
specify 'factory is valid' do
expect(registry).to be_valid
end
include_examples 'a Geo framework registry'
describe '.find_registry_differences' do
... # To be implemented
end
end
小部件现在应该由 Geo 复制!
Verification
将验证状态字段添加到
widgets
表中,以便 Geo 主数据库可以跟踪验证状态:# frozen_string_literal: true
class AddVerificationStateToWidgets < ActiveRecord::Migration[6.0]
DOWNTIME = false
def change
add_column :widgets, :verification_retry_at, :datetime_with_timezone
add_column :widgets, :verified_at, :datetime_with_timezone
add_column :widgets, :verification_checksum, :binary, using: 'verification_checksum::bytea'
add_column :widgets, :verification_failure, :string
add_column :widgets, :verification_retry_count, :integer
end
end
在
verification_failure
和verification_checksum
上添加部分索引,以确保可以高效执行重新验证:# frozen_string_literal: true
class AddVerificationFailureIndexToWidgets < ActiveRecord::Migration[6.0]
include Gitlab::Database::MigrationHelpers
DOWNTIME = false
disable_ddl_transaction!
def up
add_concurrent_index :widgets, :verification_failure, where: "(verification_failure IS NOT NULL)", name: "widgets_verification_failure_partial"
add_concurrent_index :widgets, :verification_checksum, where: "(verification_checksum IS NOT NULL)", name: "widgets_verification_checksum_partial"
end
def down
remove_concurrent_index :widgets, :verification_failure
remove_concurrent_index :widgets, :verification_checksum
end
end
要做的事情:在二级服务器上添加验证. 这应作为以下内容的一部分完成:Geo:自助服务框架-包文件验证的首次实现
小部件现在应由 Geo 验证!
Metrics
指标由Geo::MetricsUpdateWorker
收集,保存在GeoNodeStatus
以显示在 UI 中,然后发送给 Prometheus.
- 将字段
widget_count
,widget_checksummed_count
,widget_checksum_failed_count
,widget_synced_count
,widget_failed_count
和widget_registry_count
到ee/app/models/geo_node_status.rb
GeoNodeStatus#RESOURCE_STATUS_FIELDS
数组中. - 将相同的字段添加到
ee/app/models/geo_node_status.rb
GeoNodeStatus#PROMETHEUS_METRICS
哈希中. - 将相同字段添加到
doc/administration/monitoring/prometheus/gitlab_metrics.md
Sidekiq metrics
表中. - 将相同的字段添加到
doc/api/geo_nodes.md
GET /geo_nodes/status
示例响应中. - 将相同的字段添加到
ee/spec/models/geo_node_status_spec.rb
和ee/spec/factories/geo_node_statuses.rb
. Set
widget_count
inGeoNodeStatus#load_data_from_current_node
:self.widget_count = Geo::WidgetReplicator.primary_total_count
添加
GeoNodeStatus#load_widgets_data
来设置widget_synced_count
,widget_failed_count
和widget_registry_count
:def load_widget_data
self.widget_synced_count = Geo::WidgetReplicator.synced_count
self.widget_failed_count = Geo::WidgetReplicator.failed_count
self.widget_registry_count = Geo::WidgetReplicator.registry_count
end
Call
GeoNodeStatus#load_widgets_data
inGeoNodeStatus#load_secondary_data
.Set
widget_checksummed_count
andwidget_checksum_failed_count
inGeoNodeStatus#load_verification_data
:self.widget_checksummed_count = Geo::WidgetReplicator.checksummed_count self.widget_checksum_failed_count = Geo::WidgetReplicator.checksum_failed_count
小部件复制和验证指标现在应该可以在 API,管理区域 UI 和 Prometheus 中使用!
GraphQL API
在
ee/app/graphql/types/geo/geo_node_type.rb
向GeoNodeType
添加一个新字段:field :widget_registries, ::Types::Geo::WidgetRegistryType.connection_type,
null: true,
resolver: ::Resolvers::Geo::WidgetRegistriesResolver,
description: 'Find widget registries on this Geo node',
feature_flag: :geo_self_service_framework
新添加
widget_registries
字段名的expected_fields
在阵列ee/spec/graphql/types/geo/geo_node_type_spec.rb
.Create
ee/app/graphql/resolvers/geo/widget_registries_resolver.rb
:# frozen_string_literal: true
module Resolvers
module Geo
class WidgetRegistriesResolver < BaseResolver
include RegistriesResolver
end
end
end
Create
ee/spec/graphql/resolvers/geo/widget_registries_resolver_spec.rb
:# frozen_string_literal: true
require 'spec_helper'
RSpec.describe Resolvers::Geo::WidgetRegistriesResolver do
it_behaves_like 'a Geo registries resolver', :geo_widget_registry
end
Create
ee/app/finders/geo/widget_registry_finder.rb
:# frozen_string_literal: true
module Geo
class WidgetRegistryFinder
include FrameworkRegistryFinder
end
end
Create
ee/spec/finders/geo/widget_registry_finder_spec.rb
:# frozen_string_literal: true
require 'spec_helper'
RSpec.describe Geo::WidgetRegistryFinder do
it_behaves_like 'a framework registry finder', :geo_widget_registry
end
Create
ee/app/graphql/types/geo/widget_registry_type.rb
:# frozen_string_literal: true
module Types
module Geo
# rubocop:disable Graphql/AuthorizeTypes because it is included
class WidgetRegistryType < BaseObject
include ::Types::Geo::RegistryType
graphql_name 'WidgetRegistry'
description 'Represents the sync and verification state of a widget'
field :widget_id, GraphQL::ID_TYPE, null: false, description: 'ID of the Widget'
end
end
end
Create
ee/spec/graphql/types/geo/widget_registry_type_spec.rb
:# frozen_string_literal: true
require 'spec_helper'
RSpec.describe GitlabSchema.types['WidgetRegistry'] do
it_behaves_like 'a Geo registry type'
it 'has the expected fields (other than those included in RegistryType)' do
expected_fields = %i[widget_id]
expect(described_class).to have_graphql_fields(*expected_fields).at_least
end
end
Add integration tests for providing Widget registry data to the frontend via the GraphQL API, by duplicating and modifying the following shared examples in
ee/spec/requests/api/graphql/geo/registries_spec.rb
:it_behaves_like 'gets registries for', {
field_name: 'widgetRegistries',
registry_class_name: 'WidgetRegistry',
registry_factory: :geo_widget_registry,
registry_foreign_key_field_name: 'widgetId'
}
现在应该可以通过 GraphQL API 获得各个小部件同步和验证数据!
- 注意复制”更新”事件. Geo Framework 目前不支持复制”更新”事件,因为此时添加到框架的所有实体都是不可变的. 如果您要添加的实体属于这种情况,请遵循https://gitlab.com/gitlab-org/gitlab/-/issues/118743和https://gitlab.com/gitlab-org/gitlab /// issues / 118745作为添加新事件类型的示例. 添加通知后,请同时删除它.
Admin UI
要做的事情:这应该作为《 地理手册》的一部分完成:实现自助服务框架可复制的前端
窗口小部件同步和验证数据(总计和个人)现在应该在管理界面中可用!