Bosh issues at Production
===

Because the Production and 3Staging environments differ, issues that did not occur on 3Staging are recorded in this document.

In the snippets below, lines marked with `+` are the additions.

## Different AZ for instances and volumes

What needs to change if the compute and volume resources are located in different Availability Zones (AZs)?

- variable.yaml

```
  director_name: goc-bosh
  auth_url: https://apigateway.twcc.tw
  api_host: openstack-taichung-default
  project: ENT108161
  connection_options:
    headers:
      x-api-key: 9697d977-5315-48ba-9471-74d6872241fa
  openstack_domain: nchc122
  internal_cidr: 192.168.209.0/24
  internal_gw: 192.168.209.254
  internal_ip: 192.168.209.100
  external_ip: 140.110.157.193
  net_id: e52b46b0-5572-4023-89e9-a715d5c48cf2
  az: tccm
+ ignore_server_availability_zone: true
  instance_type: 02_vCPU_016GB_MEM_100GB_HDD
  default_key_name: boshkey
  default_security_groups:
  - bosh
```

- bosh.yaml

```
  cloud_provider:
    cert: ((mbus_bootstrap_ssl))
    mbus: https://mbus:((mbus_bootstrap_password))@((internal_ip)):6868
    properties:
      agent:
        mbus: https://mbus:((mbus_bootstrap_password))@0.0.0.0:6868
      blobstore:
        path: /var/vcap/micro_bosh/data/cache
        provider: local
  disk_pools:
  - disk_size: 65536
    name: disks
    cloud_properties:
+     availability_zone: nova
  instance_groups:
```

- cpi.yaml

```
  - path: /instance_groups/name=bosh/properties/openstack?
    type: replace
    value:
      auth_url: ((auth_url))
      api_host: ((api_host))
      project: ((project))
      connection_options: ((connection_options))
      domain: ((openstack_domain))
      default_key_name: ((default_key_name))
      default_security_groups: ((default_security_groups))
      human_readable_vm_names: true
+     ignore_server_availability_zone: ((ignore_server_availability_zone))
      use_dhcp: false
  ...
  - path: /cloud_provider/properties/openstack?
    type: replace
    value:
      auth_url: ((auth_url))
      api_host: ((api_host))
      project: ((project))
      connection_options: ((connection_options))
      domain: ((openstack_domain))
      default_key_name: ((default_key_name))
      default_security_groups: ((default_security_groups))
+     ignore_server_availability_zone: ((ignore_server_availability_zone))
      human_readable_vm_names: true
      use_dhcp: false
```

- cloud-config

```
  disk_types:
  - name: default
    disk_size: 3000
+   cloud_properties:
+     availability_zone: nova
  - name: large
    disk_size: 50_000
+   cloud_properties:
+     availability_zone: nova
```

## Failed to upload stemcell

**Unknown CPI Error**

![](https://i.imgur.com/NfH4tXW.jpg)

Uploading the stemcell takes around **2 minutes**, but the default timeout is 60 seconds.

```
{"time":1617351002,"stage":"Update stemcell","tags":[],"total":5,"task":"Extracting stemcell archive","index":1,"state":"started","progress":0}
{"time":1617351006,"stage":"Update stemcell","tags":[],"total":5,"task":"Extracting stemcell archive","index":1,"state":"finished","progress":100}
...
{"time":1617351148,"stage":"Update stemcell","tags":[],"total":5,"task":"Save stemcell bosh-openstack-kvm-ubuntu-xenial-go_agent-raw/621.57 (e3a06160-4af1-41a6-b1d3-10f0cd0a086b)","index":5,"state":"started","progress":0}
{"time":1617351148,"stage":"Update stemcell","tags":[],"total":5,"task":"Save stemcell bosh-openstack-kvm-ubuntu-xenial-go_agent-raw/621.57 (e3a06160-4af1-41a6-b1d3-10f0cd0a086b)","index":5,"state":"finished","progress":100}
```

We need to raise the default **read_timeout**:

- variable.yaml

```yaml=
  director_name: goc-bosh
  auth_url: https://apigateway.twcc.tw
  api_host: openstack-taichung-default
  project: ENT108161
  connection_options:
+   read_timeout: 360
    headers:
      x-api-key: 9697d977-5315-48ba-9471-74d6872241fa
  openstack_domain: nchc122
  internal_cidr: 192.168.209.0/24
  internal_gw: 192.168.209.254
  internal_ip: 192.168.209.100
  external_ip: 140.110.157.193
  net_id: e52b46b0-5572-4023-89e9-a715d5c48cf2
  az: tccm
  ignore_server_availability_zone: true
  instance_type: 02_vCPU_016GB_MEM_100GB_HDD
  default_key_name: boshkey
  default_security_groups:
  - bosh
```
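
As a rough sanity check on the chosen value, the elapsed time can be read off the `time` fields of the event log shown earlier. The sketch below is illustrative only: the helper name `elapsed_seconds` and the two constants are made up for this example, only the first and last log events are used (the elided ones are not reconstructed), and it assumes the whole task duration is a reasonable upper bound for the slowest single API request.

```python
import json

# First and last events copied verbatim from the task log above.
# The elided middle events are not needed to compute the total duration.
EVENT_LOG = """
{"time":1617351002,"stage":"Update stemcell","tags":[],"total":5,"task":"Extracting stemcell archive","index":1,"state":"started","progress":0}
{"time":1617351148,"stage":"Update stemcell","tags":[],"total":5,"task":"Save stemcell bosh-openstack-kvm-ubuntu-xenial-go_agent-raw/621.57 (e3a06160-4af1-41a6-b1d3-10f0cd0a086b)","index":5,"state":"finished","progress":100}
"""

DEFAULT_READ_TIMEOUT = 60  # seconds, the default stated in this document
NEW_READ_TIMEOUT = 360     # seconds, the value set in variable.yaml


def elapsed_seconds(log_text):
    """Return the seconds between the earliest and latest event timestamps."""
    times = [
        json.loads(line)["time"]
        for line in log_text.splitlines()
        if line.strip()  # skip blank lines around the log
    ]
    return max(times) - min(times)


elapsed = elapsed_seconds(EVENT_LOG)
print(f"stemcell upload took {elapsed} s")
print("exceeds default timeout:", elapsed > DEFAULT_READ_TIMEOUT)
print("fits new timeout:", elapsed <= NEW_READ_TIMEOUT)
```

For the log above this gives 146 seconds, which is over the 60-second default and well inside the new 360-second limit.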