after VOEZ launch === <!-- .slide: data-background="#FFDFEF" --> <!-- .slide: data-transition="zoom" --> how to resolve problems of mobile game server development and service maintenance :dizzy: > [name=郭學聰 Hsueh-Tsung Kuo] [time=Fri, 09 Jun 2017] [color=red] --- <!-- .slide: data-transition="convex" --> ## who am I? ![fieliapm](https://pbs.twimg.com/profile_images/591670980021387264/aZAYLRUe_400x400.png) ---- <!-- .slide: data-transition="convex" --> * programmer from Rayark, a game company in Taiwan * backend engineer * usually develop something related to my work in Python, Ruby, Golang, C# * built almost entire VOEZ game server by myself only --- <!-- .slide: data-transition="convex" --> ## VOEZ current status ---- ### spec <!-- .slide: data-transition="convex" --> * rhythm game following after famous titles Cytus and Deemo * profile, save data, achievement, leaderboard... are stored on server * based on Google Cloud Platform ---- <!-- .slide: data-transition="convex" --> ### resource * official website: https://www.rayark.com/g/voez/ * teaser: https://youtu.be/Bh6gQyJHbxI * walkthrough wiki: http://voez.info/ * <small>Apple iTunes: https://itunes.apple.com/jp/app/voez/id1007929736</small> * <small>Google Play: https://play.google.com/store/apps/details?id=com.rayark.valkyrie</small> * <small>Nintendo Switch: https://ec.nintendo.com/JP/ja/titles/70010000000044</small> ---- <!-- .slide: data-transition="convex" --> ### teaser {%youtube Bh6gQyJHbxI %} --- <!-- .slide: data-transition="convex" --> ## outline ---- <!-- .slide: data-transition="convex" --> 3. VOEZ current status ---- <!-- .slide: data-transition="convex" --> 5. HTTP protocol 6. genuine & purchase verification 1. clean leaderboard: signature 2. legal game play: activation 7. stability 1. principle 2. publish order and request order 3. database cache mechanism 4. reliable inter-server request 5. execution resource allocation for request handler and for database ---- <!-- .slide: data-transition="convex" --> 8. performance 1. from database to static file 2. statistics 3. server operation with CDN 4. service downtime and update 9. timezone 1. all about environment variable: TZ 10. conclusion 11. Q&A --- <!-- .slide: data-transition="convex" --> ## HTTP protocol ---- <!-- .slide: data-transition="convex" --> request ``` POST /api/leaderboard/song/add_score HTTP/1.1 Host: voez-api.rayark.net Signature: cHl0aG9uaXN0YQ== { "token": "593a19bc593a19bc", "player": "pythonista", "timestamp": 1496979900, "score": { "total": 999999.0, ...... } } ``` ---- <!-- .slide: data-transition="convex" --> response ``` HTTP/1.1 200 OK Server: nginx/1.9.12 Content-Type: application/json; charset=UTF-8 Content-Length: 22 Cache-Control: public,s-maxage=300 Date: Fri, 09 Jun 2017 10:20:30 GMT { "status": "ok" } ``` ---- <!-- .slide: data-transition="convex" --> API layout 1. gamedata/event/asset download 2. player info setting/getting 3. leaderboard (top/self/friends) 4. start-play authorization & score uploading 5. avatar lottery --- <!-- .slide: data-transition="convex" --> ## genuine ## & ## purchase verification ---- <!-- .slide: data-transition="convex" --> * ~~leaderboard~~ <- fake score * ~~game play~~ <- pirated app & private server ---- <!-- .slide: data-transition="convex" --> ### clean leaderboard: signature - request: send request & attach ==*signature*== <!-- .element: class="fragment" data-fragment-index="1" --> ---- <!-- .slide: data-transition="convex" --> ```sequence client->server: request w/ signature note right of server: verify signature server->client: status ``` ---- <!-- .slide: data-transition="convex" --> ```python= # client side def sign_message(private_key_string, message): private_key = Crypto.PublicKey.RSA.importKey(private_key_string) signer = Crypto.Signature.PKCS1_v1_5.new(private_key) message_hash = Crypto.Hash.SHA256.new(message) signature = signer.sign(message_hash) return base64.standard_b64encode(signature).decode('utf-8') # server side def verify_message(public_key_string, message, signature_b64): public_key = Crypto.PublicKey.RSA.importKey(public_key_string) verifier = Crypto.Signature.PKCS1_v1_5.new(public_key) message_hash = Crypto.Hash.SHA256.new(message) signature = base64.standard_b64decode(signature_b64.encode('utf-8')) return verifier.verify(message_hash, signature) ``` ---- <!-- .slide: data-transition="convex" --> ### legal game play: activation * request: send request & RSA encrypted request hash plus **nonce** (a random variable) * response: send response & attach signature * ==sign response hash plus **nonce**== <!-- .element: class="fragment" data-fragment-index="1" --> * ==client verify signature== <!-- .element: class="fragment" data-fragment-index="2" --> ---- <!-- .slide: data-transition="convex" --> ```sequence client->server: request w/ RSA.encrypt(merge(request.hash, nonce)) note right of server: verify request.hash server->client: response w/ RSA.sign(merge(response.hash, nonce)) note left of client: verify signature ``` ---- <!-- .slide: data-transition="convex" --> ```python= # client side def encrypt_message(public_key_string, message): public_key = Crypto.PublicKey.RSA.importKey(public_key_string) cipher = Crypto.Cipher.PKCS1_OAEP.new(public_key) encrypted_message = cipher.encrypt(message) return base64.standard_b64encode(encrypted_message).decode('utf-8') def encrypt_message_hash_with_nonce(public_key_string, message, nonce): message_hash_digest = Crypto.Hash.SHA256.new(message).digest() # this computation may take too long return encrypt_message(public_key_string, merge(message_hash_digest, nonce)) # server side def decrypt_message(private_key_string, encrypted_message_b64): private_key = Crypto.PublicKey.RSA.importKey(private_key_string) cipher = Crypto.Cipher.PKCS1_OAEP.new(private_key) encrypted_message = base64.standard_b64decode(encrypted_message_b64.encode('utf-8')) return cipher.decrypt(encrypted_message) def verify_encrypted_message_hash_and_extract_nonce(private_key_string, message, encrypted_message_hash_with_nonce_b64): message_hash_with_nonce = decrypt_message(private_key_string, encrypted_message_hash_with_nonce_b64) message_hash_digest = Crypto.Hash.SHA256.new(message).digest() # this computation may take too long (decrypted_message_hash_digest, nonce) = unmerge(message_hash_with_nonce) return (decrypted_message_hash_digest == message_hash_digest, nonce) ``` --- <!-- .slide: data-transition="convex" --> ## stability ---- <!-- .slide: data-transition="convex" --> ### principle * always read & write game data to DB atomically * available DB updating status * before request: * start transaction, not update yet * after request: * success: update completed * fail: not update * caution: * impossible to follow this rule everywhere ---- <!-- .slide: data-transition="convex" --> ### publish order and request order * request order * client read game info A, then B, then C * publish order <!-- .element: class="fragment" data-fragment-index="1" --> * publish game info C, then B, then A * remove game info A, then B, then C * the above principle is suitable for database and online resource distribution <!-- .element: class="fragment" data-fragment-index="2" --> ---- <!-- .slide: data-transition="convex" --> ```python= class Revision(mongo_engine.Document): # read this 1st, publish this 2nd, remove this 1st revision_id = mongo_engine.StringField(required=True) timestamp = mongo_engine.DateTimeField(required=True) name = mongo_engine.StringField(required=True) meta = { 'indexes': [ { 'fields': ['timestamp'], 'unique': True, }, ], } class SongAssetMeta(mongo_engine.Document): # read this 2nd, publish this 1st, remove this 2nd revision_id = mongo_engine.StringField(required=True) song_id = mongo_engine.StringField(required=True) asset_set_checksum = mongo_engine.StringField(required=True) song_cls = mongo_engine.StringField(required=True) song_pack_id = mongo_engine.StringField(required=False) song_title = mongo_engine.StringField(required=True) meta = { 'indexes': [ { 'fields': ['revision_id', 'song_id'], 'unique': True, }, ] } ``` ---- <!-- .slide: data-transition="convex" --> ### database cache mechanism * database in HDD or SSD: slow * Redis or Memcached in RAM: fast ---- <!-- .slide: data-transition="convex" --> * workable mechanism in concurrency environment 1. try to read cache 2. if cache missed, read original data and update data to cache 3. read cache and return data * caution * cache should be set or get atomically * don't build cache from previous cache content * ex: increase & drecrease counter ---- <!-- .slide: data-transition="convex" --> ```python= class SongAssetMetaCache(object): def set_meta(self, revision_id, song_id, asset_set_checksum, song_cls, song_pack_id): meta_key = 'song_asset_meta:%s:%s' % (revision_id, song_id) song_meta = {'asset_set_checksum': asset_set_checksum, 'song_cls': song_cls, 'song_pack_id': song_pack_id} self.strict_redis.hmset(meta_key, song_meta) self.strict_redis.expire(meta_key, self.cache_expire_time) def get_meta(self, revision_id, song_id): return self.strict_redis.hmget(revision_id, song_id) class SongAssetMetaModel(object): def reconstruct_cache(self, revision_id, song_id): try: song_asset_meta = SongAssetMeta.objects.get(revision_id=revision_id, song_id=song_id) except SongAssetMeta.DoesNotExist: SongAssetMeta.delete(revision_id, song_id) else: self.__song_asset_meta_cache.set_meta(song_asset_meta.revision_id, song_asset_meta.song_id, song_asset_meta.asset_set_checksum, song_asset_meta.song_cls, song_asset_meta.song_pack_id) def get_song_asset_meta(self, revision_id, song_id): song_asset_meta = self.__song_asset_meta_cache.get_meta(revision_id, song_id) if song_asset_meta is None: self.reconstruct_cache(revision_id, song_id) song_asset_meta = self.__song_asset_meta_cache.get_meta(revision_id, song_id) return song_asset_meta ``` ---- <!-- .slide: data-transition="convex" --> ### reliable inter-server request * inter server connection and transaction: 1. server A calculate data 2. server A start transaction and write data 3. server A request server B * success: server A finish transaction * fail: server A revert transaction ---- <!-- .slide: data-transition="convex" --> ```python= def consume(player_access_token): failure_count = 0 while True: try: purchase.consume_coin(player_access_token) except (requests.exceptions.RequestException, server_error.ServerError): failure_count += 1 if failure_count >= 3: raise else: break def gacha(player_access_token): selected_avatars = randomly_select_avatars(player_access_token) transaction = begin_appending_avatars_to_player_data(player_access_token, selected_avatars) try: consume(player_access_token) except: revert_appending_avatars_to_player_data(transaction) raise else: finish_appending_avatars_to_player_data(transaction) ``` ---- <!-- .slide: data-transition="convex" --> ### execution resource allocation for request handler and for database * database processing speed per request should be superior to request handler * [x] queue huge requests with load balancer but keep database server reliable ---- <!-- .slide: data-transition="convex" --> ![SINoALICE](https://i.imgur.com/0LZjGBO.jpg) https://i.imgur.com/0LZjGBO.jpg --- <!-- .slide: data-transition="convex" --> ## performance ---- <!-- .slide: data-transition="convex" --> #### Python is so SLOW > :hash: "skip Python code execution > if you can" > [name=Hsueh-Tsung Kuo] [time=Fri, 09 Jun 2017] [color=red] > <!-- .element: class="fragment" data-fragment-index="1" --> ---- <!-- .slide: data-transition="convex" --> ### from database to static file * announce public game info from static file * only per-user data is served from database ---- <!-- .slide: data-transition="convex" --> ```python= def publish_asset_revision(revision): song_list = [process(song) for song in SongAssetMeta.objects(revision_id=revision.id)] song_list_json = json.dumps(song_list, ensure_ascii=False, default=bson.json_util.default) location = get_cloud_storage_location() with io.BytesIO(json_data) as fp: cloud_storage.upload_file(location, fp, 'application/json', len(json_data)) ``` ---- <!-- .slide: data-transition="convex" --> ### statistics * what can we do when exporting log to Google BigQuery * data analysis (boring) * find unusual behavior and alert maintainer (help DevOps) * collect & announce seasonal event result (!?) ---- <!-- .slide: data-transition="convex" --> #### collect & announce seasonal event result * run script with specific intervals (using crontab) * run BigQuery and collect result * save result to Google Cloud Storage and make it public ---- <!-- .slide: data-transition="convex" --> ### server operation with CDN ```sequence client->CDN: request 1 Note left of CDN: cache miss CDN-->server: request 1 server-->CDN: response 1 CDN->client: response 1 client->CDN: request 2 Note right of CDN: cache hit CDN->client: response 2 (same as response 1) ``` ---- <!-- .slide: data-transition="convex" --> #### headers to operating with CDN ``` Cache-Control: public,s-maxage=300 Date: Fri, 09 Jun 2017 10:20:30 GMT ``` ---- <!-- .slide: data-transition="convex" --> #### tips when operating with CDN * asset files * different revisions of asset files should be located at individual URLs * URL must contains revision ID or checksum * entry data which lists asset files * attach event end time via **Cache-Control** or **Expires** * CDN will cache contents until event ended * client can use expire time as refresh timer * ex: back to main menu & display updated game event ---- <!-- .slide: data-transition="convex" --> ```python= @app.route('/api/asset/asset_info/<directory>/<file_name>', methods=('GET',)) @cache_control(get_cdn_cache_maxage()) def get_asset_info(directory, file_name): # handle asset info return flask.Response(response=asset_info_json_data, mimetype='application/json') @app.route('/api/asset/song_list', methods=('GET',) template_response_headers() def get_song_list(): remaining_second = get_current_song_list_remaining_second() cache_maxage = min(remaining_second, get_cdn_cache_maxage()) # handle song list response = flask.Response(response=song_list_json_data, mimetype='application/json') add_precise_cache_control_to_headers(response.headers, cache_maxage) return response ``` ---- <!-- .slide: data-transition="convex" --> ```python= def __add_cache_control_to_headers(headers, s_maxage): headers['Cache-Control'] = 'public,s-maxage=%d' % (s_maxage,) def add_precise_cache_control_to_headers(headers, s_maxage): __add_cache_control_to_headers(headers, s_maxage) def template_response_headers(headers={}): def decorator(func): @wraps(func) def decorated_function(*args, **kwargs): flask.g.current_unix_timestamp = time.time() response = flask.make_response(func(*args, **kwargs)) original_headers = response.headers for (header, value) in headers.items(): original_headers.setdefault(header, value) original_headers.setdefault('Date', werkzeug.http.http_date(flask.g.current_unix_timestamp)) return response return decorated_function return decorator def cache_control(s_maxage): headers = {} __add_cache_control_to_headers(headers, s_maxage) return template_response_headers(headers) ``` ---- <!-- .slide: data-transition="convex" --> ### service downtime and update * cachable: GET 200, 203, 300, 301, 302, 307, 410 (when using Google CDN) * downtime: 503 (non-cachable: even if CDN exists, success HTTP response will be sent to client as long as server become alive again) * update: 410 (cachable: CDN will tell client "this path is abandoned" without bothering original server) --- <!-- .slide: data-transition="convex" --> ## timezone ---- <!-- .slide: data-transition="convex" --> ### all about environment variable: TZ :100: the incredible variable: # TZ=Asia/Taipei <!-- .element: class="fragment" data-fragment-index="1" --> ```python= os.environ['TZ'] = 'Europe/Berlin' time.tzset() ``` <!-- .element: class="fragment" data-fragment-index="2" --> ---- <!-- .slide: data-transition="convex" --> ### all about environment variable: TZ * if you want to announce event and calculate login day accumulation belong to localtime, set TZ to timezone to which you want to refer. * if daylight saving time is applied in this timezone, then UTC offset of this timezone will vary, and this variation will automatically follow rules of this timezone or country. ---- <!-- .slide: data-transition="convex" --> ```python= def get_utc_offset_in_second(timestamp): struct_time = time.localtime(timestamp) return calendar.timegm(struct_time)-int(time.mktime(struct_time)) def get_utc_offset_in_timedelta(timestamp): return datetime.datetime.fromtimestamp(timestamp)-datetime.datetime.utcfromtimestamp(timestamp) def get_unix_epoch_day_in_day(timestamp, utc_offset_in_second, local_day_boundary_in_second=None): if local_day_boundary_in_second is None: local_day_boundary_in_second = 0 return (timestamp+utc_offset_in_second-local_day_boundary_in_second)//86400 def get_unix_epoch_day_in_datetime(datetime_obj, utc_offset_in_timedelta, local_day_boundary_in_timedelta=None): if local_day_boundary_in_timedelta is None: local_day_boundary_in_timedelta = datetime.timedelta() return (datetime_obj+utc_offset_in_timedelta-local_day_boundary_in_timedelta).replace(hour=0, minute=0, second=0, microsecond=0) ``` --- <!-- .slide: data-transition="convex" --> ## conclusion ---- <!-- .slide: data-transition="convex" --> > :hash: "don't repeat the same mistakes we did before!" > [name=Hsueh-Tsung Kuo] [time=Fri, 09 Jun 2017] [color=red] ---- <!-- .slide: data-transition="convex" --> ### special thanks * Rayark Inc. * CTO & CIO * VOEZ team * backend team * QA team * customer service team * IT team * other teams * iKala Interactive Media Inc. --- <!-- .slide: data-transition="zoom" --> ## Q&A --- <style> .reveal code { font-size: 12px !important; line-height: 1.2; } body { background-color: Indigo; } .rightpart{ float:right; width:50%; } .leftpart{ margin-right: 50% !important; height:50%; } .reveal section img { background:none; border:none; box-shadow:none; } p.blo { font-size: 50px !important; background:#B6BDBB; border:1px solid silver; display:inline-block; padding:0.5em 0.75em; border-radius: 10px; box-shadow: 5px 5px 5px #666; } p.blo1 { background: #c7c2bb; } p.blo2 { background: #b8c0c8; } p.blo3 { background: #c7cedd; } p.bloT { font-size: 60px !important; background:#B6BDD3; border:1px solid silver; display:inline-block; padding:0.5em 0.75em; border-radius: 8px; box-shadow: 1px 2px 5px #333; } p.bloA { background: #B6BDE3; } p.bloB { background: #E3BDB3; } .slide-number{ margin-bottom:10px !important; width:100%; text-align:center; font-size:25px !important; background-color:transparent !important; } iframe.myclass{ width:100px; height:100px; bottom:0; left:0; position:fixed; border:none; z-index:99999; } h1.raw { color: #fff; background-image: linear-gradient(90deg,#f35626,#feab3a); -webkit-background-clip: text; -webkit-text-fill-color: transparent; animation: hue 5s infinite linear; } @keyframes hue { from { filter: hue-rotate(0deg); } to { filter: hue-rotate(360deg); } } .progress{ height:14px !important; } .progress span{ height:14px !important; background: url("") repeat-x !important; } .progress span:after, .progress span.nyancat{ content: ""; background: url('') !important; width: 34px !important; height: 21px !important; border: none !important; float:right; margin-top:-7px; margin-right:-10px; } </style>
{"metaMigratedAt":"2023-06-14T12:55:43.182Z","metaMigratedFrom":"Content","title":"after VOEZ launch","breaks":true,"contributors":"[]"}
    2794 views