ctlab - HackMD

開會討論
$$r_1=-(d-d_{prev})$$ $$r_2 =Rot_{22}-1.0$$ 11/20開會內容動機手臂(DoF 7)抓取物體 4 FC: 64->32->16->7 目前可假定燒杯位置資料已知視覺辨識部分直接使用朱宏國老師現有已train好的model就好
 Kelvin Yang changed 5 years agoView mode Like Bookmark
reward design
old reward policy preprocess $d = \text{goal_position} - \text{endpoint_position}$ $d *= \text{self.edge}$ init $r = -||d||$ bonus
kai2679 changed 5 years agoView mode Like Bookmark
最新版本
自己放上來~ 可用於訓練抓取的世界檔案及controller -- https://github.com/CT-Lab/PandaController/tree/world 其他容器的模型(燒杯、錐形瓶、試管) -- https://drive.google.com/drive/folders/1Odx8-J8v4Kq0fxJ86OAcKvfVwAfXLe0I dopamine -- https://github.com/CT-Lab/Dopamine_Webots/tree/Selfdefined/controllers/endpoint2D
kai2679 changed 5 years agoView mode Like Bookmark
維持燒杯水平上移
Ver1 getOrientation() \begin{equation*} R_{3,3} = \begin{pmatrix} R[0] & R[1] & R[2] \ R[3] & R[4] & R[5] \ R[6] & R[7] & R[8] \ \end{pmatrix} \end{equation*}
Kelvin Yang changed 5 years agoView mode Like Bookmark
趨勢觀察
https://imgur.com/a/CP8PFEN constrain len orientation total
石詠太 changed 5 years agoView mode Like Bookmark
supervisorController.py
import numpy as np from deepbots.supervisor.controllers.supervisor_emitter_receiver import SupervisorCSV from PPOAgent import PPOAgent, Transition from utilities import normalizeToRange from ArmUtil import ToArmCoord, PSFunc from CalObs import get_delta_r_and_theta import random as rand # from ikpy.chain import Chain # from ikpy.link import OriginLink, URDFLink class PandaSupervisor(SupervisorCSV):
GuluLingpo changed 5 years agoView mode Like Bookmark
checkConvergence.py
import matplotlib.pyplot as plt number = input("1 for constrain score, 2 for len score, 3 for orientation score, 4 for total score") if number == 1 : fp = open("Episode-constrain-score.txt", "r") print("now showing Episode-constrain-score") elif number == 2 : fp = open("Episode-len-score.txt", "r") print("now showing Episode-len-score")
GuluLingpo changed 5 years agoView mode Like Bookmark
Deepbots攻略手冊
Deepbots官方介紹 https://github.com/aidudezzz/deepbots Deepbots is a simple framework which is used as "middleware" between the free and open-source Cyberbotics' Webots robot simulator and Reinforcement Learning algorithms. When it comes to Reinforcement Learning the OpenAI gym environment has been established as the most used interface between the actual application and the RL algorithm. Deepbots is a framework which follows the OpenAI gym environment interface logic in order to be used by Webots applications. deepbots套件解釋 supervisor.py 從agent得到選擇的action selectedAction, actionProb = agent.work(observation, type_="selectAction")
Kelvin Yang changed 5 years agoView mode Like Bookmark
專題文稿
名稱射月半導體計畫 or 七軸機械手臂智慧控制內容清大(100字以內) 將最新的機械手臂移植到模擬環境中，並研究通訊方法讓我們可以使用 python 控制手臂模型進行機器學習，使手臂可以執行專業動作，並在完成後將其套用到實體手臂，取代人類操作具有高危險性的項目。隨著疫情升溫，各國致力於治療藥品研發。我們欲以Franka Emika Panda機械手臂提升藥品開發效率以及替代研發人員進行危險藥品實驗。我們在Webots建立模擬環境，除逆向運動學，更加入了強化學習，以達到智慧控制與彈性的應用能力。交大(200字以內)
石詠太 changed 5 years agoView mode Like Bookmark
reward design 問題與討論
09/25 要不要試跑看看 basic 版本？ left_bonus_count 的機制會不會有 "手臂拿完五次 bonus reward 之後他就不知道要幹嘛了" 的疑慮因為現在碰到目標沒有分數了當初設計的想法是將 left_bonus_count 的初始值設為 ON_GOAL_FINISH_COUNT ，希望能讓手臂學會越早到達 target 越好（最好還是連續五 step 都在 target 上）想像兩個情況：手臂很快地到達 target 附近，但是卻不連續的碰了 target 五次，然後離開到很遠的地方
 GuluLingpo changed 5 years agoView mode Like Bookmark
筆記型電腦
重灌須知 BIOS SATA硬碟設定為AHCI模式(為了能在usb重灌時讀取到ssd) secure boot disable(使得能簡單更新驅動) 目前狀態： Ubuntu 18.04 Teamviewer: 448 916 289 passwd: 同實驗室電腦passwd
Kelvin Yang changed 5 years agoView mode Like Bookmark
dopamine-rl 3.1.7 Webots 整合
tags: dopamine Pypi: https://pypi.org/project/dopamine-rl/ 使用須知 27/07/2020: Dopamine now runs on TensorFlow 2. However, Dopamine is still written as TensorFlow 1.X code. This means your project may need to explicity disable TensorFlow 2 behaviours with: import tensorflow.compat.v1 as tf tf.disable_v2_behavior()
Kelvin Yang changed 5 years agoView mode Like Bookmark
工作分工
看 paper 並製作 ppt deadline: 8/14（本周五） Security for Distributed Deep Neural Networks Towards Data Confidentiality & Intellectual Property Protection 負責人：蘇晏鋒 A Data Augmentation-based Defense Method Against Adversarial Attacks in Neural Networks 負責人：石詠太 SmartExchange: Trading Higher-cost Memory Storage/Access for Lower-cost Computation 負責人：陳楚翔 DeepPeep: Exploiting Design Ramifications to Decipher the Architecture of Compact DNNs
Kelvin Yang changed 5 years agoView mode Like Bookmark
工作紀錄/陳楚翔
9/5 1.發現能將.STEP轉成.stl的converter:https://www.makexyz.com/convert/step-to-stl 7/17 1.增加夾爪的控制，現在整隻手臂都能活動了。但還沒有boundingObject跟physics，所以應該還不能夾東西 7/9 1.找到可能有用的boundingObjects生成法: 先將匯入的模型取一個DEF名稱
 陳楚翔 changed 5 years agoView mode Like Bookmark
主題方向
tags: 主題方向照顧機器人移動食物到病患附近讓他們可以自主吃。刷油漆機器人刷油漆並避開障礙物如窗戶之類的物體。垃圾分類機器人應用情境垃圾分類，機械手臂的基本上就是把特定種類的垃圾從垃圾堆中挑出，並正確分類。物體判斷用vision完成，每種種類的垃圾都會被放在特定位置。
GuluLingpo changed 5 years agoView mode Like Bookmark
ZigZag: A Memory-Centric Rapid DNN Accelerator Design Space Exploration Framework
Slides Abstract & Introduction For efficient embedded deep learning system Facing resource and power constraints Need co-design between DNN algorithm, memory hierarchy, and dataflow A lot of research has been done towards developing specialized hardware accelerators, but ad-hoc and local-optimal resulting from limited design space exploration DSE: Design Space Exploration 3 main design spaces:
kai2679 changed 5 years agoView mode Like Bookmark
工作紀錄 / 楊鈞愷
前言這裡只記錄了我到目前為止實作的歷程以及想法，還有遭遇的問題與解法，試圖在走過的路上留下足跡，以供日後參考。關於想法與相關資訊可能未經查證，需特別注意。 FRANKA EMIKA Panda: 從 Gazebo 到 Webots 在ROS框架下，多半使用URDF(Unified Robot Description Format)和XACRO這樣的XML文件來描述機器人的模型而Gazebo就是ROS框架與URDF/XACRO主要支援的模擬器使用官方franka_ros github中提供的URDF/XACRO模型檔要如何在Gazebo上模擬可以參考此網誌然而，Webots並不支援URDF/XACRO格式的文件
 kai2679 changed 5 years agoView mode Like Bookmark
碩士
清大資工交大資工成大資工台大資工
 Kelvin Yang changed 5 years agoView mode Like Bookmark
Deepbots
deepbots套件解釋 reward機制 # compute reward here ## do not exceed the limit value if -2.897<newObservation[0] and newObservation[0]<2.897 and\ -1.763<newObservation[1] and newObservation[1]<1.763 and\ -2.8973<newObservation[2] and newObservation[2]<2.8973 and\ -3.072<newObservation[3] and newObservation[3]<-0.0698 and\ -2.8973<newObservation[4] and newObservation[4]<2.8973 and\
Kelvin Yang changed 5 years agoView mode Like Bookmark
工作紀錄 / 石詠太
7/8 嘗試使用網路上找到的 dae 檔在 blender 上轉檔，除了沒辦法直接轉成我們想要的 wrl 檔以外，由於 dae 檔在 blender 開啟後，沒有各部件不會保留他們的相對位置，所以要自己重新拼湊有點麻煩，另外他各個部件又可以拆解成其他各小的零件，小零件一開始是沒有組合在一起的，所以組裝時會很容易出現問題。完成 gazebo 筆記 5。研究 webots 合成關節。 7/9 在網路上研究各式 Franka Emika 的資料，希望可以找到適合用來建模的資訊。使用 gazebo 版的模型找機械手臂的相關資訊。
石詠太 changed 5 years agoView mode Like Bookmark