行為懲罰

行為懲罰（粵拼：hang4 wai4 cing4 fat6；英文：behavioral punishment），通常就噉叫懲罰，喺心理學同動物行為學上係操作制約（operant conditioning）必然涉及嘅一個概念。最基本上，懲罰係指一啲會「弱化」（punish）帶嚟自己嘅行為嘅結果。

舉個例說明，想像依家韞隻大家鼠入個斯金納箱裏面^[1]，响佢每次撳碌反應槓桿（response lever）嗰時，都會有電流通過佢腳下嘅電網（electrified grid）嚟電佢，令佢有痛嘅感覺；實證嘅研究表明咗，噉做會好快令隻大家鼠唔想再去撳嗰碌反應槓桿－响成個過程當中，「痛」呢個結果係個懲罰，弱化咗「撳反應槓桿」呢樣行為。即係話簡化噉講，懲罰可以想像成「會降低一隻動物做某樣行為嘅動力」嘅後果^[2]^[3]。

對懲罰嘅心理學同神經科學研究有廣泛嘅應用價值：呢啲研究會諗「懲罰嘅過程由邊啲腦區控制」等嘅問題，而呢啲研究會俾人應用喺精神醫學（例：要點樣用藥控制恐懼症呢啲源於懲罰相關功能失靈嘅心理病？）^[4]、人工智能（例：點樣教人工智能程式好似真嘅動物噉透過懲罰嚟學習？）^[5]同埋執法（例：啲刑罰要點樣設計先可以有效噉弱化「做犯法嘢」呢樣行為？）^[6]等嘅多個領域嗰度。

基礎

操作制約（operant conditioning）係心理學同動物行為學上嘅一個概念，泛指一隻動物因為對某個刺激（ $s$ ）起某個反應（ $r$ ）持續噉引致某啲後果，而將 $s$ 同 $r$ 聯想埋一齊（刺激同反應之間嘅關係起變化；Stimulus-Response），通常會導致 $s$ 同 $r$ 之間嘅關係變強或者變弱。喺定義上，會令 $r$ 變強嘅後果係所謂嘅強化（reinforcement），而會令 $r$ 變弱嘅後果就係所謂嘅懲罰^[7]。

當中「弱化」一樣行為指嘅嘢包括^[7]：

令一樣行為發生得冇咁頻密（例：唔再郁手撳掣）；
令一樣行為發生嘅時間短啲（例：撳掣嗰陣對手冇撳住個掣咁耐）；
令一樣行為嘅強度下降（例：撳掣撳得細力啲）；
令一樣行為嘅等待期長啲（例：撳掣撳得冇咁快手）... 等等。

舉個簡單例子說明，想像家吓研究者擺一隻大家鼠喺一個斯金納箱入面，個箱入面有條槓桿（ $s$ ），每當隻大家鼠撳條槓桿（ $r$ ）嗰時，隻大家鼠腳下嘅地板就會有電流通過，電到隻大家鼠覺得痛；實證嘅研究表明咗，噉做會令到隻大家鼠學識「撳槓桿」（ $r$ ）同「痛」（結果）之間有啦掕，跟住會唔再郁手撳條槓桿（ $r$ 受到弱化）－「痛」係一個行為懲罰，會「弱化」隻動物知道會帶嚟痛嘅行為，降低隻動物做嗰樣行為嘅機會率同埋做嗰樣行為嘅強度（例：撳起槓桿上嚟撳得冇咁大力）。行為懲罰呢種現象喺人等嘅多種動物身上都可以輕易觀察得到^[7]。

進化起源

喺最基本上，行為懲罰可以用進化心理學嘅角度嚟諗，分做原級懲罰物（primary punisher）同次級懲罰物（secondary punisher）兩大種：

原級懲罰物係指隻動物唔使學都會避開嘅嘢，例子有痛同獵食者呀噉：而家想像有兩個原始人，第一個原始人天生就識驚（避開）一啲佢知會帶嚟痛楚嘅嘢，另外嗰個原始人冇呢種本能，一定要有人教先至識「唔好做一啲會帶嚟痛楚嘅行為」；假設第啲因素不變，第二個人比較大機率會喺遇到危險嗰陣唔識即刻要走佬而死－生存能力冇咁強；有啲嘢係喺進化史上一路都威脅住人（同第啲動物）生存嘅，搞到現存嘅動物物種冚唪唥都進化到唔使學都識「唔好做會帶嚟呢啲嘢嘅行為」^[8]。不過，一種原級懲罰物嘅懲罰效果（ $v$ ）依然可以受各種因素影響－同一個物種嘅唔同個體之間可以喺「對原級懲罰物嘅反應」上有差異（例：有啲人怕痛啲有啲人冇咁怕痛，即係痛對唔同人嚟講 $v$ 都唔同）^[9]。
次級懲罰物係指一隻動物要學先識要「唔好會帶嚟呢啲嘢嘅行為」嘅懲罰；例如想像隻大家鼠，痛楚對佢嚟講係原級懲罰物；家陣佢俾人擺咗响個斯金納箱裏面，個實驗者每次用電電佢（會令佢痛）打前都俾佢聽一個鐘響嘅聲，令到隻大家鼠將痛同「鐘響」聯想埋一齊，就有可能令到「鐘響」對佢嚟講變成一個懲罰，會一聽到個鐘響就驚，而且避開唔做會帶嚟鐘響嘅行為－鐘響嘅聲成為咗個次級懲罰物^[10]。

懲罰程序

懲罰程序（schedules of reinforcement）係指做出一個懲罰「出現嘅時間」以及「出唔出現」同相應嘅行為之間係咪成可靠嘅關係。

個體差異

數學模型

精神醫學研究

恐懼症

應用行為分析

人工智能研究

睇埋

文獻

Mitchell, J. T., Kimbrel, N. A., Hundt, N. E., Cobb, A. R., Nelson‐Gray, R. O., & Lootens, C. M. (2007). An analysis of reinforcement sensitivity theory and the five‐factor model (PDF). European Journal of Personality: Published for the European Association of Personality Psychology, 21(7), 869-887.

攷

↑ Pineño, O. (2014). ArduiPod Box: A low-cost and open-source Skinner box using an iPod Touch and an Arduino microcontroller (PDF). Behavior research methods, 46(1), 196-205.
↑ Hoffman, M. B., & Goldsmith, T. H. (2003). The biological roots of punishment. Ohio St. J. Crim. L., 1, 627.
↑ Wringe, W. (2016). An expressive theory of punishment. Springer.
↑ Bloom, C. M., Post, R. J., Mazick, J., Blumenthal, B., Doyle, C., Peters, B., ... & Davenport, D. G. (2013). A discriminated conditioned punishment model of phobia (PDF). Neuropsychiatric disease and treatment, 9, 1239.
↑ Lima, G., Cha, M., Jeon, C., & Park, K. (2020). Explaining the punishment gap of ai and robots (PDF). arXiv preprint arXiv:2003.06507.
↑ Appelbaum, P. S. (2005). Law & psychiatry: Behavioral genetics and the punishment of crime. Psychiatric Services, 56(1), 25-27.
↑ ^7.0 ^7.1 ^7.2 Schacter, Daniel L., Daniel T. Gilbert, and Daniel M. Wegner. "B. F. Skinner: The role of reinforcement and Punishment", subsection in: Psychology; Second Edition. New York: Worth, Incorporated, 2011, 278–288.
↑ Six Things We're Born to Fear 互聯網檔案館嘅歸檔，歸檔日期2020年8月5號，.. Discover Magazine.
↑ McNaughton, N., & Corr, P. J. (2004). A two-dimensional neuropsychology of defense: fear/anxiety and defensive distance. Neuroscience & Biobehavioral Reviews, 28(3), 285-305.
↑ Miguez, G., Laborda, M. A., & Miller, R. R. (2014). Classical conditioning and pain: conditioned analgesia and hyperalgesia. Acta psychologica, 145, 10-20.

拎

[1] Pineño, O. (2014). ArduiPod Box: A low-cost and open-source Skinner box using an iPod Touch and an Arduino microcontroller (PDF). Behavior research methods, 46(1), 196-205.

[hoffman2003-2] Hoffman, M. B., & Goldsmith, T. H. (2003). The biological roots of punishment. Ohio St. J. Crim. L., 1, 627.

[3] Wringe, W. (2016). An expressive theory of punishment. Springer.

[4] Bloom, C. M., Post, R. J., Mazick, J., Blumenthal, B., Doyle, C., Peters, B., ... & Davenport, D. G. (2013). A discriminated conditioned punishment model of phobia (PDF). Neuropsychiatric disease and treatment, 9, 1239.

[5] Lima, G., Cha, M., Jeon, C., & Park, K. (2020). Explaining the punishment gap of ai and robots (PDF). arXiv preprint arXiv:2003.06507.

[6] Appelbaum, P. S. (2005). Law & psychiatry: Behavioral genetics and the punishment of crime. Psychiatric Services, 56(1), 25-27.

[schacter2011-7] 7.0 ^7.1 ^7.2 Schacter, Daniel L., Daniel T. Gilbert, and Daniel M. Wegner. "B. F. Skinner: The role of reinforcement and Punishment", subsection in: Psychology; Second Edition. New York: Worth, Incorporated, 2011, 278–288.

[8] Six Things We're Born to Fear 互聯網檔案館嘅歸檔，歸檔日期2020年8月5號，.. Discover Magazine.

[9] McNaughton, N., & Corr, P. J. (2004). A two-dimensional neuropsychology of defense: fear/anxiety and defensive distance. Neuroscience & Biobehavioral Reviews, 28(3), 285-305.

[10] Miguez, G., Laborda, M. A., & Miller, R. R. (2014). Classical conditioning and pain: conditioned analgesia and hyperalgesia. Acta psychologica, 145, 10-20.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]