自然语言处理_汉语词频表--根据《现代汉语频率词典》输入
- 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
- 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
- 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。
汉语词频表--根据《现代汉语频率词典》输入
数据摘要:
根据《现代汉语频率词典》输入的汉语使用频率统计数据。
中文关键词:
汉语,词频表,频率,词典,输入,
英文关键词:
Chinese,Frequency table,Frequency,Dictionary,Input,
数据格式:
TEXT
数据用途:
统计汉字使用频率,中文信息处理,汉字编码
数据详细介绍:
PlaceLab Couple Dataset
Overview of the PLCouple Dataset
This is about one month of all non-identifying data from a 2.5 month stay in the PlaceLab of a couple. We cannot provide audio and video because it could reveal the identity of the participants.
Sensors included are all the standard PlaceLab wired sensors, described here:
S. S. Intille, K. Larson, E. Munguia Tapia, J. Beaudin, P. Kaushik, J. Nawyn, and R. Rockinson, "Using a live-in laboratory for ubiquitous computing research," in Proceedings of PERVASIVE 2006, vol. LNCS 3968, K. P. Fishkin, B. Schiele, P. Nixon, and A. Quigley, Eds. Berlin Heidelberg: Springer-Verlag, 2006, pp. 349-365.
The mobile stick-on object usage and accelerometer-based sensors are called MITes and are described in this publication:
E. Munguia Tapia, S. S. Intille, L. Lopez, and K. Larson, "The design of a portable kit of wireless sensors for naturalistic data collection," in Proceedings of PERVASIVE 2006, vol. LNCS 3968, K. P. Fishkin, B. Schiele, P. Nixon, and A. Quigley, Eds. Berlin Heidelberg: Springer-Verlag, 2006, pp. 117-134.
The infrared MITes were developed as part of this work at MERL:
C. R. Wren and E. Munguia-Tapia, "Toward Scalable Activity Recognition for Sensor Networks," in Proceedings of The Second International Workshop in Location and Context-Awareness (LoCA '06), vol. 3987 / 2006, M. Hazas, J. Krumm, and T. Strang, Eds. Dublin, Ireland: Springer Berlin / Heidelberg, 2006, pp. 168-185.
RFID tagging is provided using the Intel RFID glove, described in this publication:
Philipose, M., Smith, J.R., Jiang, B., Mamishev, A., Roy, S., Sundara-Rajan, K., "Battery-free wireless identification and sensing." IEEE Pervasive Computing 4(1), 37–45 (2005)
About 100 hours of the data are annotated. The annotation was done using custom annotation software called Handlense [Overview of HandLense and executable]. Only the activity of the male subject was annotated. This paper has details about how the 100 hours of annotation was done:
B. Logan, J. Healey, Matthai Philipose, E. Munguia Tapia, and S. Intille, "A
long-term evaluation of sensing modalities for activity recognition," in Proceedings of the International Conference on Ubiquitious Computing, vol. LNCS 4717. Berlin Heidelberg: Springer-Verlag, 2007, pp. 483–500.
Directory Structure