Data Engineering Meetup #1
åºæ¬æ å ±
- æ¥æ
- ã
- éå¬åœ¢åŒ
- çŸå°éå¬
- äŒå Ž
- LINEæ ªåŒäŒç€Ÿ æ°å®¿ãªãã£ã¹
ã€ãã³ãå 容
æŠèŠ
Data Engineering Meetup ã¯ãããŒã¿ãšã³ãžãã¢ãªã³ã°ïŒããŒã¿ã®åéã»ç®¡çã»åŠçã»å¯èŠåãªã©ïŒã«é¢ããæè¡ã®æ å ±äº€æã»å ±æãè¡ãããã®ããŒãã¢ããã€ãã³ãã§ããLINEæ ªåŒäŒç€Ÿã®æå¿ã®ãšã³ãžãã¢ã«ãã£ãŠéå¶ãããŠããŸãã
ã¿ã€ã ããŒãã«
| æé | ã¿ã€ãã« / ç»å£è (æ¬ç§°ç¥) |
|---|---|
| 19:00 - 19:15 | éå Žã»åä» |
| 19:15 - 19:20 | ã€ãã³ã説æ |
| 19:20 - 19:45 | ãSpark 2.4 & 3.0 - What's next -ã æ ªåŒäŒç€Ÿãšãã»ãã£ã»ãã£ã»ããŒã¿ ç¿ç° æµ©èŒ |
| 19:45 - 20:10 | ãæç¶å¯èœãªããŒã¿åºç€ã®ããã®ããŒã¿ã®å€æ§æ§ã«å¯Ÿããåãçµã¿ã æ ªåŒäŒç€Ÿãµã€ããŒãšãŒãžã§ã³ã åæ æç± |
| 20:10 - 20:15 | äŒæ© |
| 20:15 - 20:40 | ãDeep Dive into Spark SQL with Advanced Performance Tuningã Databricks Inc. äžæ° åä¹ |
| 20:40 - 21:05 | ãImproving Spark SQL Performanceã LINEæ ªåŒäŒç€Ÿ åç° åäº |
| 21:05 - 22:00 | æèŠªäŒ |
ã»ãã·ã§ã³å å®¹çŽ¹ä» (æ¬ç§°ç¥)
1) Spark 2.4 & 3.0 - What's next -
æ ªåŒäŒç€Ÿãšãã»ãã£ã»ãã£ã»ããŒã¿ ç¿ç° 浩èŒ
Apache Sparkã¯ããŒãžã§ã³ã¢ãããéããããšã«ãããã©ãŒãã³ã¹ã ãã§ã¯ãªããŠãŒã¶ããªãã£ã®åäžã䞡茪ã§è¡ãããŠããŸããæšå¹Ž11æã«ãªãªãŒã¹ãããSpark 2.4ã§ã¯Kubernetes察å¿ã®åŒ·åãAvroãã©ãŒããããžã®å¯Ÿå¿ãé«é颿°ãå«ããã«ãã€ã³é¢æ°ã®æ¡å ãªã©ãè¡ãããŸããã ãŸã次æã¡ãžã£ãŒã¢ããããŒããšãªã3.0ã§ã¯ãé«é颿°ã®æŽãªãæ¡å ãAIã«é¢é£ãããŠãŒã¹ã±ãŒã¹ãã«ããŒããåãçµã¿ãProject Hydrogenããæ¬æ Œçã«å§åããèŠéãã§ãã æ¬ã»ãã·ã§ã³ã§ã¯ã2.4ã®ã¢ããããŒãããããããã€ã€ã3.0以éã§ã®ã¢ããããŒããæ€èšãããŠããäž»ã ã£ãæ©èœããšã³ãã³ã¹ã¡ã³ããæ»ãæãã§ã玹ä»ããŸãã
2) æç¶å¯èœãªããŒã¿åºç€ã®ããã®ããŒã¿ã®å€æ§æ§ã«å¯Ÿããåãçµã¿
æ ªåŒäŒç€Ÿãµã€ããŒãšãŒãžã§ã³ã åæ æç±
ãµã€ããŒãšãŒãžã§ã³ãã§ã¯AbemaTVãAmebaããã°ãã¯ãããšãã倿§ãªãµãŒãã¹ãæäŸããŠãããããŒã¿æŽ»çšã«ãããŠã¯æ§ã ãªåœ¢åŒã®ããŒã¿ãåŠçããå¿ èŠããããŸããæ¬çºè¡šã§ã¯ãHBaseé¢é£ã·ã¹ãã ã®çµ±åãªã©ãæç¶çã«ããŒã¿åºç€ãéçºã»éçšããŠããããã®ããŒã¿ã®å€æ§æ§ã«å¯Ÿããåãçµã¿ã«ã€ããŠç޹ä»ããŸãã
3) Deep Dive into Spark SQL with Advanced Performance Tuning
Databricks Inc. äžæ° åä¹
Spark SQLã¯Apache Sparkã®ã³ã¢ã¢ãžã¥ãŒã«ã®äžã€ã§ãSQLã䜿ããããAPIã«ããé¢ä¿æŒç®ãã¹ã±ãŒã©ãã«ã§å¹ççã«è¡ãã³ã³ããŒãã³ãã§ããæ§ã ãªããŒã¿ãœãŒã¹(äŸ: Hive, Cassandra, Kafka, Oracleãªã©)ããã¡ã€ã«ãã©ãŒããã(äŸ: Parquet, ORC, CSV, JSONãªã©)ã®ããŒã¿ãåŠçãè§£æããããšãã§ããŸãã æ¬è¬æŒã§ã¯ãSpark SQLã®ã¯ãšãªåŠçã©ã€ããµã€ã¯ã«ã®æè¡ç詳现ã«ã€ããŠè§£èª¬ãããŸãã©ã®ããã«ããã©ãŒãã³ã¹ãã¥ãŒãã³ã°ãããã®ãã玹ä»ããŸãã
4) Improving Spark SQL Performance
LINEæ ªåŒäŒç€Ÿ åç° åäº
LINE ã§ã¯ã "OASIS" ãšãããç¬èªã«éçºããå 補ã®ããŒã¿åæããŒã«ã 2018 幎 4 æããéçšããŠããã LINE ã®å瀟å¡ãããã®ããŒã«äžã§ Spark ã¢ããªã±ãŒã·ã§ã³ (Spark, Spark SQL, PySpark, SparkR) ãæžããŠå®è¡ããããšã§ãæ åœãµãŒãã¹ã®ããŒã¿åæãã¬ããŒãäœæã ETL éçºã»éçšãªã©ãè¡ã£ãŠããŸããå šç€Ÿå¡ãèªç±ã« Spark SQL ã¯ãšãªãæžããŠå®è¡ã§ããç°å¢ã«ãããŠãããŒã¿åºç€ã®ãªãœãŒã¹ãå¹ççã«äœ¿çšãããããã«ããããã«ã¯ãé·æéå®è¡ãããéå¹ççãªã¯ãšãªã®åŠçæ§èœããããŒã«ã»ããŒã¿åºç€åŽã§æ¹åããããšãéèŠã«ãªããŸãããã®çºè¡šã§ã¯ãããŒãã«ã»ããŒãã£ã·ã§ã³ã®çµ±èšæ å ±ã®ååŸããç¬èªã®ã¯ãšãªæé©åã«ãŒã«ã®é©çšã Cost-based Optimizer ã®æ§èœæ€èšŒãªã©ã OASIS ã«ããã Spark SQL ã®æ§èœæ¹åã®åãçµã¿ãã玹ä»ããŸãã
äŒå Ž
LINEæ ªåŒäŒç€Ÿ (æ±äº¬éœæ°å®¿åºæ°å®¿åäžç®1çª6å· JRæ°å®¿ãã©ã€ãã¿ã¯ãŒ åä»:5F)
JRæ°å®¿é§
çŽçµïŒãã©ã€ãã¿ã¯ãŒæ¹æïŒïŒåŒäº¬ç·ãç·æŠæ¬ç·ãäžå€®æ¬ç·ãæ¹åæ°å®¿ã©ã€ã³ãå±±æç·ãæç°ãšã¯ã¹ãã¬ã¹ïŒ
æ°å®¿äžäžç®é§
åŸæ©1åïŒæ±äº¬ã¡ããäžžã®å
ç·ãå¯éœå¿ç·ãéœå¶å°äžéïŒ
ãã¹ã¿æ°å®¿çŽçµ
å ¥é€šæ¹æ³ã»åä»
- æ°å®¿ãã©ã€ãã¿ã¯ãŒ 5Fãšã³ãã©ã³ã¹ã«èšçœ®ããåä»ã§å ¥é€šæç¶ããããŠãã ããããã®é connpass ã®æ¬ã€ãã³ãã§çºè¡ããåä»ç¥šããæç€ºãã ããã
- ã¹ã¿ããããã²ã¹ãã«ãŒããåãåãé ãããšã¬ããŒã¿ãŒã§äŒå Žãšãªã23Fã«ãäžããã ãããã²ã¹ãã«ãŒãã¯ç¡ãããªããããæ³šæãã ãããã€ãã³ãäžã¯éŠããäžããããšãããããããŸãã
- ãåž°ãã®éã«å¿ ãã¹ã¿ããã«è¿åŽé¡ããŸãã
â» 19:15 ãŸã§ã« 5F åä»ã«ãè¶ããã ãããåä»ã®éœåäžããã以éã¯å ¥é€šããã ããªãå ŽåãããããŸãã
åå è²»
ç¡æ
æã¡ç©
connpassã§çºè¡ãããåä»ç¥š
泚æäºé
â» ãã¡ãã®ã€ãã³ãæ å ±ã¯ãå€éšãµã€ãããååŸããæ å ±ãæ²èŒããŠããŸãã
â» æ²èŒã¿ã€ãã³ã°ãæŽæ°é »åºŠã«ãã£ãŠã¯ãæ å ±æäŸå ããŒãžã®å 容ãšå·®ç°ãçºçããŸãã®ã§äºããäºæ¿ãã ããã
â» ææ°æ å ±ã®ç¢ºèªãåå ç³èŸŒæç¶ããã€ãã³ãã«é¢ãããåãåããçã¯æ å ±æäŸå ããŒãžã«ãŠãé¡ãããŸãã

ãåãåãã
é¢é£ããã€ãã³ã

ãç¡æããã¢ã€ã¹ã売ãããšæººãã人ãå¢ãããïŒå æãšçžé¢ãèªã¿è§£ãå ææšè«ã®åºç€ â CVæ¹åã»æææ±ºå®ã«å¹ãããžãã¹æèè¡
2026/04/10(é) éå¬
ãç¡æãMicrosoft 365 Copilotã«ããæ¥åèªååè¶ å ¥é-çæAIæä»£ã®ä»äºã®é²ãæ¹-
2026/04/22(æ°Ž) éå¬
ãç¡æè¬åº§ãBIããŒã«ã§ã¯èº«ã«ã€ããªããæ¥åã«æŽ»ããâæ°å€æèŠâãé€ããããŒã¿å©çšã»æŽ»çšè¶ å ¥éã
2026/04/18(å) éå¬
AIæä»£ã«ãããæ©æ¢°åŠç¿ã®å¿ èŠæ§â 誰ãããæ ¹æ ãã«åºã¥ããŠæææ±ºå®ã§ããæ¹æ³ â
2026/04/07(ç«) éå¬
ãç¡æãâèšèã®ããŒã¿âã§æŠç¥ãã€ããïŒçŸå Žã§äœ¿ããããã¹ããã€ãã³ã°è¶ å ¥é; SNSã»å£ã³ããæŽ»ããå®ååèªç¶èšèªåŠçã»ãããŒ
2026/04/07(ç«) éå¬- TOP
- ã€ãã³ã
- Data Engineering Meetup #1
